Category talk:VI
Content problems
(From w:vi:Thảo luận Wikipedia:Guestbook for non-Vietnamese speakers § Beta Wikiversity)
I skimmed through the pages. There are two big types that can probably be speedy deleted:
- Most, or at least a large number, of these pages were created by Quachthanh/Thanhtrungquach, who was blocked several times via various accounts and IPs for copyright violation. According to one of his user pages that I can no longer find, he is based in Canada.
- Pages like Chu Văn Vương Bát Quái, Bồ-đề-đạt-ma and Bình định Tam Tần (something to do with old Chinese history/medicine/philosophy/legends/I Ching) are definitely his. Maths/physics-related pages (like Hổn số, Lượng tử nhiệt quang, etc.) are most likely his too. For any given Vietnamese page, it is safe to assume that he created it if the IP creating that page geolocates to Canada.
- He also created English guides (like Cut and other pages in Category:Từ vựng tiếng Anh), collections of idioms (like H) and various other kinds of off-topic content.
- Another big chunk is copies of official textbooks and related resources. Vietnamese textbooks are not in PD. The names of these pages follow the regex pattern .+ ([6789]|1[012]).
- Công nghệ (note the similarities: on-wiki copy, source (page 155)), Giáo dục công dân, Hóa học, Lịch sử, Sinh học, etc.
- Some might not be placed under a common pseudo-namespace, like Cuộc chia tay của những con búp bê (likely copied from this source).
I would recommend deleting all of them as a start. NguoiDungKhongDinhDanh 20:25, 28 January 2026 (UTC)
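For anyone who wants to shortlist candidates by title, here is a minimal sketch of how the pattern quoted above could be applied. It assumes the trailing dot in the comment is sentence punctuation rather than part of the regex, and that the pattern is meant to match the whole title; the sample titles and the helper name looks_like_textbook_copy are hypothetical and only there for illustration.

```python
import re

# Pattern as quoted in the comment above: any name, a space, then a number from 6 to 12.
# Assumption: the pattern must match the entire page title (hence fullmatch below).
TEXTBOOK_TITLE = re.compile(r".+ ([6789]|1[012])")


def looks_like_textbook_copy(title: str) -> bool:
    """Return True if the whole title matches the textbook naming pattern."""
    return TEXTBOOK_TITLE.fullmatch(title) is not None


# Hypothetical titles for illustration only; not taken from the wiki.
sample_titles = ["Công nghệ 7", "Hóa học 10", "Lịch sử 12", "Hóa học 13", "Golf"]
flagged = [t for t in sample_titles if looks_like_textbook_copy(t)]
print(flagged)
```

A title match is only a hint that a page belongs to this group; each flagged page would still need a manual check against its source before deletion.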
- This is a start, thank you.
> For any given Vietnamese page, it is safe to assume that he created it if the IP creating that page geolocates to Canada.
Unfortunately, this became impossible to discern after temporary accounts were introduced. There's been an influx of new pages created anonymously since last September. You can see the accounts doing this using catanalysis. AtUkr (talk) 21:04, 28 January 2026 (UTC)
- @AtUkr: You should be able to view the underlying IPs as a sysop; see mw:Trust and Safety Product/Temporary Accounts/Access to IP. NguoiDungKhongDinhDanh 21:39, 28 January 2026 (UTC)
- Wow, never realised that was a feature. Well now, how come this person isn't globally blocked? AtUkr (talk) 21:51, 28 January 2026 (UTC)
- @AtUkr: I suppose that's because no one reported him to m:SRG. In any case, the accounts have long been inactive, so a lock is unlikely. You can block his IP ranges locally if you feel that is necessary. NguoiDungKhongDinhDanh 02:24, 29 January 2026 (UTC)
- Alright, that's 3,000 pages gone. What about the remainder? How good are those pages? AtUkr (talk) 00:07, 30 January 2026 (UTC)
- @AtUkr: That was fast. Thanks for your work. A lot of the remaining 400+ don't seem to be of any value whatsoever at a glance, and can probably be deleted as well:
- Aggregation pages: Vật lý, Bộ dữ liệu sức khỏe, plus portals (this wiki has no Portal namespace, so these are actually in mainspace).
- Broken redirects: Đạo Lão, Bói, Cắt nghĩa văn hóa, etc.
- Empty categories: Category:Khí công, Category:Thí nghiệm, Category:Tần Thủy Hoàng, etc.
- Obvious machine translations, likely from enwiki: Golf, Thu hút giữa các cá nhân, Chương trình giảng dạy ẩn, etc.
- More copyright violations: Đòi nợ đúng luật (source), etc.
- Empty, almost empty or having no useful content: Tâm lý học Internet, etc.
- I'll check them one by one and tag for deletion as necessary. As for the rest (that is, if there are even any left), I'll need to understand exactly what is expected of Wikiversity articles. I read en:Wikiversity:What is Wikiversity? but that wasn't very helpful.
- NguoiDungKhongDinhDanh 04:51, 30 January 2026 (UTC)
- A decent number of articles are technology/programming-related; those were mostly, if not all, created by Kimkha. They are certainly on-topic for Wikibooks, but I'm not sure about Wikiversity. As a tangentially relevant note, some of them (like Các thành phần cơ bản của ngôn ngữ C) explicitly state that they were written by someone else but licensed under CC-BY. NguoiDungKhongDinhDanh 05:24, 30 January 2026 (UTC)
- > They are certainly on-topic for Wikibooks, but I'm not sure about Wikiversity.
Some of them are listed as "textbooks", so perhaps they do belong on Wikibooks rather than here. Provided they're still useful, that is. Who knows, given how old they are.
> As a tangentially relevant note, some of them (like Các thành phần cơ bản của ngôn ngữ C) explicitly state that they were written by someone else but licensed under CC-BY.
Don't they say those are articles Kimkha rewrote? So not original texts authored by other people. AtUkr (talk) 21:31, 30 January 2026 (UTC)
- @AtUkr: Some of them might have been, but we would need the source texts to know whether those claims are truthful. For instance, Giới thiệu tổng quan về AJAX says it's a rewrite, but is exactly the same as this forum post by a poster named "đặng việt phương" (archived in 2014, unknown publish date). On the other hand, that old site does have this CC-BY notice at the footer:
All materials comply with the Creative Commons Attribution 3.0 license unless an exception is clearly noted.
- NguoiDungKhongDinhDanh 22:07, 30 January 2026 (UTC)
- Giới thiệu về lập trình hướng đối tượng and Các thành phần cơ bản của ngôn ngữ C were probably copied from this old Connexions collection, clearly noted as having been licensed under CC-BY 3.0. A copy of this collection can be found here. NguoiDungKhongDinhDanh 22:23, 30 January 2026 (UTC)
- Checked all of them.
- Pages created by NewUniverse are all machine translations, likely from enwiki and enwikiversity (without attribution, which also makes them technically copyright violations). I didn't tag those short enough not to contain errors, but they were certainly not rechecked by the creator.
- JohnsonLee01's are also mostly machine translations, but were manually edited to some extent. Also no attribution, however.
- Plutus's pages were likely generated using LLMs. See also this discussion on viwikibooks.
- Masonthetrex127 is a known cross-wiki hoaxer who spams dinosaur images. Pages they edited should probably be deleted: Haikouichthys, Phthinosuchus, Plesiadapis. These are likely machine translations anyway.
- NguoiDungKhongDinhDanh 07:08, 30 January 2026 (UTC)
- I'll have to take a better look at Plutus' pages, but the rest has been pretty much wiped out. There are two collections that remain of interest: Spanish 1, probably also a bad machine translation? It has some very suspicious headers.
The second one is Triết học không chuyên, which talks about philosophy, I think; a single subpage from this collection was marked by you as copyvio, which seems strange. What about the rest of it?
- UPD: one question down. AtUkr (talk) 21:36, 30 January 2026 (UTC)
- @AtUkr: Yes, Spanish 1 and subpages sure look like machine translations. NguoiDungKhongDinhDanh 21:42, 30 January 2026 (UTC)
- Went about deleting those as well as I could. We're down to around 100 content pages now.
> Obvious machine translations
Alas, not so obvious for me, which is why I'm very grateful for your help.
> As for the rest (that is, if there are even any left), I'll need to understand exactly what is expected of Wikiversity articles. I read en:Wikiversity:What is Wikiversity? but that wasn't very helpful.
To keep it brief, an article needs to be useful for at least one of three things: learning something, teaching something, or keeping track of a research project. The first point means a certain overlap with Wikibooks' scope, but Wikiversity doesn't require texts to be structured and ordered as in a textbook. en:Wikiversity:Scope might have a better explanation than the one you read. AtUkr (talk) 21:26, 30 January 2026 (UTC)
- @AtUkr: Excellent work and thanks for the pointer. I think you can go ahead and delete all these pages as well. I checked a sample and all were copyright violations, trivially findable using Earwig's Copyvio Detector. NguoiDungKhongDinhDanh 21:32, 30 January 2026 (UTC)
- All done. AtUkr (talk) 21:55, 30 January 2026 (UTC)
- @NguoiDungKhongDinhDanh: I'm not very active on this project, so please check Quachthanh's IPs carefully (not just 205.xxx). It seems that after I blocked his IPs on Wikibooks, he moved over here and mass-created junk pages. Nvdtn19 (talk) 08:13, 31 January 2026 (UTC)
- @Nvdtn19: I already deleted everything that came from that IP range, as well as from other ranges located in the vicinity of Toronto (given his Wikibooks user page). AtUkr (talk) 14:13, 31 January 2026 (UTC)