Commons:Bots/Requests/PencakeBot
PencakeBot (talk · contribs)
Operator: 虹易 (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought: Upload ancient books ({{PD-Scan}}) downloaded from http://read.nlc.cn. There are more than 20k books[1], each with several volumes (files).
Automatic or manually assisted: Automatic. The booklist is expected to be built at first and then roughly (but not exhaustively) reviewed.
Edit type (e.g. Continuous, daily, one time run): One time (but probably intermittently) or multiple times.
Maximum edit rate (e.g. edits per minute): Not fixed. Depending on the workload.
Bot flag requested: (Y/N): Yes. (not very sure)
Programming language(s): Python with pywikibot or custom scripts. 虹易 (talk) 12:49, 3 June 2021 (UTC)
- Discussion
- Please make test run. --EugeneZelenko (talk) 13:58, 3 June 2021 (UTC)
- @虹易: ? --Krd 07:03, 12 July 2021 (UTC)
- @Krd: Thanks for the ping. I have been a little busy these days, I will finish my script and make some test edits ASAP (possibly by the end of this week).--虹易 (talk) 03:44, 15 July 2021 (UTC)
Closing as stale, please report when ready to test. --Krd 07:42, 18 August 2021 (UTC)
- Reopened per request. --Krd 09:20, 12 October 2021 (UTC)
- The bot has made some experimental edits as a part of my debugging process. It is not very stable due to the problems of chunked-uploading of MediaWiki API. Also, source files are not well organized. So I was constantly monitoring the uploading and tweak the script / templates / generated pages. The current ongoing task batch of 100+ books and 700~ files for the test is categorized as Category:National_Library_of_China-Yunnan_Provincial_Library_Ancient_Books. I am not sure whether those categorizations, naming of files, usage of templates, etc. are proper or not. Any suggestions would help a lot. --虹易 (talk) 09:29, 12 October 2021 (UTC)
- Is Category:太史升菴文集 really non-translatable? --EugeneZelenko (talk) 15:02, 12 October 2021 (UTC)
- Hi, EugeneZelenko. Actually, I did not have a plan to translate those book titles. Is that necessary or recommended? One primary problem is that the number of books to upload is too huge for me to translate one by one. Also, I find them pretty difficult to translate properly, whether automatically or manually, possibly overweighing the benefits of translation. For example, "太史升菴文集" can be segmented into "太史 升菴 文集" where:
- "太史" (Q10941101) is an official position/title in the Chinese traditional imperial court system. "太" means great/grand and "史" means history. 太史 is primarily in charge of the compilation of historical events and astronomical calendar. I did not find a proper translation other than simply pinyinizing (i.e. latinizing/romanization) it as Tai Shi, which is ambiguous without context and even confiting with Grand Preceptor (Q836410) (太师 or Grand Preceptor, also Tai Shi).
- "升菴" (pinyin: Sheng An), is the courtesy name of Yang Shen (Q711808).
- "文集" (pinyin: Wen ji) means "Collected Works".
- which then can be translated literally in whole as "Collected Works of Tai Shi Sheng An"/"... Taishi Shengan" as suggested by Google Translate. But, I think "Tai Shi Sheng An" is just a string of ambiguous symbols, conveying no meaning as it is not actually an accepted term by researchers/scholars/readers. Also, 太史升菴文集 is a set of books (or a book with decades of volumes, depending on one's understanding) that is collected and compiled by ancient scholars. In other words, this book/"collection" is formed in history, probably dating back to Ming Dynasty, containing some additional notes and preamble written by those scholars/editors (other than the author Sheng An). So if we choose this name, "Category:Collected Works of Tai Shi Sheng An" would be a little misleading as it should not include all works by Sheng An. That is, even implied by its name, the category would be only for 太史升菴文集 -- the book with volumes or the established set of books instead of all works of an author. If we just pinyinizing all the characters, we have "Tai Shi Sheng An Wen ji" as used by [2] (machine-translated, I suppose) which looks a little more strange to me and are hardly understandable/identifiable. ---虹易 (talk) 07:27, 13 October 2021 (UTC)
- Hi, EugeneZelenko. Actually, I did not have a plan to translate those book titles. Is that necessary or recommended? One primary problem is that the number of books to upload is too huge for me to translate one by one. Also, I find them pretty difficult to translate properly, whether automatically or manually, possibly overweighing the benefits of translation. For example, "太史升菴文集" can be segmented into "太史 升菴 文集" where:
- Is Special:Diff/597254540 an automated edit? If not, why are manual edits made with the bot account? --Krd 13:43, 19 October 2021 (UTC)
- @Krd: No, it is a manual edit. The bot was under intensive debugging then. And that original filename was auto-generated by the bot incorrectly. The interactive user interface for raising Rename requests prevents me from choosing Criterion 1 (Per the original uploader's request) with my primary account. So I just logged in with the bot account without thinking much about it. Now I realize it is somewhat improper. I will avoid such actions.--虹易 (talk) 03:33, 20 October 2021 (UTC)
- This looks good to me. Let's keep this request open for a few days in case there is additional feedback. --Krd 04:37, 20 October 2021 (UTC)
- @Krd: No, it is a manual edit. The bot was under intensive debugging then. And that original filename was auto-generated by the bot incorrectly. The interactive user interface for raising Rename requests prevents me from choosing Criterion 1 (Per the original uploader's request) with my primary account. So I just logged in with the bot account without thinking much about it. Now I realize it is somewhat improper. I will avoid such actions.--虹易 (talk) 03:33, 20 October 2021 (UTC)
Approved. --Krd 07:54, 25 October 2021 (UTC)