Commons:Bots/Requests/MidleadingBot
Operator: Midleading (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)
Bot's tasks for which permission is being sought: Batch upload files.
Automatic or manually assisted: Manually prepare the file list and files.
Edit type (e.g. Continuous, daily, one time run): Task
Maximum edit rate (e.g. edits per minute): Uploads: depends on the network, but usually one or two per minute. Edits: 6 per minute.
Bot flag requested: (Y/N): Y (for file management, and to query in batches of 5000 instead of 500)
Programming language(s): C#
Midleading (talk) 09:21, 13 May 2017 (UTC)
Discussion
- "Batch upload files" is very general. What files will be uploaded? Only scans of old books? --Zhuyifei1999 (talk) 09:43, 13 May 2017 (UTC)
Yes. --Midleading (talk) 11:56, 13 May 2017 (UTC)
- Copyright-related information is definitely missing from the descriptions. Language tags should be used too. --EugeneZelenko (talk) 14:29, 13 May 2017 (UTC)
- The full list of uploads is at zh:s:Special:PermanentLink/863487. It totals about 10 GB and about 2k files. All books are in the public domain. The table also provides a description and detailed version information for each book. Because many volumes of a book may share the same description, it will take some time to associate the descriptions with each file and to re-debug the bot. I was thinking of adding the descriptions in a separate bot edit, because it is easier to write a script for text editing than for uploading. I will fix the descriptions once I finish uploading or once I have written the code to update the info. I have been running the bot because it uploads so slowly that it has very little impact on patrollers. --Midleading (talk) 16:10, 13 May 2017 (UTC)
- Alternatively, you could generate all the descriptions before uploading and then let us check a few to see if they are okay. Also, would you mind adding some interwiki links to the description pages? (E.g., the source of File:A1746:方苞望溪先生集12-09.djvu could link to zh:四部叢刊.) Regarding scope and copyright, LGTM. --Zhuyifei1999 (talk) 16:42, 13 May 2017 (UTC)
- I have associated the descriptions with the files uploaded so far (about 1K files). MidleadingBot has now deployed the descriptions on Wikisource. An example is zh:s:Index:A0042:尔雅.djvu. The description is placed below the cover of the book. I can use my database to associate the files and descriptions on Commons as well with this bot. As for linking the files to Wikipedia, I don't think it is necessary, because the categories already link to Wikipedia. If links are required, the files should link to the corresponding author rather than to 四部丛刊 (Sibu Congkan). --Midleading (talk) 02:29, 14 May 2017 (UTC)
- However, I don't see any Wikipedia links at Category:四部叢刊初編 or Category:四部叢刊集部. Linking to the author is of course fine too; something is better than nothing. --Zhuyifei1999 (talk) 04:22, 14 May 2017 (UTC)
I just checked Wikipedia and found that the following authors already have Wikipedia articles:
岑參,崔豹,常璩,戴復古,戴良,戴表元,戴震,晁公武,晁補之,晁說之,曹植,曾國藩,曾慥,曾鞏,查慎行,查繼佐,班固,白居易,程敏政,蔡邕,鄧析,長孫無忌,陳子昂,陳師道,陳彭年,陳思,陳淵,陳獻章,陳維崧,陳與義,鮑照,伏勝,房玄齡,房祺,方孝孺,方苞,杜光庭,杜牧,杜甫,杜預,歸有光,段成式,獨孤及,竇常,范仲淹,范成大,范甯,葛洪,董仲舒,郭忠恕,郭璞,郭茂倩,郭象,韓非,顧炎武,馮贄,高仲武,高啟,高誘,高適,龔自珍,何休,何晏,姜尚,嵇康,忽思慧,桓寬,江淹,洪亮吉,洪咨夔,皇甫冉,皇甫湜,胡安國,胡曾,計有功,許慎,許謙,賈島,賈思勰,賈昌朝,賈誼,韓偓,韓嬰,韓愈,駱賓王,黃宗羲,黃帝,黃庭堅,黃溍,黃滔,京房,劉克莊,劉向,劉基,劉安,劉安世,劉徽,劉恕,劉敞,劉熙,劉蛻,劉邵,厲鶚,孔元措,孔穎達,孔鮒,寇準,揭傒斯,李中,李商隱,李復言,李德裕,李昉,李白,李群玉,李翺,李覯,李隆基,李頻,林逋,焦贛,酈道元,劉勰,劉因,劉歆,劉知幾,劉禹錫,劉義慶,呂本中,呂溫,呂祖謙,墨子,孟浩然,孟郊,柳宗元,柳開,梅堯臣,歐陽脩,毛亨,盧仝,穆修,繆荃孫,羅隱,陸九淵,陸德明,陸機,陸賈,陸贄,陸雲,陸龜蒙,馬令,全祖望,商鞅,彭孫貽,權德輿,歐陽詹,沈亞之,沈括,沈約,沈遼,皮日休,秦觀,邵雍,錢大昕,錢若水,錢謙益,錢起,阮閱,司空圖,司馬光,唐庚,唐慎微,唐順之,宋濂,汪中,汪琬,汪藻,王九思,王充,王冰,王勃,王十朋,王士元,王夫之,王安石,王弼,王明清,王符,王若虛,田穰苴,蘇洵,蘇舜欽,蘇軾,蘇轍,陶宗儀,陶弘景,陶潛,吳偉業,吳兢,吳寬,吳縝,吴起,徐枋,徐賁,徐鍇,徐陵,文同,文天祥,王叔和,王士禛,王應麟,王澍,王灼,王炎午,王禹偁,王維,王肅,王通,王逸,王陽明,蕭統,謝朓,謝枋得,邢昺,韋昭,魏,魏了翁,魏徵,元稹,姚合,姚燧,姚鉉,姚鼐,尹文,尹洙,庾信,徐夤,應劭,揚雄,晏子,楊倞,楊基,楊時,楊維楨,楊衒之,殷璠,耶律楚材,荀悅,虞集,袁宏,袁桷,袁樞,岳珂,顏之推,顏真卿,周煇,張九齡,張仲景,張元濟,張師正,張昱,張栻,張湛,張綱,張說,張載,張邦基,真德秀,章衡,趙孟頫,趙岐,趙明誠,趙曄,趙爽,趙秉文,鄭玄,朱慶餘,朱松,朱熹,朱震,莊廷鑨
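The following authors, about 31%, do not yet have a Wikipedia article, do not yet have a redirect page, or need to be converted to simplified characters before they can link to Wikipedia: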
鮑彪,鮑山,貝瓊,陳傅浪,陳基,程端禮,程俱,敕撰,崔致遠,杜本,範坰,範浚,範梈,範攄,傅習,管時敏,桂萬榮,郭若虛,郭勛,河上公,洪邁,洪適,許渾,許月卿,華嶽,黃昇,黃仲元,吉天保,賈公顏,姜夔撰,孔安國,孔晁,李賀,李鹹用,李建勳,李劉,李元弼,劉長卿,柳貫,婁機,樓鑰,盧辯,盧文弨,盧照鄰,陸佃,陸遊,馬總,倪瓚,聶嵩義,歐陽玄,闕名,阮元撰,芮挺章,薩都刺,邵亭貞,沈與求,盛熙明,史容,史遊,史炤,釋道潛,釋道世,釋道宣,釋道原,釋法雲,釋貫休,釋寒山,釋行均,釋惠洪,釋皎然,釋齊己,釋契嵩,釋僧佑,釋玄奘,釋重顯,宋之問,蘇伯衡,蘇天爵,蘇象先,孫樵,孫星衍,王績,王俅,王惲,韋穀,韋應物,韋莊,溫庭筠,吳萊,蕭立等,謝肅,謝應芳,徐靈府,徐乾,徐鉉,楊朝英,楊炯,楊萬裏,楊雄,楊載,葉適,餘闕,元好問,元結,袁康,惲敬,章樵,張楚叔,張惠言,張籍,張九成,張君房,張耒,張孝祥,張養浩,張雨,張翥,趙崇祚,趙以夫,鄭穀,鄭思肖,周賀,朱德潤,朱彜尊
So we can try linking the files to the authors' Wikipedia articles.
Midleading (talk) 05:55, 14 May 2017 (UTC)
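- Wikipedia converts between simplified and traditional characters automatically, so interwiki links are generally fine. Also, linking Wikipedia articles to Commons categories through Wikidata is currently disputed, and a bot previously went around removing exactly such links, so I don't recommend relying on Wikidata as the only link. Interwiki links are much safer by comparison. --Zhuyifei1999 (talk) 06:24, 14 May 2017 (UTC)
What problems do you think this bot still needs to resolve? Do I need to wait a week before a bureaucrat comes to grant approval? What are the "Language tags" that the foreigner mentioned just now, and where should they go? Also, I have about 600 GB of other scanned ancient books. Most of these collections do not have such well-organized catalogs and author information, so I would have to compile the copyright information myself, which would take a long time even with a semi-automated method. Can these books be uploaded as well, do they need to be organized before uploading, or should things stay as they are, with no upload? --Midleading (talk) 07:12, 14 May 2017 (UTC)
- "Language tag" refers to templates such as {{En}} and {{Zh-hant}}. File descriptions should generally use them; for reference, see the recently selected 2016 Picture of the Year. For the other 600 GB, I suggest organizing them before uploading, because batch uploads often cause disputes over poor-quality file descriptions (I don't object to batch uploads, but several people do). As for approval, it may take a while; BRFA here moves so slowly that it sometimes frustrates even me. --Zhuyifei1999 (talk) 08:20, 14 May 2017 (UTC)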
- Support with advice:
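- I support uploading a large number of ancient books, but I suggest that you start a discussion on Wikisource before uploading in bulk.
- Wikimedia Commons is a repository shared by Wikimedia projects in all languages, so titles should include romanized pinyin.
- A clearly identified edition is a defining feature of the 《四部丛刊》 (Sibu Congkan), so I suggest reflecting it in the title. The author should be reflected as well. Edition and author information can be found here.
- Ancient books should be named in traditional characters.
- The number is not part of the ancient book itself; it was assigned by modern editors. I don't think it belongs in the title, but it can go in the description.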
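For example, I think File:A1763:戴震东原集4-1.djvu would best be named File:Sibu Congkan - 戴震東原集 - 戴震 - 上海涵芬樓藏經韻樓刊本 - 1.djvu. I suggest naming books that have not been uploaded yet according to this scheme and moving the ones already uploaded. --維基小霸王 (talk) 01:17, 19 May 2017 (UTC)
- I think the number should still be included in the file name, because it makes the files easier to manage both for me and for others. Your edition information is incomplete; I have already compiled the editions myself. --Midleading (talk) 08:01, 23 May 2017 (UTC)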
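(Editorial sketch, not part of the signed comment.) The naming pattern in the example above can be illustrated with a small helper. This is only a sketch of that one proposed pattern, not necessarily the convention finally adopted and not the bot's actual code; the function name `sibu_congkan_filename` and its parameters are hypothetical.

```python
def sibu_congkan_filename(title, author, edition, volume, ext="djvu"):
    """Join the parts as 'Sibu Congkan - title - author - edition - volume'.

    Hypothetical helper: the field order is copied from the single example
    given in the discussion; nothing here comes from the bot's real code.
    """
    parts = ["Sibu Congkan", title, author, edition, str(volume)]
    return "File:" + " - ".join(parts) + "." + ext


# Rebuild the example file name quoted in the comment above.
example = sibu_congkan_filename("戴震東原集", "戴震", "上海涵芬樓藏經韻樓刊本", 1)
print(example)
```

Keeping the parts in a list makes it easy to add or drop a field (for instance the collection number that is debated in this thread) without touching the joining logic.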
If possible please give a short summary of the discussion in English when finished. Thx. --Krd 15:21, 21 May 2017 (UTC)
Summary: Rename existing files to the new naming convention proposed in zh:s:Project:Bot policy#User:MidleadingBot, and add file descriptions and version information from the database at zh:s:Special:PermanentLink/863487. New uploads will use file names and descriptions in accordance with consensus. Interwiki links to the authors' Chinese Wikipedia pages will be added. This task is part of a project to import 430k pages to zhwikisource. A file mover at Commons is requested for service. More details about this bot are at zh:s:Project:Bot policy#User:MidleadingBot. --Midleading (talk) 02:39, 25 May 2017 (UTC)
Approved. --Krd 05:17, 29 May 2017 (UTC)