Commons:Bots/Requests/MidleadingBot

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

MidleadingBot (talk · contribs)

Operator: Midleading (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)

Bot's tasks for which permission is being sought: Batch upload files.

Automatic or manually assisted: Manually prepare the file list and files.

Edit type (e.g. Continuous, daily, one time run): Task

Maximum edit rate (e.g. edits per minute): upload: Depends on network, but usually two or one upload per min. edit: 6 per min

Bot flag requested: (Y/N): Y(management of files, query in batch of 5000 instead of 500)

Programming language(s): C#

Midleading (talk) 09:21, 13 May 2017 (UTC)[reply]

Discussion

  • "Batch upload files" is very general. What files will be uploaded? Only scans of old books? // 你所说的“批量上传”太宽泛。有哪些文件将会被上传?只有古代书籍的扫描吗? --Zhuyifei1999 (talk) 09:43, 13 May 2017 (UTC)[reply]

Yes.是的。--Midleading (talk) 11:56, 13 May 2017 (UTC)[reply]

There is a total list of uploads at zh:s:Special:PermanentLink/863487. It is about ~10 GB and ~2k files. All books are in the public domain. Description and detailed version information of each book is also provided in that table. Due to many volumes of a book may refer to the same description, it needs some time to associate these descriptions with each file, and re-debug of bot. I was thinking about adding these descriptions with another bot edit because it is easier to write a script to do text editing than uploading. I will fix the descriptions once I finish uploading or when I have written the code to update the info. I was running the bot because it uploads so slowly that it has very little impact on patrollers.--Midleading (talk) 16:10, 13 May 2017 (UTC)[reply]
Alternatively you could generate the all description before uploading, and then let us check a few to see if they are okay. Also would you mind adding some interwiki links to the description pages? (eg the source of File:A1746:方苞望溪先生集12-09.djvu could link to zh:四部叢刊.) Regarding scope and copyright LGTM. --Zhuyifei1999 (talk) 16:42, 13 May 2017 (UTC)[reply]
I have associated the descriptions and the files uploaded so far(about ~1K files). The descriptions are deployed by MidleadingBot on Wikisource now. An example is zh:s:Index:A0042:尔雅.djvu. The description is placed below the cover of the book. I can use my database to associate the files and descriptions on commons as well with this bot. 至于文件链接至维基百科,我觉得没有必要,因为分类已经链接至维基百科了,如果一定要链接,文件应当链接至相应的作者,而不是四部丛刊。--Midleading (talk) 02:29, 14 May 2017 (UTC)[reply]
然而我并没有看到Category:四部叢刊初編或者Category:四部叢刊集部有维基百科链接。链接作者当然也行,有总比没有好 --Zhuyifei1999 (talk) 04:22, 14 May 2017 (UTC)[reply]
请注意父分类Category:四部叢刊--Midleading (talk) 04:53, 14 May 2017 (UTC)[reply]

剛才我檢查了維基百科,發現以下作者已建立維基百科條目:

岑參崔豹常璩戴復古戴良戴表元戴震晁公武晁補之晁說之曹植曾國藩曾慥曾鞏查慎行查繼佐班固白居易程敏政蔡邕鄧析長孫無忌陳子昂陳師道陳彭年陳思陳淵陳獻章陳維崧陳與義鮑照伏勝房玄齡房祺方孝孺方苞杜光庭杜牧杜甫杜預歸有光段成式獨孤及竇常范仲淹范成大范甯葛洪董仲舒郭忠恕郭璞郭茂倩郭象韓非顧炎武馮贄高仲武高啟高誘高適龔自珍何休何晏姜尚嵇康忽思慧桓寬江淹洪亮吉洪咨夔皇甫冉皇甫湜胡安國胡曾計有功許慎許謙賈島賈思勰賈昌朝賈誼韓偓韓嬰韓愈駱賓王黃宗羲黃帝黃庭堅黃溍黃滔京房劉克莊劉向劉基劉安劉安世劉徽劉恕劉敞劉熙劉蛻劉邵厲鶚孔元措孔穎達孔鮒寇準揭傒斯李中李商隱李復言李德裕李昉李白李群玉李翺李覯李隆基李頻林逋焦贛酈道元劉勰劉因劉歆劉知幾劉禹錫劉義慶呂本中呂溫呂祖謙墨子孟浩然孟郊柳宗元柳開梅堯臣歐陽脩毛亨盧仝穆修繆荃孫羅隱陸九淵陸德明陸機陸賈陸贄陸雲陸龜蒙馬令全祖望商鞅彭孫貽權德輿歐陽詹沈亞之沈括沈約沈遼皮日休秦觀邵雍錢大昕錢若水錢謙益錢起阮閱司空圖司馬光唐庚唐慎微唐順之宋濂汪中汪琬汪藻王九思王充王冰王勃王十朋王士元王夫之王安石王弼王明清王符王若虛田穰苴蘇洵蘇舜欽蘇軾蘇轍陶宗儀陶弘景陶潛吳偉業吳兢吳寬吳縝吴起徐枋徐賁徐鍇徐陵文同文天祥王叔和王士禛王應麟王澍王灼王炎午王禹偁王維王肅王通王逸王陽明蕭統謝朓謝枋得邢昺韋昭魏了翁魏徵元稹姚合姚燧姚鉉姚鼐尹文尹洙庾信徐夤應劭揚雄晏子楊倞楊基楊時楊維楨楊衒之殷璠耶律楚材荀悅虞集袁宏袁桷袁樞岳珂顏之推顏真卿周煇張九齡張仲景張元濟張師正張昱張栻張湛張綱張說張載張邦基真德秀章衡趙孟頫趙岐趙明誠趙曄趙爽趙秉文鄭玄朱慶餘朱松朱熹朱震莊廷鑨

尚未擁有維基百科條目,或者尚未建立重定向頁,或者需要改為簡體字才能鏈接至維基百科,約31%:

鮑彪鮑山貝瓊陳傅浪陳基程端禮程俱敕撰崔致遠杜本範坰範浚範梈範攄傅習管時敏桂萬榮郭若虛郭勛河上公洪邁洪適許渾許月卿華嶽黃昇黃仲元吉天保賈公顏姜夔撰孔安國孔晁李賀李鹹用李建勳李劉李元弼劉長卿柳貫婁機樓鑰盧辯盧文弨盧照鄰陸佃陸遊馬總倪瓚聶嵩義歐陽玄闕名阮元撰芮挺章薩都刺邵亭貞沈與求盛熙明史容史遊史炤釋道潛釋道世釋道宣釋道原釋法雲釋貫休釋寒山釋行均釋惠洪釋皎然釋齊己釋契嵩釋僧佑釋玄奘釋重顯宋之問蘇伯衡蘇天爵蘇象先孫樵孫星衍王績王俅王惲韋穀韋應物韋莊溫庭筠吳萊蕭立等謝肅謝應芳徐靈府徐乾徐鉉楊朝英楊炯楊萬裏楊雄楊載葉適餘闕元好問元結袁康惲敬章樵張楚叔張惠言張籍張九成張君房張耒張孝祥張養浩張雨張翥趙崇祚趙以夫鄭穀鄭思肖周賀朱德潤朱彜尊

因此可以嘗試將文件鏈接至作者的維基百科

Midleading (talk) 05:55, 14 May 2017 (UTC)[reply]

维基百科简体繁体会自动转换,所以使用interwiki一般都没事。另目前状态通过维基数据连接维基百科条目与维基共享资源分类存在争议,之前有机器人专门删除此类链接,所以我不建议让维基数据链接作为唯一链接。interwiki相比保险很多 --Zhuyifei1999 (talk) 06:24, 14 May 2017 (UTC)[reply]

你觉得这个机器人现在还有什么问题需要解决的呢?需要等待一周行政员才过来授权吗?刚才那个洋人说的“Language tags”是什么,要放到哪里?还有,我这里还有大约600GB的其他扫描古籍,这些丛书大多没有整理得那么好的目录和作者信息,需要自己整理版权信息,即便是用半自动化的整理方法也需要很久,这些书籍也可以上传吗,还是需要整理以后才能上传,还是维持现状、不上传?--Midleading (talk) 07:12, 14 May 2017 (UTC)[reply]

"Language tag" 指如 {{En}}, {{Zh-hant}} 子类的模板。一般而言文件说明要使用,可以参考刚评选出的2016年年度图片。其他600GB建议整理后上传,因为批量上传经常由于文件说明缺少质量引发争议(我不反对批量上传但有好几人反对)。关于授权,可能要等一阵子,这里BRFA进度慢得有时我都难受。 --Zhuyifei1999 (talk) 08:20, 14 May 2017 (UTC)[reply]
 Support with advice:
支持上传大量古籍,但是我建议您在大量上传前在维基文库发起讨论。
  1. 维基共享资源是所有语言维基媒体共享的资源库,标题中应包含英文拼音。
  2. 版本明确是《四部丛刊》的一大特点,建议在标题中体现。作者也应体现。版本和作者信息可以从这里找到。
  3. 古籍以繁体字命名。
  4. 编号非古籍本身的一部分,而是现代人所编。我觉得体现在标题里不当,可以在描述中体现。

File:A1763:戴震东原集4-1.djvu我觉得最好命名成:File:Sibu Congkan - 戴震東原集 - 戴震 - 上海涵芬樓藏經韻樓刊本 - 1.djvu。建议对未上传的图书按此方案命名,已经上传的图书移动。--維基小霸王 (talk) 01:17, 19 May 2017 (UTC)[reply]

我认为还是要在文件名中加入编号,因为这样既便于自己管理文件也便于他人管理文件。你那个版本信息不全,我已经自己整理了版本。--Midleading (talk) 08:01, 23 May 2017 (UTC)[reply]

If possible please give a short summary of the discussion in English when finished. Thx. --Krd 15:21, 21 May 2017 (UTC)[reply]

Summary: rename existing files to new naming convention proposed in zh:s:Project:Bot policy#User:MidleadingBot, and add file descriptions and version information from database at zh:s:Special:PermanentLink/863487. New uploads are uploaded with file names and descriptions in accordance to consensus. Add interwiki links to Chinese Wikipedia pages of the author. This task is part of a project to import 430k pages to zhwikisource. A file mover at Commons is requested for service. More details about this bot is at zh:s:Project:Bot policy#User:MidleadingBot--Midleading (talk) 02:39, 25 May 2017 (UTC)[reply]


Approved. --Krd 05:17, 29 May 2017 (UTC)[reply]