User talk:Zhuyifei1999/Archive 54
This is an archive of past discussions. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page.
FlickreviewR_2 is stuck
Hi Zhuyifei1999, your bot stopped working yesterday morning. Could you please relaunch it? The category "Flickr review needed" has been filling up again since Magnus Manske fixed the bug in his Flickr2commons tool. Thank you! --Edelseider (talk) 09:15, 15 January 2020 (UTC)
- Done --Zhuyifei1999 (talk) 11:04, 15 January 2020 (UTC)
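Would it be harassment if I approach him again?
He said "kindly refrain from coming to me anymore", so I don't know whether going back to him to discuss this would count as harassment. He himself said "Will a supervisor prefer a worker who does the same amount of work in 30 minutes or someone who takes a whole 30 days to finish the same thing?" and asked someone else to pass it on to me. I received it, but I have never understood what he wants me to do. Over several conversations, one moment he said he had not received my email, the next he said I was wasting his time; it feels as though he is dodging what he originally said. If I really do have the problem he described with "Will a supervisor prefer a worker who does the same amount of work in 30 minutes or someone who takes a whole 30 days to finish the same thing?", he has no reason not to tell me what I should do, so I do not understand why he is avoiding me. In the end he told me not to contact him again. I approached him only to confirm whether I really have this problem and, if so, in the hope that he could tell me what to do; after all, he is the one who said it, so asking him would naturally be clearest. I am repeating this here so that one more person knows, to avoid others thinking I am harassing him.--Kai3952 (talk) 19:58, 17 January 2020 (UTC)
- I really have no spare time at all right now. You can ask people on zhwiki to explain --Zhuyifei1999 (talk) 08:17, 18 January 2020 (UTC)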
Bonnie Bernstein
I added info to show which Flickr file was used. Mwinog2777 (talk) 01:49, 18 January 2020 (UTC)
- Which image are you referring to? --Zhuyifei1999 (talk) 08:18, 18 January 2020 (UTC)
Missing message/name for Commons:Picture of the Year/2019/R1/Gallery/dummy
Please go to a POTY 2019 gallery page, e.g. Commons:Picture of the Year/2019/R1/Gallery/Videos and Animations. Somewhere on the page you should see Error in Module:POTY. Message not found: cat-dummy). Unfortunately, I am not able to fix it myself. Although I suspect that the gallery should be deleted. Or MediaWiki:com-poty-cat-dummy should be created (and translated). --jdx Re: 06:55, 19 January 2020 (UTC)
- Yeah, this is to be expected. It's a temporary "category" for uncategorized images in Commons:Picture_of_the_Year/2019/Candidates/warnbig --Zhuyifei1999 (talk) 12:11, 19 January 2020 (UTC)
Latest tech news from the Wikimedia technical community. Please tell other users about these changes. Not all changes will affect you. Translations are available.
Changes later this week
- The new version of MediaWiki will be on test wikis and MediaWiki.org from 21 January. It will be on non-Wikipedia wikis and some Wikipedias from 22 January. It will be on all wikis from 23 January (calendar).
Future changes
- There is a new suggestion for what to show when someone edits without registering an account. This is to give unregistered editors better privacy and make some anti-vandalism work go faster. You can give feedback.
- Pywikibot is a Python library to automate work on wikis. It will no longer support Python 2. Use the python2 tag if you need to continue running Python 2 scripts. The Pywikibot team strongly recommends migrating to Python 3. You can get help to do so. [1]
- The weekly MediaWiki branch cut will soon become automated. The timing for this cut may change. You can discuss in Phabricator if this affects you. [2]
- You can read about coming technical events and mentoring interns.
Tech news prepared by Tech News writers and posted by bot • Contribute • Translate • Get help • Give feedback • Subscribe or unsubscribe.
19:42, 20 January 2020 (UTC)
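Can the Simplified and Traditional Chinese Main Page and Picture of the Day be merged?
Hello, I see that the Village Pump (互助客栈) has already had its Simplified/Traditional merge completed. Could the Main Page and Picture of the Day be merged as well? Otherwise, if Picture of the Day translations are made under zh, one of the Simplified/Traditional main pages will not perform script conversion and the other will not display them at all. --VulpesVulpes825 (talk) 19:51, 21 January 2020 (UTC)
- 首页 & Commons:今日图片 Done --Zhuyifei1999 (talk) 20:10, 21 January 2020 (UTC)
- Thank you very much for your help; I have nearly finished the merge. Could I trouble you to also set the page language of Commons:今日影音 to zh? It should be the last page embedded in the main page that the merge requires. --VulpesVulpes825 (talk) 23:29, 21 January 2020 (UTC)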
- Done --Zhuyifei1999 (talk) 00:21, 22 January 2020 (UTC)
Delinker
User talk:Zhuyifei1999/Archive 51#Delinker
It broke again didn't it? - Alexis Jazz ping plz 01:06, 22 January 2020 (UTC)
- toolforge:replag --Zhuyifei1999 (talk) 02:27, 22 January 2020 (UTC)
- 54 hours of lag.. That should be an easy frag. - Alexis Jazz ping plz 14:41, 22 January 2020 (UTC)
- I don't get the reference --Zhuyifei1999 (talk) 15:34, 22 January 2020 (UTC)
Latest tech news from the Wikimedia technical community. Please tell other users about these changes. Not all changes will affect you. Translations are available.
Problems
- Some mobile diffs have problems. A couple of buttons are not shown. Structured data diffs on Commons are confusing. The developers are working on fixing it. [3][4]
- Administrators on wikis that use Structured Discussions can't move discussion pages. This is a bug. The developers are working on fixing it. [5]
Changes later this week
- There is no new MediaWiki version this week.
Future changes
- There is JavaScript code on Special:Undelete for administrators that makes it possible to automatically select multiple checkboxes by holding the "Shift" key and clicking. This code is also loaded by accident on other special pages and on articles. This makes pages slower to load. This will be fixed. If you know of other special pages where this is useful please tell the developers at phab:T232688.
18:53, 27 January 2020 (UTC)
Solomon203
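I would like to ask an administrator for help. Solomon203 has the following editing problems:
- 1. Abuse of file renaming
- From observing him for three years, he has a habit of renaming files from undated names to dated ones, whether or not it is necessary, for example: File:Xinzhuang Fuduxin Platform2.jpg→File:Platform 2, Xinzhuang Fuduxin Station 20170318.jpg. This merely adds the digits 20170318 to indicate the date; the meaning of the file name is unchanged (it is still Platform 2), so the rename does not meet the criteria of COM:RENAME. I have told him twice that he should follow COM:FNC, but he replied "名從主人" (names follow their owner); I do not know what he is thinking. He was also previously found to be unsuited to renaming files, and his file mover right was removed as a result. I think this is no longer something I can handle alone, and I hope you can speak to him as an administrator.
- 2. Not writing edit summaries
- He habitually leaves no edit summary to explain his reasons; even when I ask him to, he still does not write one the next time, and over time this has caused me trouble. I often do not know what he is doing (see: 3. Edits tending towards vandalism), and asking him every time I notice something is not a long-term solution. So I tried to talk to him to help him understand my position, but his reply was "I have never felt that I followed anyone's edits". With that attitude he shows no willingness to face his editing problems; I hope you can speak to him as an administrator.
- 3. Edits tending towards vandalism
- Because he habitually leaves no edit summary, when he removes or changes other people's edits it is hard for others to tell whether the edit is vandalism. Only when I realised it had happened more than once or twice did I start watching his edits, and over time I found these problems:
- He often removes categories that others have added, for example: diff/363933605, diff/382055511, diff/353380910, diff/308737276
- He changes other people's correct edits into incorrect ones, for example diff/372424804: the photo was taken in 2008, and the author says so on Flickr too, but he still changed it to 2012.
- He sometimes alters categories added by others even though the original categories were fine, for example:
- diff/370528735: he changed Taiwan Railway Administration to Mascots of Taiwan and TRA CK124. The problem is that Mascots of Taiwan is not Mascots of Taiwan Railway Administration. I fixed it for him afterwards, but he keeps doing this, and that is no solution.
- diff/353660814: he changed Fruit of Taiwan to Newspapers of Taiwan. Under the category tree, placing betel nut in Fruit of Taiwan is fine, as I have seen others categorise it that way, so I assumed he had mistaken the betel nut for a newspaper; but when I saw him remove the category again, I realised it was deliberate vandalism.
- diff/388434073: he changed Religion in Hsinchu City to Taisun Enterprise and Grass jelly. As the number of photos grows, it may need to be subcategorised into Offerings in Hsinchu City, whose parent category would be Religious rituals in Hsinchu City; but with Religion in Hsinchu City removed I cannot find this photo, which hinders my categorisation work.
- The three kinds of alteration above show that he does not understand the categories he edits. I have suggested that he try to learn about them, but he has his own reasons. As communication has not worked, I suspect he may have a CIR problem. I ask an administrator to judge whether his edits benefit Wikimedia Commons.--Kai3952 (talk) 16:27, 21 January 2020 (UTC)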
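- The file-moving problem was already dealt with in 2017 by removing the right; I see no need to bring up something that was resolved more than two years ago.
- As for the rest, I don't know this person well. If there is a long-term history of disruption, I suggest COM:AN/U. If the language is a difficulty, I can help you revise it.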
- --Zhuyifei1999 (talk) 17:10, 21 January 2020 (UTC)
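- Perhaps, as you said last time, you have no time at all. Inspired by the sentence Minorax gave me, "Will a supervisor prefer a worker who does the same amount of work in 30 minutes or someone who takes a whole 30 days to finish the same thing?", I have to spend my time on work, so I will have to set the Solomon203 matter aside.--Kai3952 (talk) 21:13, 21 January 2020 (UTC)
- So I also have no time to look into what exactly he did wrong; if you are unhappy with him, please find someone else. I have already made a concession by saying I can help you revise the ANU report; since you don't want that, never mind. --Zhuyifei1999 (talk) 21:35, 21 January 2020 (UTC)
- I know you have no time, and besides, since you told me to go to ANU, this matter has nothing to do with you. But when you said "if you are unhappy with him", I do not know whether "him" means Minorax or Solomon203. If it is Minorax, I will reflect on myself; if it is Solomon203, I will go to ANU and withdraw the report. I saw you say you have "no time to look into what exactly he did wrong"; whether it is Minorax or Solomon203, I should not waste time investigating or reflecting. I do not know what you want me to do, but you have made one thing clear: whether or not it helps Wikimedia Commons, you do not want to deal with what happens between me and others. So I understand that you would like me not to come to you again, especially as you said "I have already made a concession"; clearly I have caused you trouble. Before I stop coming to you, I still need a clear answer: is it Minorax or Solomon203?--Kai3952 (talk) 21:03, 28 January 2020 (UTC)
- Solomon203, as per the section title --Zhuyifei1999 (talk) 22:31, 28 January 2020 (UTC)
- Whether you withdraw it is up to you. Other administrators may step in, but I will not --Zhuyifei1999 (talk) 22:34, 28 January 2020 (UTC)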
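- diff/372424804: although the EXIF data shows it was taken in 2008, the stage canvas in other photos from the same Flickr album shows the actual year was 2012. EXIF data cannot be the sole basis for determining the year a photo was taken. For example, I joined Wikipedia on 3 February 2010, and only then started taking photos with a digital camera to upload to Wikimedia Commons. At first I did not know what EXIF data was, so photos I uploaded at that time have EXIF data showing 2009 as the year taken. Take this photo: the version first uploaded in 2010 had EXIF data showing 2009, and only when I retouched it again on 12 December 2019 did I manually correct the year to 2010.--Solomon203 (talk) 14:21, 23 January 2020 (UTC)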
It appears that YouTube hasn't blocked the public IP from retrieving googlevideo.com data; you can generate the DirectLink client-side, through an open proxy, or over TOR.
# Try running this on k8s before 1580743805, or generate a different DirectLink using youtube-dl -g https://www.youtube.com/watch?v=npjF032TDDQ on an IP that isn't blocked.
import urllib.request
DirectLink = "https://r4---sn-vgqsrn7s.googlevideo.com/videoplayback?expire=1580743805&ei=Heg3Xr39LsiVhwaiu72QBA&ip=216.151.180.102&id=o-AAnTHfo74ExX-M57Qe5Nn5SVLoIrV97GcZ4ZilyVIsZI&itag=22&source=youtube&requiressl=yes&mm=31%2C29&mn=sn-vgqsrn7s%2Csn-vgqskn7l&ms=au%2Crdu&mv=m&mvi=3&pl=25&initcwndbps=593750&vprv=1&mime=video%2Fmp4&ratebypass=yes&dur=93.390&lmt=1575738724893529&mt=1580722124&fvip=4&fexp=23842630&c=WEB&txp=5432432&sparams=expire%2Cei%2Cip%2Cid%2Citag%2Csource%2Crequiressl%2Cvprv%2Cmime%2Cratebypass%2Cdur%2Clmt&sig=ALgxI2wwRQIgVkbl3TQ4kXARgJv5E3jmSPTNf3uKLLwAKWCDi_-OxZ0CIQCCvP9w_PljqLWV6tefqtPQSd90gsb1jq8oSNkWnltAHg%3D%3D&lsparams=mm%2Cmn%2Cms%2Cmv%2Cmvi%2Cpl%2Cinitcwndbps&lsig=AHylml4wRAIgUVbXoQLeG_NZFEA_s10SeONtidhgS1336e-9rngFpQ0CIBi4MscqcBZuISZEqxLzipmZ_MBhaiEQRVVxVSs3dc-t&contentlength=5411116&video_id=npjF032TDDQ&title=Raining+Tacos+-+Parry+Gripp+%26+BooneBum"
FileName = "DownloadTestYouTube.mp4"
try:
    print("Downloading")
    urllib.request.urlretrieve(DirectLink, FileName)
    print(" Task complete ")
except Exception as e:
    print(e)
SideNote: Any updates from YouTube ? -- Eatcha (talk) 10:18, 3 February 2020 (UTC)
- You are required to run youtube-dl -g https://www.youtube.com/watch?v=npjF032TDDQ on TOR/Open Proxy/Whatever -- Eatcha (talk) 10:20, 3 February 2020 (UTC)
- I just feel like sooner or later you would get blocked for every endpoint you can find.
- Any updates? Not anything productive that I am aware of. WMF was at All-Hands last week so there may be new updates to the thread. However, I might not be on that email thread.
- Also, did I mention the workaround that I was working on? I currently have a COI and am refraining from working on it. The code has been sent to someone else who is interested in working on this. --Zhuyifei1999 (talk) 16:10, 3 February 2020 (UTC)
- slimerjs + x11vnc: a client needs to download a piece of software (downloader + uploader), and the files would be re-encoded on WMCS. Cool, and no risk of getting blocked by YouTube, although some users with static IP addresses may end up blocked. That is not a big deal for average users like me: all I need to do to evade the block is reboot the router. -- Eatcha (talk) 17:59, 3 February 2020 (UTC)
Latest tech news from the Wikimedia technical community. Please tell other users about these changes. Not all changes will affect you. Translations are available.
Changes later this week
- The new version of MediaWiki will be on test wikis and MediaWiki.org from 4 February. It will be on non-Wikipedia wikis and some Wikipedias from 5 February. It will be on all wikis from 6 February (calendar).
20:05, 3 February 2020 (UTC)
FontCDN
Hello Zhuyifei1999,
May I ask your help with adding a font to the FontCDN? I reckon you've developed it, or at least a big part of it. I'm not sure though, if it's only a mirror or specific fonts can be added too.
The font I'm about to use in a skin is the free, monospaced Hack designed for source code.
It has a node package for cdns, however the .css is a bit messy so I've cleaned it up to be consistent with fontcdn css (source).
Thank you in advance,
—Aron Man.🍂 edits🌾 11:04, 5 February 2020 (UTC)
- No I cannot. It is a mirror of Google Fonts. You must ask Google. Sorry. --Zhuyifei1999 (talk) 18:49, 5 February 2020 (UTC)
Latest tech news from the Wikimedia technical community. Please tell other users about these changes. Not all changes will affect you. Translations are available.
Recent changes
- There is a new version of the Wikimedia Commons app for Android. It should fix the failed uploads problem. [6]
Problems
- There was a problem with the new MediaWiki version last week. It deleted some messages by accident. The new version was late because it was stopped to fix things. [7]
Changes later this week
- The MediaWiki action API is used by various tools like bots and gadgets. Some error codes will change. Some parameter values that do not follow the standard will no longer work. [8]
- The new version of MediaWiki will be on test wikis and MediaWiki.org from 11 February. It will be on non-Wikipedia wikis and some Wikipedias from 12 February. It will be on all wikis from 13 February (calendar).
19:12, 10 February 2020 (UTC)
Regarding FlickreviewR 2
I would be grateful if you could program FlickreviewR 2 to accept images from the Flickr account "myfwcmedia/Florida Fish and Wildlife", which are public domain images licensed under Template:PD-FLGov. It seems that they have mistakenly tagged these images as CC-BY-NC-ND-2.0, which has resulted in wrong reviews of some of my uploads (e.g. this). Many thanks.廣九直通車 (talk) 10:53, 10 February 2020 (UTC)
- The bot doesn't run exceptions, just acts according to what is shown on the flickr page; if I added all those exceptions requested so far that would be quite a few. You'd better either contact them or not add flickr review at all if it could be confirmed that the flickr stream is fully PD --Zhuyifei1999 (talk) 18:36, 10 February 2020 (UTC)
- OK, but why can the bot identify some US federal government images that carry the Public Domain Mark, such as this? Regards.廣九直通車 (talk) 05:13, 11 February 2020 (UTC)
- The image is marked with PDM on flickr. See Commons:Requests_for_comment/Flickr_and_PD_images --Zhuyifei1999 (talk) 05:14, 11 February 2020 (UTC)
- OK, thanks for your explanation.廣九直通車 (talk) 06:08, 12 February 2020 (UTC)
Bot false positive (about the signature)
Hello! I have a question about the bot. See [9] for details. Maybe, I haven't formatted the message correctly? ParticipantOfTheEncyclopedia (talk) 15:18, 13 February 2020 (UTC)
- It thinks Special:Diff/393957059 is a comment alone --Zhuyifei1999 (talk) 17:03, 13 February 2020 (UTC)
Latest tech news from the Wikimedia technical community. Please tell other users about these changes. Not all changes will affect you. Translations are available.
Recent changes
- Pages on Wikidata and Commons now load faster. You can read more about page load performance. [10][11]
Changes later this week
- The new version of MediaWiki will be on test wikis and MediaWiki.org from 18 February. It will be on non-Wikipedia wikis and some Wikipedias from 19 February. It will be on all wikis from 20 February (calendar).
16:18, 17 February 2020 (UTC)
Add archiving functionality to FlickreviewR 2
Zhuyifei1999 and Eatcha, would it be possible to teach FlickreviewR 2 to archive Flickr images so we have a record of the license? Photographers often attempt to rescind a license and we have no lasting proof that at one time a proper license existed. For legal reasons, it would be great if FlickreviewR 2 used http://web.archive.org/save/ and kept a permanent copy of the permission for historical record.
I noticed that YouTubeReviewBot triggers the WayBack machine. Would it be possible to add the same functionality to FlickreviewR 2? --- C&C (Coffeeandcrumbs) 17:26, 16 February 2020 (UTC)
- I would like to know how many images it reviews in a year; more than 500,000? Would that be too much load for Flickr? I use https://github.com/pastpages/savepagenow for archiving, but it's a little buggy. See issue #16, though that is not a problem if you can access the site/API directly, unlike with YouTube. -- Eatcha (talk) 18:45, 16 February 2020 (UTC)
- I have no idea how many reviews it does. Would you like to create a patch? source code --Zhuyifei1999 (talk) 19:13, 16 February 2020 (UTC)
- @Eatcha: Assuming Thursday, 13 February, was a typical day, it averages about 2 million reviews a year or more.--- C&C (Coffeeandcrumbs) 23:29, 16 February 2020 (UTC)
- @Eatcha: Another option is to first check https://web.archive.org/web/2/https://www.flickr.com/photos/... to see if an archive already exists and reduce the load. --- C&C (Coffeeandcrumbs) 23:36, 16 February 2020 (UTC)
- According to Similar Web and Sitechecker, Flickr receives ~70 million visits per month. I think the website can handle another 2 million per year with no problem. --- C&C (Coffeeandcrumbs) 23:47, 16 February 2020 (UTC)
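The load estimate in this exchange can be checked with a few lines of arithmetic. This is only a rough sketch using the two figures quoted above (about 2 million reviews a year, about 70 million Flickr visits a month); treating one review as one extra page request is an assumption, not something the discussion states.

```python
# Figures quoted in the discussion; one review == one page request is an assumption.
REVIEWS_PER_YEAR = 2_000_000
FLICKR_VISITS_PER_MONTH = 70_000_000

reviews_per_day = REVIEWS_PER_YEAR / 365
visits_per_year = FLICKR_VISITS_PER_MONTH * 12
share_of_traffic = REVIEWS_PER_YEAR / visits_per_year

print(f"{reviews_per_day:.0f} extra requests per day")
print(f"{share_of_traffic:.2%} of Flickr's yearly visits")
```

Under those assumptions the extra archiving traffic is a small fraction of a percent of the site's visits, which is consistent with the "no problem" conclusion above.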
- Coffeeandcrumbs Do you want the bot to add an additional parameter |archive=Archive_URL_Here. And what about the current |sourceurl= param -- Eatcha (talk) 03:46, 17 February 2020 (UTC)
- And I check for older archives before forcing a newer one. -- Eatcha (talk) 03:55, 17 February 2020 (UTC)
- @Eatcha: I think "(archive)" in addition to the existing URL is best. But I have no real preferences on the matter. As long as the archive is created, it should be enough. If you can add an archive link param in {{FlickreviewR}}, that would be great. --- C&C (Coffeeandcrumbs) 04:05, 17 February 2020 (UTC)
- Okay, I will add the archive parameter in addition to the source parameter. If for some reason the archiving fails (extremely rare, but possible), it will add |archive=False. -- Eatcha (talk) 04:33, 17 February 2020 (UTC)
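The "check for older archives before forcing a newer one" idea from this exchange might look roughly like the sketch below. It is not the bot's actual code: `find_existing` and `force_new` are hypothetical callables standing in for the HTTP lookup and the Save Page Now request, and the only detail taken from the discussion is the `https://web.archive.org/web/2/` lookup prefix.

```python
from typing import Callable, Optional

# Lookup prefix quoted in the discussion above.
WAYBACK_PREFIX = "https://web.archive.org/web/2/"


def lookup_url(source_url: str) -> str:
    """Build the Wayback Machine lookup URL for a source page."""
    return WAYBACK_PREFIX + source_url


def archive_or_reuse(
    source_url: str,
    find_existing: Callable[[str], Optional[str]],
    force_new: Callable[[str], str],
) -> str:
    """Reuse an existing snapshot if one is found, otherwise force a new capture.

    find_existing (hypothetical) receives the lookup URL and returns a
    snapshot URL or None; force_new (hypothetical) captures the live page
    and returns the new snapshot URL.
    """
    existing = find_existing(lookup_url(source_url))
    if existing is not None:
        return existing
    return force_new(source_url)
```

Reusing an old snapshot saves a request, but the snapshot may predate a license change on Flickr, so whether reuse is acceptable depends on how the review decision is tied to the archived copy.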
- ┌─────────────────────┘
- https://github.com/eatcha-wikimedia/FlickrReviewR -- Eatcha (talk) 05:46, 17 February 2020 (UTC)
- The archiving doesn't check the license, so in theory the page could be archived with non-free license after it was changed to free license and get reviewed. Is this an issue? --Zhuyifei1999 (talk) 07:16, 17 February 2020 (UTC)
- Thinking about it, I don't even want the remote possibility that someone could launch this attack against the bot: 1. upload the image to flickr and set free license 2. transfer to commons 3. bot starts review, sees free 4. set non-free 5. bot tells archive.org to archive the page 6. archive.org sees non free 7. adversary holds me liable for the bot's "malfunction".
- So, is it possible to only pass after archive.org has an archive of the page with a free license? --Zhuyifei1999 (talk) 08:40, 17 February 2020 (UTC)
- I can only think of two ways to fix this:
1 Always force a new archive before adding the template (more stable, and easier to implement).
2 Read the webpage (reading a real-time archive often fails) to verify the license. Why does it fail? Maybe the Internet Archive queues requests during busy hours. -- Eatcha (talk) 09:11, 17 February 2020 (UTC)
- I don't understand what you mean by bot starts review, sees free 4. set non-free 5. bot tells archive.org to archive the page 6. archive.org sees non free. Why will it set non-free after seeing free? I made no changes to the template, just added an extra step. Maybe I don't understand what you mean by set non-free. I assume by set you mean the license that the bot adds in the review template. -- Eatcha (talk) 09:23, 17 February 2020 (UTC)
- The adversary changes the license on flickr to non-free --Zhuyifei1999 (talk) 09:56, 17 February 2020 (UTC)
- Okay, in your opinion, can option 1 stop them ? Thanks-- Eatcha (talk) 10:33, 17 February 2020 (UTC)
- Yeah, lgtm --Zhuyifei1999 (talk) 10:42, 17 February 2020 (UTC)
- Done -- Eatcha (talk) 11:56, 17 February 2020 (UTC)
- Ok, will merge tonight. Thanks --Zhuyifei1999 (talk) 19:06, 17 February 2020 (UTC)
- @Eatcha: https://github.com/eatcha-wikimedia/FlickrReviewR/search?q=archive_source&unscoped_q=archive_source I don't see where archive_source is being called. Is this expected? I'm thinking of adding it somewhere like https://github.com/eatcha-wikimedia/FlickrReviewR/blob/4981e81d1ce1aa532c2e3eae44d9bf2811d84916/flickreviewr.py#L135 --Zhuyifei1999 (talk) 05:56, 18 February 2020 (UTC)
- https://github.com/eatcha-wikimedia/FlickrReviewR/commit/83be25b325edf17f940bfbe62fe1f393af433107 -- Eatcha (talk) 06:06, 18 February 2020 (UTC)
- Alright. https://tools.wmflabs.org/yifeibot/gitweb/?p=botscripts.git;a=commitdiff;h=f3f40e05a1ce2c44473d6a2dd47463a99153197e I trimmed everything I found unnecessary. --Zhuyifei1999 (talk) 07:31, 18 February 2020 (UTC)
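The race described in this thread (the license flips between the bot's check and the archive request) goes away if the review decision is derived from the archived copy itself. The sketch below illustrates that ordering only; it is not the merged code. `force_archive`, `license_of` and the `FREE_LICENSES` set are hypothetical placeholders, and reading the license back from the snapshot goes one step beyond option 1 as agreed above.

```python
from typing import Callable, Optional

# Illustrative only; not the bot's real list of accepted licenses.
FREE_LICENSES = {"cc-by-2.0", "cc-by-sa-2.0", "cc0", "pdm"}


def review(
    source_url: str,
    force_archive: Callable[[str], Optional[str]],
    license_of: Callable[[str], str],
) -> dict:
    """Archive first, then judge the license from the archived copy.

    force_archive (hypothetical) returns the snapshot URL or None on failure;
    license_of (hypothetical) extracts the license name from a page URL.
    """
    archived_url = force_archive(source_url)
    if archived_url is None:
        # No snapshot means no lasting proof, so do not pass the file.
        return {"status": "skip", "archive": None}
    license_name = license_of(archived_url)
    status = "pass" if license_name in FREE_LICENSES else "fail"
    return {"status": status, "archive": archived_url, "license": license_name}
```

Because the verdict and the recorded proof come from the same snapshot, a later license change on the live page cannot make the two disagree.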
@~riley and Jarekt: FlickreviewR 2 now records |archive= in {{FlickreviewR}} (See example edit by bot and above discussion). Would either of you be willing and able to add this functionality to the template so it displays in the user facing output? Something like "(archive)" where the word archive links to the URL given in the parameter. --- C&C (Coffeeandcrumbs) 01:05, 19 February 2020 (UTC)
- I can add it to the template, if you spell out on the template page the exact changes you would like to make. That way, there is better record of changes and I do not have to guess what to do. Feel free to ping me when you are ready. --Jarekt (talk) 14:37, 19 February 2020 (UTC)
- Just noting that this has been done for cases where status=pass. --- C&C (Coffeeandcrumbs) 12:19, 20 February 2020 (UTC)
Would it be crazy to ask the bot to add an archive link to the old files that have already been reviewed? Of course, only where it is possible. --MGA73 (talk) 15:23, 19 February 2020 (UTC)
- If someone writes the code for it, then probably yeah. I currently have like barely any time for this :/ --Zhuyifei1999 (talk) 19:28, 19 February 2020 (UTC)
@Eatcha: There seems to be a lot of inconsistent state at archive.org. Eg: File:Gloucester Cathedral Cloister (49059814147).jpg File:President_Trump_Delivers_Remarks_on_Water_Accessibility_(49560778913).jpg File:Kham_-_Road_side.jpg, and there are a lot of traces like:
[2020-02-20 17:04:03] Exception archiving https://flickr.com/photos/41464593@N02/28920833663
Traceback (most recent call last):
  File "/data/project/yifeibot/o/toolserver/bryan/flickr/bots/flickreviewr.py", line 444, in archive_source
    archived_url = savepagenow.capture(source, user_agent='User:FlickreviewR 2 on Wikimedia Commons')
  File "/data/project/yifeibot/.local/lib/python3.5/site-packages/savepagenow/api.py", line 42, in capture
    raise WaybackRuntimeError(error_header)
savepagenow.api.WaybackRuntimeError: LiveDocumentNotAvailableException: https://flickr.com/photos/41464593@N02/28920833663: live document unavailable: java.lang.IllegalStateException: Payload size does not match content-length!
Any ideas what is going on? --Zhuyifei1999 (talk) 17:07, 20 February 2020 (UTC)
- After the exception, add a second try. Or, more appropriately, loop the try statement until you get what you want. Performance issues. -- Eatcha (talk) 17:51, 20 February 2020 (UTC)
- @Eatcha: What about the inconsistency issues linked above? How do you detect that? --Zhuyifei1999 (talk) 18:07, 20 February 2020 (UTC)
- I never encountered that, as I am required to read the webpage to get details. If the bot fails to find the license, it ignores the file. For the Flickr reviewer, we need another line of code to check the status: on a 303, return false or try again after some time. Another option is to read and search the page to check whether it's a flickr page. Can you wait before adding the mentioned checks? It could also be due to lag, and the page might already be archived. -- Eatcha (talk) 18:29, 20 February 2020 (UTC)
- It was certainly a replication lag. Try Trump's image again, it loads perfectly for me. -- Eatcha (talk) 18:35, 20 February 2020 (UTC)
- Hmm okay. Not sure about doing retries though without being sure what is actually happening on IA's side. @Cyberpower678: By any chance, you got any idea? --Zhuyifei1999 (talk) 20:04, 20 February 2020 (UTC)
- Per this, I don't see any harm in doing retries. If IA responds, then good but what harm can retrying do ? -- Eatcha (talk) 07:19, 21 February 2020 (UTC)
- I'll wait for cp678. This isn't urgent --Zhuyifei1999 (talk) 07:34, 21 February 2020 (UTC)
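The retry suggestion in this thread could be sketched as follows. This is a minimal illustration, not the bot's code: `capture` is a hypothetical callable standing in for whatever performs the archive request, and the bounded attempt count with a doubling delay is a design choice made here instead of the open-ended "loop until it works" mentioned above, so that a persistent failure on the archive's side cannot stall the bot.

```python
import time
from typing import Callable, Optional


def capture_with_retries(
    capture: Callable[[str], str],
    source_url: str,
    attempts: int = 3,
    base_delay: float = 5.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[str]:
    """Call capture(source_url) up to `attempts` times, doubling the wait each time.

    Returns the archived URL, or None if every attempt raised.
    """
    for attempt in range(attempts):
        try:
            return capture(source_url)
        except Exception as exc:  # a real bot would catch the library's specific error
            print("attempt %d failed: %s" % (attempt + 1, exc))
            if attempt + 1 < attempts:
                sleep(base_delay * 2 ** attempt)
    return None
```

A `None` result maps naturally onto the `|archive=False` fallback mentioned earlier in the section. Retrying does not address the inconsistent-snapshot question raised above, which would still need a separate check.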