User talk:Zhuyifei1999/Archive 42

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

Another way to circumvent the bot

Could you look at Special:Undelete/File:Bnelite.pdf, Special:Undelete/File:Untitledhassn.pdf and Special:Undelete/File:Unhahatitled.pdf? These PDFs contain embedded data, most likely Flash videos. Strangely, the bot has not found anything in the first two. --jdx Re: 06:26, 10 October 2018 (UTC)

(venv) zhuyifei1999@zhuyifei1999-ThinkPad-X260:~/embeddeddata$ time python2 -c 'print(__import__("detection").detect("testfiles/Bnelite.pdf"))'
Traceback (most recent call last):
  File "detection/middleware/__init__.py", line 42, in detect
    for item in middleware(f) or []:
  File "detection/middleware/pdfminer.py", line 48, in pdfminer_EmbeddedFile
    caching=False  # we can run OOM if we cache all the files
  File "/home/zhuyifei1999/embeddeddata/venv/lib/python2.7/site-packages/pdfminer/pdfdocument.py", line 326, in __init__
    self._initialize_password(password)
  File "/home/zhuyifei1999/embeddeddata/venv/lib/python2.7/site-packages/pdfminer/pdfdocument.py", line 348, in _initialize_password
    raise PDFEncryptionError('Unknown algorithm: param=%r' % param)
PDFEncryptionError: Unknown algorithm: param={'CF': {'StdCF': {'Length': 16, 'CFM': /AESV2, 'AuthEvent': /DocOpen}}, 'O': '\xe3\x16\x12\x03\xf1:X\xf0\x07\x1d=Hq\x9e\x9b\xbd$\xf5x\xe1q\xf8\xb2\xa1Tr\xb2\xcah\xee\x7f\xff', 'Filter': /Standard, 'P': -1052, 'Length': 128, 'R': 4, 'U': 'y\xf4@M\xe3\xe399Y\xe5\x92uP\x03\x08\x88\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00\x00', 'V': 4, 'StmF': /StdCF, 'StrF': /StdCF}
[]

real	0m27.014s
user	0m23.567s
sys	0m3.213s
Is this PDF encrypted? --Zhuyifei1999 (talk) 17:25, 10 October 2018 (UTC)
Probably. The metadata says: Encrypted: yes (print:yes copy:no change:no addNotes:yes algorithm:AES) --jdx Re: 18:15, 10 October 2018 (UTC)
Maybe we should ban encrypted PDFs altogether? --Zhuyifei1999 (talk) 19:23, 10 October 2018 (UTC)
(talk page stalker) but I will still be allowed to upload my audio and video with DRM, right? - Alexis Jazz ping plz 19:38, 10 October 2018 (UTC)
They should get tagged by the pdfminer middleware --Zhuyifei1999 (talk) 22:56, 10 October 2018 (UTC)
There is no flood of such files, so let's see what future brings. Even if not tagged by the bot, these files are rather easy to catch thanks to Abuse Filter's rule #166 because of their unusually big size. IMO easier way would be to set up an AF's rule which would just reject huge PDFs (let's say bigger than 200 MiB) uploaded from fresh accounts (let's say 2 hours old). BTW, another one: Special:Undelete/File:Untitled,j.pdf. This one has been tagged by the bot. --jdx Re: 05:53, 11 October 2018 (UTC)

22:40, 15 October 2018 (UTC)

Der Bot hat deswegen das Size auf der Flickr-Quelle nicht gefunden, erstens habe ich das Bild gedreht und zweitens etwas weggeschnitten damit die Fresken genau mittig sind.--Ricardalovesmonuments (talk) 07:50, 16 October 2018 (UTC)

I don't speak German (and Google Translation doesn't seem to make sense). The file looks ARR on Flickr. @Steinsplitter: Could you explain to them? --Zhuyifei1999 (talk) 14:29, 16 October 2018 (UTC)
@Ricardalovesmonuments: Alle Rechte vorbehalten ist eine für Commons unzulässige Lizenz, bitte COM:L lesen. Die Lizenz muss auf Flickr angepasst werden oder wir müssen die Datei löschen. --Steinsplitter (talk) 14:52, 16 October 2018 (UTC)
Kannst du denn nicht für mich die rechtliche Anpassung übernehmen, ich kenn mich da nicht aus. Mein Vorgehen war deshalb so weil das so ausreicht (dachte ich so): erstens ist das Bild frei herunterladbar und zweitens habe ich ja die Seite als Quelle verlinkt.--Ricardalovesmonuments (talk) 15:05, 16 October 2018 (UTC)
@Ricardalovesmonuments: Kann nur der Urheber, ist das Foto von dir? Dan passe bitte die Lizenz auf Flickr direkt an (Habe kein flickr, kann dir daher nicht sagen wo genau). Danke. --Steinsplitter (talk) 16:37, 16 October 2018 (UTC)

Nein ich habe das Bild nicht auf Flickr veröffentlicht, sondern wie @Zhuyifei1999 erwähnte Theuerkauf.--Ricardalovesmonuments (talk) 16:55, 16 October 2018 (UTC)

In dem Fall kann nur Herr Theuerkauf selbst eine freie Lizenz vergeben, und du hättest das hier gar nicht erst hochladen dürfen – gedreht oder nicht. "Frei herunterladbar" ohne ausdrückliche Lizenz heisst eben nicht, dass alle damit machen dürfen, was sie wollen, sondern es erlaubt höchstens den Privatgebrauch solcher Bilder. Ich habe die Datei hier daher gelöscht. @Zhuyifei1999: I deleted the file as a copyvio. De728631 (talk) 21:56, 16 October 2018 (UTC)

:(

This "bot" is silly! I'm using a formatted signature. Still, it's harassing me not to sign it. Repair someone's algorithm please!

Double sign:

--WikiHuns vita 23:49, 18 October 2018 (UTC) --WikiHuns vita 23:49, 18 October 2018 (UTC) — Preceding unsigned comment added by WikiHuns (talk • contribs) 23:49, 18 October 2018 (UTC) This is a formatted signature nice robot: --WikiHuns vita 23:52, 18 October 2018 (UTC) — Preceding unsigned comment added by WikiHuns (talk • contribs) 23:52, 18 October 2018 (UTC)

Red link --Zhuyifei1999 (talk) 00:19, 19 October 2018 (UTC)
(talk page stalker) @WikiHuns: our namespaces are in English, not Hungarian. - Alexis Jazz ping plz 12:00, 21 October 2018 (UTC)

23:11, 22 October 2018 (UTC)

20:08, 29 October 2018 (UTC)

Page Module talk:User:Zhuyifei1999/sandbox is in the Category:Pages with script errors. Occasionally I try to clean up that category (as most entries have something to do with my edits to Lua Modules) and I always check that page. I have no idea why it is a Pages with script errors, do you? Maybe we can either fix the errors or remove something so it does not show. --Jarekt (talk) 12:56, 1 November 2018 (UTC)

Looking at the JS mw.config.get('ScribuntoErrors') it shows <p>Script error: The function \"ISOdate\" does not exist.</p><p>No further details are available.</p>. No idea why the red "script error" message doesn't show like usual. I blanked the page. --Zhuyifei1999 (talk) 14:47, 1 November 2018 (UTC)

Structured Data - IRC office hour today, 1 November

There will be an IRC office hour for Structured Data on Commons today, 1 October 2018, from 17:00-18:00 UTC in #wikimedia-office. You can find date/time conversion, as well as a link to join the chat in your browser if needed, on the IRC Office hours page on Meta. I realize this may be short notice for some people; I am experimenting with advanced notice times to see what works best for the most people, I'll be giving more warning before the next office hour. I look forward to seeing you there. -- Keegan (WMF) (talk) 16:02, 1 November 2018 (UTC)

Structured Data - IRC office hour today, 1 November

The above message says 1 October in the body when it should say 1 November, as the subject line says. Apologies for making a new section by mass message, it's the only way to get this out quickly. See you in twenty minutes! -- Keegan (WMF) (talk) 16:37, 1 November 2018 (UTC)

Structured Data - copyright and licensing statements

I've posted a second round of designs for modeling copyright and licensing in structured data. These redesigns are based off the feedback received in the first round of designs, and the development team is looking for more discussion. These designs are extremely important for the Commons community to review, as they deal with how copyright and licensing is translated from templates into structured form. I look forward to seeing you over there. -- Keegan (WMF) (talk) 16:25, 2 November 2018 (UTC)

17:29, 5 November 2018 (UTC)

19:21, 12 November 2018 (UTC)

SignBot

https://commons.wikimedia.org/w/index.php?title=User_talk:Guanaco&diff=324841859&oldid=323520210

Should have been signed I think. - Alexis Jazz ping plz 13:28, 21 October 2018 (UTC)

I think the bot just died. Restarted --Zhuyifei1999 (talk) 16:23, 21 October 2018 (UTC)
https://commons.wikimedia.org/w/index.php?title=User_talk:Vermont&diff=325044125&oldid=325044109
Double signature.. Not so practical when dealing with vandals. - Alexis Jazz ping plz 22:14, 22 October 2018 (UTC)
Hmm. Shall I just ignore all those with an 'Undo' tag? --Zhuyifei1999 (talk) 23:02, 22 October 2018 (UTC)
I see no downside. - Alexis Jazz ping plz 20:43, 23 October 2018 (UTC)
Weird, eventstreams data don't seem to include tags:
{u'bot': False,
 u'comment': u'Undo revision 325142475 by [[Special:Contributions/Zhuyifei1999|Zhuyifei1999]] ([[User talk:Zhuyifei1999|talk]])',
 u'id': 1150800646,
 u'length': {u'new': 59, u'old': 54},
 u'meta': {u'domain': u'commons.wikimedia.org',
           u'dt': u'2018-10-23T21:25:39+00:00',
           u'id': u'309c743a-d70a-11e8-84f4-141877613bad',
           u'offset': 1150685733,
           u'partition': 0,
           u'request_id': u'40cfe01e-38fa-4854-8d96-b354506b4a71',
           u'schema_uri': u'mediawiki/recentchange/2',
           u'topic': u'eqiad.mediawiki.recentchange',
           u'uri': u'https://commons.wikimedia.org/wiki/Commons:Sandbox'},
 u'minor': True,
 u'namespace': 4,
 u'parsedcomment': u'Undo revision 325142475 by <a href="/wiki/Special:Contributions/Zhuyifei1999" title="Special:Contributions/Zhuyifei1999">Zhuyifei1999</a> (<a href="/wiki/User_talk:Zhuyifei1999" title="User talk:Zhuyifei1999">talk</a>)',
 u'patrolled': True,
 u'revision': {u'new': 325142490, u'old': 325142475},
 u'server_name': u'commons.wikimedia.org',
 u'server_script_path': u'/w',
 u'server_url': u'https://commons.wikimedia.org',
 u'timestamp': 1540329939,
 u'title': u'Commons:Sandbox',
 u'type': u'edit',
 u'user': u'Zhuyifei1999',
 u'wiki': u'commonswiki'}
I'll find to find an API that shows the tags. --Zhuyifei1999 (talk) 21:35, 23 October 2018 (UTC)
>>> list(pywikibot.Page(pywikibot.Site(), 'Commons:Sandbox').revisions(total=10))[1]
{'comment': u'Undo revision 325142475 by [[Special:Contributions/Zhuyifei1999|Zhuyifei1999]] ([[User talk:Zhuyifei1999|talk]])', '_sha1': u'97e94487c0452b48055cfc5e035ba11a9db35b2d', 'text': None, 'revid': 325142490, 'anon': False, 'user': u'Zhuyifei1999', 'rollbacktoken': None, 'timestamp': Timestamp(2018, 10, 23, 21, 25, 39), '_content_model': u'wikitext', '_parent_id': 325142475, 'minor': True}
:( --Zhuyifei1999 (talk) 23:01, 23 October 2018 (UTC)
You could go by the edit comment, it's not perfect I know.. - Alexis Jazz ping plz 23:07, 23 October 2018 (UTC)
I got this:
>>> def get_tags(page, revision):
...     req = page.site._simple_request(
...         action='query',
...         prop='revisions',
...         titles=page,
...         rvprop='tags',
...         rvstartid=revision.revid,
...         rvendid=revision.revid,
...         rvlimit=1
...     )
...     try:
...         res = req.submit()
...     except Exception as e:
...         pywikibot.exception(e)
...         return []
...     else:
...         p = res['query']['pages']
...         if not p:
...             return []
...         r = p[p.keys()[0]]['revisions']
...         if not r:
...             return []
...         return r[0]['tags']
... 
>>> p = pywikibot.Page(pywikibot.Site(), 'Commons:Sandbox')
>>> r = list(p.revisions(total=10))[1]
>>> p, r
(Page(Commons:Sandbox), {'comment': u'Undo revision 325142475 by [[Special:Contributions/Zhuyifei1999|Zhuyifei1999]] ([[User talk:Zhuyifei1999|talk]])', '_sha1': u'97e94487c0452b48055cfc5e035ba11a9db35b2d', 'text': None, 'revid': 325142490, 'anon': False, 'user': u'Zhuyifei1999', 'rollbacktoken': None, 'timestamp': Timestamp(2018, 10, 23, 21, 25, 39), '_content_model': u'wikitext', '_parent_id': 325142475, 'minor': True})
>>> get_tags(p, r)
[u'mw-undo']
Gotta call the MediaWiki API manually. It sucks but should work reliably. --Zhuyifei1999 (talk) 23:19, 23 October 2018 (UTC)

✓ Done [16] seems to work. Please tell if not. --Zhuyifei1999 (talk) 23:36, 23 October 2018 (UTC)

Thanks. Atm I guess SignBot dozed off again?[17] - Alexis Jazz ping plz 09:13, 28 October 2018 (UTC)
Restarted. The self-restarting code was triggered, but idk what happened later --Zhuyifei1999 (talk) 14:20, 28 October 2018 (UTC)
Could edits like [18] also be excluded? - Alexis Jazz ping plz 07:01, 8 November 2018 (UTC)
What is the criteria you are proposing? --Zhuyifei1999 (talk) 07:37, 8 November 2018 (UTC)
"Nominating for deletion" summary. - Alexis Jazz ping plz 08:26, 8 November 2018 (UTC)
I don't think that is a good idea, no. What if someone edits with a summary of something like 'Nominating for deletion is what they did'? What if the gadget changes to use a different summary? What if another exception should be added like 'Taging for speedy deletion'?
I'm not inclined to code all the minor exceptions into the bot, unless they happen a lot, or is very generic. --Zhuyifei1999 (talk) 15:45, 8 November 2018 (UTC)
What is a lot? I think 0.75% of all SignBot edits is quite a bit. [19] [20] [21] [22] [23] [24] [25] [26] [27] [28] [29] [30] [31] [32] [33] [34] [35] [36] [37] [38] [39] [40] [41] [42] [43] [44] [45] [46] [47]
A few are not vandalism, but none of them should have been signed. - Alexis Jazz ping plz 20:52, 8 November 2018 (UTC)
User:SignBot/exclude_regex exists. I'm not going to use edit comments. --Zhuyifei1999 (talk) 23:33, 8 November 2018 (UTC)
Thanks, I forgot about that. Added. - Alexis Jazz ping plz 00:32, 9 November 2018 (UTC)
https://commons.wikimedia.org/w/index.php?title=User_talk:Jcb&diff=prev&oldid=327410596
Not sure why that wasn't signed. - Alexis Jazz ping plz 01:01, 12 November 2018 (UTC)
[[commons:User talk:Jcb]]: Handling
@@ -185,0 +186,4 @@ ***
+  ***
+ == Seeing a wrong thing  == ***
+  ***
+ [[User:MaterialScientist|MaterialScientist]] blocked user [[User:CottoHottoLotto|CottoHottoLotto]] indefinitely just for 1 edit that doesn't involve swear words, but might be inappropriate or vandalism but this user has only made 1 edit. If you find the block wrong or is wrong, please unblock this user. ***

[[commons:User talk:Jcb]]: Waiting
[[commons:Commons:Deletion requests/File:Arthur Schnitzler (1862–1931). Sein letztes Porträt von Wilhelm Victor Krausz (1878–1959).jpg]]: Handling
[[commons:Commons:Deletion requests/File:Arthur Schnitzler (1862–1931). Sein letztes Porträt von Wilhelm Victor Krausz (1878–1959).jpg]]: [[commons:User:Walter Anton]] opted-out
[[commons:User talk:Jcb]]: Handling
[[commons:Commons:Copyright rules by territory/Tanzania-Tanganyika]]: Handling
@@ -189 +189 @@ ***
- [[User:MaterialScientist|MaterialScientist]] blocked user [[User:CottoHottoLotto|CottoHottoLotto]] indefinitely just for 1 edit that doesn't involve swear words, but might be inappropriate or vandalism but this user has only made 1 edit. If you find the block wrong or is wrong, please unblock this user. ***
+ [[User:Materialscientist|Materialscientist]] blocked user [[User:CottoHottoLotto|CottoHottoLotto]] indefinitely just for 1 edit that doesn't involve swear words, but might be inappropriate or vandalism but this user has only made 1 edit. If you find the block wrong or is wrong, please unblock this user. ***

[[commons:User talk:Jcb]]: No inserts
[[commons:Commons:Copyright rules by territory/Tanzania-Tanganyika]]: Not a discussion
[[commons:User talk:Jcb]]: Line no longer found, probably signed
--Zhuyifei1999 (talk) 02:50, 12 November 2018 (UTC)
Something strange happened here when Super Wang moved his comment. I'm not sure how to fix it. Maybe an exclusion on User:SignBot/exclude regex for "\[\[User.*20[1-9][0-9] (UTC)$", would that work? (is there a variable for the username?) - Alexis Jazz ping plz 00:33, 15 November 2018 (UTC)
The bot thinks it's your comment, rather than his, being moved. Sigh dunno how to fix this one. And I'm against adding signature regexes to exclude regex --Zhuyifei1999 (talk) 03:38, 15 November 2018 (UTC)

23:28, 19 November 2018 (UTC)