Commons:Bots/Requests/Smallbot 8

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

Smallbot (talk · contribs)

Operator: Smallman12q (talk · contributions · Statistics · Recent activity · block log · User rights log · uploads · Global account information)

Bot's tasks for which permission is being sought: To upload files related to Commons:Batch uploading/ECGPedia The files that will be uploaded can be found at User:Smallbot/source/Cardionetworks

This is based on OTRS Ticket#2011102310008874 . The ticket is a year old.

Related discussions:

Automatic or manually assisted:automatic

Edit type (e.g. Continuous, daily, one time run): One time run...though there may be future uploads

Maximum edit rate (eg edits per minute): depends on upload speed (10)

Bot flag requested: (Y/N): N

Programming language(s): VBScript (Javascript, XMLHTTP, MSHTML, XMLDOM, COM).

Smallman12q (talk) 16:56, 1 December 2012 (UTC)[reply]

Discussion

A lot of the files lack a description. Also, how should author information be handled? There are a number of .swf, a format that will never be supported on commons Bugzilla26269, that could be converted to a supported video format. The .avi could be converted to a supported video format.Smallman12q (talk) 16:56, 1 December 2012 (UTC)[reply]

Please make test uploads. --EugeneZelenko (talk) 15:42, 2 December 2012 (UTC)[reply]
First, I need to verify the OTRS ticket and make the relevant template. I've asked at Commons:OTRS/Noticeboard#Cardio_Networks.Smallman12q (talk) 17:33, 2 December 2012 (UTC)[reply]
The answer from another OTRS volunteer on that page:
License template exist at Template:Cardionetworks permission that provides the relevant details. Regards -- KTC (talk) 18:14, 2 December 2012 (UTC)
Hope this helps, if not, don't hesitate to remove the {{Section resolved}} template and ask further! Trijnsteltalk 18:57, 2 December 2012 (UTC)[reply]
Thanks. Didn't know the template was already made. Will do upload later this week.Smallman12q (talk) 20:39, 2 December 2012 (UTC)[reply]
Wonderful. Will these images all be put together in a single category? Do you need anything further from my end? James Heilman, MD (talk) 18:34, 13 December 2012 (UTC)[reply]
I don't see the test uploads yet. Also can you supply links to a few swf files? If they are either static or videos they could be converted as well. Do you have a solution for converting the avi videos yet? --Dschwen (talk) 00:46, 16 January 2013 (UTC)[reply]

Working-I'm going to do this upload this week..was busy with Commons:Gerald R. Ford Presidential Library and Museum but they seem to be stuck in a bureaucratic quagmire. The .swf files have an equivalent .avi file (my guess is the .swf is just a flash wrapper for those who can't play .avi but have flash). The .avi will be converted to the webm format(which the commons now supports) before upload. I am in the process of downloading the files... probably around 10GB + for each wiki. I've written a script that will download all files, the file history table for each file, and text of the file for a wiki so this may be handy for future uploads. A lot of the files lack any text so I'm not sure what to put for these. Any thoughts are welcome.Smallman12q (talk) 00:41, 28 January 2013 (UTC)[reply]

What are some examples of fills missing text? James Heilman, MD (talk) 01:02, 28 January 2013 (UTC)[reply]
There are quite a few files missing text. Have a look at User:Smallbot/source/Cardionetworks/Example to see how the uploads will look (including those missing text). Let me know if it looks okay.Smallman12q (talk) 17:18, 29 January 2013 (UTC)[reply]
Yes that is okay. We will need to go through and add this after the fact. James Heilman, MD (talk) 02:01, 31 January 2013 (UTC)[reply]
A few peculiarities
  • Afib_ecg.jpg (and a few more) == Description == inside the description value in the Information template.
  • 2072.jpg this description is not useful (maybe tag everything with a description of less then 10 chars as needing description)
  • Course.jpg this is most likely a copyright violation (stock image, compare to this image or do google image search or tineye.
That last point makes me a bit uneasy about the whole task. --Dschwen (talk) 21:39, 31 January 2013 (UTC)[reply]
I've uploaded ECGpedia at Category:Media from CardioNetworks ECGpedia. I fixed the description issue I believe. I will go back and add {{Description missing}} to descriptions with less than 10 characters. I've requested speedy-delete for File:Course (CardioNetworks ECGpedia).jpg I'll do uploads of the other pediass tmrw.Smallman12q (talk) 03:49, 1 February 2013 (UTC)[reply]
Sure so 2) the HR = 32 means that the heart rate is 32, standard shorthand and useful 3) I have sent a note to the up loader there and we will figure out who has copied from who. Does look like it is from here though [1]. The point is the ECGs though and not the clip art. James (log in appears broken)
Ok, so it was probably a fluke. I suggest you go ahead and do a few more test uploads. We should be able to resolve this quickly if you have time. --Dschwen (talk) 02:49, 2 February 2013 (UTC)[reply]
@Dschwen Resolve what exactly? I'll do echopedia.org next.Smallman12q (talk) 19:32, 2 February 2013 (UTC)[reply]
This bot request. --Dschwen (talk) 15:23, 3 February 2013 (UTC)[reply]

For Echopedia, a lot of the files have no description, I will see if I can get some from the template on case files pages.Smallman12q (talk) 02:32, 5 February 2013 (UTC)[reply]

I have had a look at both ECGpedia and Echopedia and am sceptical as to the usefulness of this bot approach. What it does on the Commons end (i.e. after spidering the source side) is (1) upload the files with (2) information on source, author and licensing, (3) add maintenance categories. For most of the files, this leaves Commons users with the tasks of (4) providing descriptions, (5) providing content categories and (6) renaming the files to Commons standards. That is a lot to ask for, and so I would personally favour an approach more akin to Flickr2Commons, in which individual Commons users import selected materials, and thus take responsibility for them here. Another thing I noticed is that the image pages over there link to "description pages" on Commons, which never exist because the files have the " (CardioNetworks ECGpedia)" suffix here. Has that anything to do with their collaboration with us or is this simply an error in their MediaWiki configuration? -- Daniel Mietchen - WiR/OS (talk) 22:38, 8 February 2013 (UTC)[reply]
I tend to agree with Daniel here. There hasn't been a lot of discussion whether this stuff should be imported. I can see that some of the files can provide educational value, but this is largely dependent on good descriptions. Is there an anticipated use for those file on Wikimedia projects? Ie. Wikiversity cardiology? ;-) --Dschwen (talk) 23:01, 8 February 2013 (UTC)[reply]
An obvious use case would be WikiProject Medicine but the usefulness certainly depends on the quality of the descriptions and discoverability by way of categories (or perhaps file names). -- Daniel Mietchen - WiR/OS (talk) 23:05, 8 February 2013 (UTC)[reply]
For the stuff already imported, I suggest adding specific (rather than things like {{Description missing}}) maintenance categories that address the points 4-6 in my comment from an hour ago, i.e. something like Category:Media from CardioNetworks ECGpedia missing descriptions (for which an alternative would be CatScan), Category:Media from CardioNetworks ECGpedia missing categories (these have not been tagged at all, by the way, since Category:Media from CardioNetworks ECGpedia had not been marked as hidden and thus counted as a content category) and Category:Media from CardioNetworks ECGpedia needing file name review. This way, people interested in helping out with ECGpedia can dive right in and do not have to spend time searching for files in need of their help. -- Daniel Mietchen - WiR/OS (talk) 23:25, 8 February 2013 (UTC)[reply]

In terms of converting multimedia files to OGG, it may be worthwhile to have a look at media.py. Adding WebM support to that is easy. -- Daniel Mietchen - WiR/OS (talk) 23:02, 8 February 2013 (UTC)[reply]

For converting to .webm, I'll be calling w:FFmpeg in a subprocess from python. Daniel Mietchen's description above is fairly accurate as my bot is spidering and then uploading whatever info it can get...which isn't that much. The links to commons from the pedia pages are non-existent (it's a bug). I could add a custom description missing template and whatnot. So far, none of the files are being used...so I'm not sure if I should upload more files for Echopedia. I will post a note to WikiProject Medicine as to whether this upload is worthwhile.Smallman12q (talk) 22:37, 14 February 2013 (UTC)[reply]

I plan to go through the ECGs, add descriptions and than add many of them to the appropriate Wikipedia article. James Heilman, MD (talk) 01:37, 15 February 2013 (UTC)[reply]

I've uploaded ECHOpedia to Category:Media from CardioNetworks ECHOpedia. I could do animated gifs for the videos, but most would fall in the 25-30 million total pixel range and currently only less those with 25 million will be rendered.Smallman12q (talk) 17:19, 18 February 2013 (UTC)[reply]
Upload videos as videos. It makes no sense to convert them into animated GIFs nowadays. --Dschwen (talk) 22:20, 18 February 2013 (UTC)[reply]
Animated
Thumbnail not animated

The videos are 1-2 second clips. Here's what a gif version would look like.Smallman12q (talk) 22:39, 18 February 2013 (UTC)[reply]

Five times the file size, conversion loss, and no apparent advantage. Why on earth would you convert to anmated GIF?! --Dschwen (talk) 23:33, 18 February 2013 (UTC)[reply]
Gifs may be easier to use in some places and can loop by default. Anyhow, it's probably not worthwhile. PCIpedia is up next...it's ~200 files.Smallman12q (talk) 01:06, 19 February 2013 (UTC)[reply]

Closing as stale since Smallman12q has retired. Feel free to reopen this if you return. --99of9 (talk) 03:47, 13 January 2014 (UTC)[reply]