Commons:Bots/Requests/YarluFileBot
< Commons:Bots | Requests
YarluFileBot (talk · contribs)
Bot's tasks for which permission is being sought: upload images
Automatic or manually assisted: automatic
Edit type (e.g. Continuous, daily, one time run): intermittent
Maximum edit rate (eg edits per minute): 1-6
Bot flag requested: (Y/N): Y
Programming language(s): Java (custom soft.)
Yarl ✉ 19:56, 16 January 2012 (UTC)
Discussion
- This bot will upload CC-BY-SA, self-made images from website http://fotopolska.eu/. Example upload: File:Namysłów, Linia kolejowa nr 143 - 179492 - fotopolska.eu.jpg. Yarl ✉ 19:56, 16 January 2012 (UTC)
- Please add {{Licensereview}}. If you're going to upload a lot of images you might want to create a custom one like {{Flickrreview}}. Multichill (talk) 21:15, 16 January 2012 (UTC)
- Agree with Multichill about {{Licensereview}}. Other problem I see is the watermark, if they are frequent, can you crop them prior to upload? Are any images goecodded, and is so is there any way to capture that information? Can you upload few dozen of other images so there are few more examples. --Jarekt (talk) 03:15, 17 January 2012 (UTC)
- Sure, I'll add {{Licensereview}}. Watermark is on every image and I don't want to crop them, because I upload them directly from Fotopolska server to Commons server. Moreover I think sometimes it's better to photoshop watermark instead crop image. I'll try to upload some images today. Yarl ✉ 12:02, 17 January 2012 (UTC)
- If all images have watermark than {{Watermark}} template should be also added. It does not have to be done at the upload time but it might be good to have a plan to remove those watermarks eventually, as not to add to 11k backlog of images with watermarks we have at the moment. --Jarekt (talk) 12:50, 17 January 2012 (UTC)
- Exactly, maybe some actions on wikiproject in Polish Wikipedia? Yarl ✉ 15:18, 17 January 2012 (UTC)
- I've uploaded some images, see Special:Contributions/YarluFileBot. Yarl ✉ 15:18, 17 January 2012 (UTC)
- City name is present in file title. Is it possible to extract (or copy) it to add category? --EugeneZelenko (talk) 15:44, 17 January 2012 (UTC)
- Sure, I just wanted to avoid too crowded cities' main categories. So, maybe I'll add city category and Category:Images from Fotopolska needing category review. This should be better? Yarl ✉ 16:05, 17 January 2012 (UTC)
- I think the bot should add categories and get as close as it can, but it is better to add city main category than not add categories at all. Such categories often are used as "not yet categorized images from ..." anyway. However there is a bigger problem , see for example File:Wrocław - fotopolska.eu (58087).jpg. The source provides location of the image as "Polska / woj. dolnośląskie / Wrocław / Muchobór Mały / ul. Hiszpańska 12 / Hiszpańska 12" and coordinates as {{location dec|51.10798485657522|16.961064040660858}} while the only description copied to Commons was "Dom nr 12" ("House #12"). The bot needs to be able to capture the whole description and coordinates. --Jarekt (talk) 18:57, 17 January 2012 (UTC)
- I don't have full access to database, only output from script made by Fotopolska administrator. However I'll try to talk to him and fill gaps in description. Yarl ✉ 19:14, 17 January 2012 (UTC)
- You might be able to scrape them from the HTML of the page. For example closer inspection of the source code of the source of File:Lądek-Zdrój - fotopolska.eu (57918).jpg reveals that there are coordinates in the code : "... window.open('/Mapa.php?lat=(50.34610399248642, 16.871690154075622)&zoom=17&maptype=hybrid' ..." which can be copied to {{object location dec|50.34610399248642|16.871690154075622}} and added to the image. The addresses of the buildings can be harvested in a similar way and possibly added to {{Building address}} and added to image description. Hopefully that can be done with the blessing of the site administrators. --Jarekt (talk) 19:28, 17 January 2012 (UTC)
- Any updates on this? Also, please apply the {{Watermark}} template to the files you have already uploaded. --99of9 (talk) 13:33, 23 May 2012 (UTC)
- It's temporary suspended because of tough contact with fotopolska admin. Yarl ✉ 18:00, 31 May 2012 (UTC)
- After discussing this with Yarl, this request is being archived for the time being, and the request can be reactivated in the future when required. russavia (talk) 13:40, 25 September 2012 (UTC)
- I finally found some time to write custom software for this import, instead of fotopolska API and upload.py. I used web scraping, so there are all imformation in file desc. Please take a look at new uploads. Yarl ✉ 21:13, 10 October 2012 (UTC)
- I checked new uploads and they look quite good. Few small things: add "other_fields_1 = " in front of {{Building address}} (see example). Current version does not work correctly for all browsers (so I was told), because HTML table format has some issues. I also added parameters "Gmina" and "Powiat" to {{Building address}}, that can be used if Country code is PL. You might be able to use them. Otherwise it looks good to me. --Jarekt (talk) 02:28, 11 October 2012 (UTC)
- Looks good Yarl, if you will incorporate suggestions by Jarekt, and do another small run, I think we can approve this almost straight away for you. russavia (talk) 10:14, 11 October 2012 (UTC)
- Done, I also fixed all desc. in previous files. I noticed, that some images, eg. this don't have watermark, but it is rather exception. Yarl ✉ 22:09, 12 October 2012 (UTC)
- I look through the new uploads and I can not think of any improvements. --Jarekt (talk) 02:53, 13 October 2012 (UTC)
- Done, I also fixed all desc. in previous files. I noticed, that some images, eg. this don't have watermark, but it is rather exception. Yarl ✉ 22:09, 12 October 2012 (UTC)
Time to approve? --Jarekt (talk) 18:45, 15 October 2012 (UTC)
- Approved russavia (talk) 00:31, 17 October 2012 (UTC)
- Thanks. Yarl ✉ 11:44, 17 October 2012 (UTC)