Commons:Batch uploading/Ryhiner Collection

From Wikimedia Commons, the free media repository
Jump to navigation Jump to search

Maps from Ryhiner Collection[edit]

Available from www.stub.unibe.ch/stub/ryhiner/ I´ve dealing with this collection for time (see this file for a example). This collection consists in "over 16000 high resolution images: maps, town plans and topographical views from the 16th to the early 19th century". So, if this declaration can be taken in face value, there is no problem with copyright because this maps are already in Public domain and being a 2D works their digital copies are also in PD. So if this statements are correct all their collection could uploaded by a bot to commons. Their maps are avaible in high resolution using zoomify (see the exemple map in their site). Tm (talk) 13:20, 22 April 2009 (UTC)[reply]

Opinions[edit]

Looks like a great collection. Is it possible to access the source files? Did you try contacting them? Multichill (talk) 14:03, 7 June 2009 (UTC)[reply]

Sorry for the delayed answer. To aswer your first question, i don´t know if it´s possible to have online acess to their source files, and i am not very techie savy. Also i didn´t try to contact them. What is your opinion of what are the next steps to take? Tm (talk) 01:25, 15 June 2009 (UTC)[reply]

I´ve sent today an email asking for their permission to make this batch upload. I thought that asking now if their source files are avaible online in this stage would be too soon. Tm (talk) 15:10, 2 July 2009 (UTC)[reply]
Sorry about not responding sooner, looks like i forgot to watchlist this page. We're in the non tech phase. Try to contact them, see if they like it. If that turns out alright we can start the actual data retrieval and uploading part. Writing a general story about this is still on my list. I'll see if I can make a first version. Multichill (talk) 16:59, 2 July 2009 (UTC)[reply]

Just a quick update to tell that i received a automatic answer about the absence of the person contacted by my email, and i forward it to a email i received in the answer. When and if i receive a answer i´ll update this page. Tm (talk) 00:48, 3 July 2009 (UTC)[reply]

I received a aswer, and already replied to it, but i am waiting permission to republish the email or the contents of the aswer that i received. Tm (talk) 04:10, 12 July 2009 (UTC)[reply]

You can always use OTRS if you want to keep it private. Multichill (talk) 10:56, 12 July 2009 (UTC)[reply]

The question isn’t exactly about privacy, but more about building trust between the parts, after the NPG case (I fully support Dcoetzee)‎‎, with might have been heard by this people and gave them a bad impression of Wikimedia Commons and its users. I can tell, without breaking the secrecy correspondence, that the answer that I received was slightly positive to the possibility of cooperation, but the person that answered made some questions, doubts and remarks that need to be addressed, about this possible cooperation, (I gave my opinion), but requested that its answer be publish so that more people can give their input. Despite this I received an automatic answer to my second email telling that I might not receive a second email until 10 of August. Tm (talk) 07:39, 19 July 2009 (UTC)[reply]

Any update on this one? Multichill (talk) 23:13, 4 September 2009 (UTC)[reply]

Not much. I´ve received a email on 11 of August telling, that do to the holidays of the person that i´ve send the mail, the answer would be delayed but i´ve not received nothing subsequently, until now. Tm (talk) 23:43, 4 September 2009 (UTC)[reply]

I have send an email today. as i´ve only received a email on 15 of September telling me that the person i contacted had contacted the library but was still waiting an answer. In this email i asked if there is already an answer. When i receive a answer i´ll update this page. Tm (talk) 04:05, 21 November 2009 (UTC)[reply]
I´ve received an email, some days ago, from the same person that i´am contacting from the beginning, saying that still there isnt any aswer, from the library responsible for this collection, about the enquerie i made some months ago. Comments? Tm (talk) 13:11, 6 December 2009 (UTC)[reply]
I have to report that the library that keeps this collection, unfortunetly, decided to reject the request made some months ago as, according to the person i exchanged emails, this request "lacks a formal application and there is no treatment needed because the maps are already available online for the public." Tm (talk) 23:34, 14 January 2010 (UTC)[reply]
Ok. Looks like we're going to scrape their site after all. I'll have a look at it. Multichill (talk) 23:51, 14 January 2010 (UTC)[reply]
These images are easily scrapable through a bit of regex and looping. The various galleries are listed here Where each gallery has about 40 images of the same subject, different periods probably. Next to each gallery the name of the place is listed, where the category could just be like Category:Scotland maps or the like. We've done uploads through the Zoomify upload before so the experience is there.--Diaa abdelmoneim (talk) 10:07, 17 October 2009 (UTC)[reply]
I had a look at {{PD-art}}. Seems to work in Switzerland so no NPG issues ;-)
The plan:
  • Loop over the galleries at http://www.zb.unibe.ch/maps/ryhiner/sammlung/?group=volume (does that contain all maps?)
  • Loop over all images in a gallery
  • For each image pull the metadata. Several sources. Have to see what information is useful
  • Pull the image with some dezoomify tool
  • Generate filename, description and categories
  • Upload to Commons
What metadata to use exactly is somewhat tricky. Also the dezoomify if a bit of extra work. Multichill (talk) 15:43, 15 January 2010 (UTC)[reply]
Multichill, might I volunteer my dezoomify.py script, which will take in a web page holding a zoomify Flash object, regex for the location of the image tiles automatically and download and recompose the highest zoom level available. Have a look at: this page, which has a full code listing. Example of its work can be seen here. I hope it's useful. Inductiveload (talk) 02:58, 19 February 2010 (UTC)[reply]
Sure. Looks nice at first glance, but you should split it up in functions and use objects so it can be used in other programs (like pywikipedia). Probably best to make a lib part and a commandline part (which uses the lib part). What license is you code? Do you need some help restructuring it? Did you take a look at this script when you wrote your code? Multichill (talk) 09:19, 19 February 2010 (UTC)[reply]

Any progress? -- RE rillke questions? 18:25, 4 June 2012 (UTC)[reply]

Assigned to Progress Bot name