English subtitles for clip: File:OpenRefine Commons - editing - retrieve wikitext from Commons files.webm
Jump to navigation
Jump to search
1 00:00:06,880 --> 00:00:12,960 This is an OpenRefine project that is loaded from a Wikimedia Commons category. 2 00:00:12,960 --> 00:00:15,200 I am looking at a selection of images here, 3 00:00:15,200 --> 00:00:18,920 and I am interested in adding structured data to them. 4 00:00:18,920 --> 00:00:24,640 For instance: what is being depicted in the files, the photographer, etcetera. 5 00:00:24,640 --> 00:00:28,280 One way to add structured data is to actually 6 00:00:28,280 --> 00:00:32,360 take the wikitext - unstructured description from these files... 7 00:00:32,360 --> 00:00:37,240 ... and create a column with that wikitext. 8 00:00:37,240 --> 00:00:40,640 Later on, you can then extract data from that wikitext 9 00:00:40,640 --> 00:00:42,520 and convert it to structured data. 10 00:00:42,520 --> 00:00:45,000 This is a very handy thing to do. 11 00:00:45,000 --> 00:00:47,080 How do you go about that? 12 00:00:47,080 --> 00:00:49,920 You select the column with your file names. 13 00:00:49,920 --> 00:00:52,600 As you can see, the file names have been reconciled 14 00:00:52,600 --> 00:00:54,120 with Wikimedia Commons. 15 00:00:54,120 --> 00:00:57,960 So they are blue and they show a thumbnail. 16 00:00:57,960 --> 00:01:05,760 I select the column, and then I go to the function "Add columns from reconciled values...". 17 00:01:05,760 --> 00:01:09,680 I get several options of things I can retrieve about this file. 18 00:01:09,680 --> 00:01:12,000 I choose wikitext. 19 00:01:12,000 --> 00:01:16,476 Then it will load for a while, and show me a preview ... 20 00:01:16,476 --> 00:01:22,080 ... and then I click "OK", and then OpenRefine will generate 21 00:01:22,080 --> 00:01:28,960 a column for me with Wikitext.