English subtitles for clip: File:OpenRefine Commons - upload - fill schema and preview edits.webm
Jump to navigation
Jump to search
1 00:00:04,280 --> 00:00:07,560 This is an OpenRefine project in which I am ready to start 2 00:00:07,560 --> 00:00:10,880 uploading files to Wikimedia Commons. 3 00:00:10,880 --> 00:00:17,040 The files are files like this: they are silhouette images of species. 4 00:00:17,920 --> 00:00:23,800 I have prepared all the columns that are needed to do a proper upload. 5 00:00:23,800 --> 00:00:26,480 My next step is going to be to build my schema. 6 00:00:26,480 --> 00:00:28,640 So I jump to the schema. 7 00:00:28,640 --> 00:00:33,320 I can do that via the "Edit Wikibase schema..." function 8 00:00:33,320 --> 00:00:35,840 here in the Wikibase menu, 9 00:00:35,840 --> 00:00:38,120 or by clicking here. 10 00:00:38,120 --> 00:00:43,440 Then, I am arriving at the schema screen in OpenRefine. 11 00:00:43,440 --> 00:00:46,480 As I will be doing uploads to Wikimedia Commons, 12 00:00:46,480 --> 00:00:51,920 I need to make sure I am using Wikimedia Commons as a Wikibase instance. 13 00:00:51,920 --> 00:00:54,960 So I switch to that one. 14 00:00:54,960 --> 00:00:57,400 Then I can click on the "Add media" button, 15 00:00:57,400 --> 00:00:59,920 and some pre-filled fields will appear. 16 00:01:01,160 --> 00:01:04,360 If you are familiar with editing Wikidata, you will be a bit surprised... 17 00:01:04,360 --> 00:01:09,240 ... because there are a few more fields to fill in. 18 00:01:09,240 --> 00:01:13,560 So I have the main entity here: the main thing that I want to work on. 19 00:01:13,560 --> 00:01:14,960 The main entity. 20 00:01:14,960 --> 00:01:18,200 And here I drag my reconciled file name. 21 00:01:18,200 --> 00:01:21,360 So, in the previous step, I have made sure 22 00:01:21,360 --> 00:01:23,940 that my file name on Wikimedia Commons has been reconciled,... 23 00:01:23,940 --> 00:01:28,200 ... and has been indicated as "OpenRefine should create new items". 24 00:01:28,200 --> 00:01:30,000 So I drag that one. 25 00:01:30,000 --> 00:01:33,080 That indicates that all the information below this 26 00:01:33,080 --> 00:01:37,560 will apply to that specific new file that will be created. 27 00:01:37,560 --> 00:01:42,720 Then, I have three fields that I have to drag columns towards. 28 00:01:42,720 --> 00:01:44,440 The file path here. 29 00:01:44,440 --> 00:01:49,480 I will drag either the path from my hard drive - the column that indicates that,... 30 00:01:49,480 --> 00:01:54,400 ...or the path on the web where the media file is living. 31 00:01:54,400 --> 00:01:58,960 In my case, that's the "vector-href" column. 32 00:01:58,960 --> 00:02:01,440 Going back to my project, it is this column,... 33 00:02:01,440 --> 00:02:04,200 ... and if I click it, this is where the actual images 34 00:02:04,200 --> 00:02:05,240 live on the internet. 35 00:02:05,240 --> 00:02:09,560 So these are the images that will be uploaded. It's vector images. 36 00:02:09,560 --> 00:02:14,560 Going back to my schema, I have to drag again - the file name here. 37 00:02:14,560 --> 00:02:19,800 So this field will tell Wikimedia Commons which name the file should get. 38 00:02:19,800 --> 00:02:22,240 So I drag that one here as well. 39 00:02:22,240 --> 00:02:26,440 And then I should also always provide my column with wikitext, 40 00:02:26,440 --> 00:02:33,800 which I also, here, prepared in my project. 41 00:02:33,800 --> 00:02:38,720 Next, I can add captions to my file. That's always best practice. 42 00:02:38,720 --> 00:02:41,520 I have prepared one column with English captions,... 43 00:02:41,520 --> 00:02:45,120 ...and I have to indicate I will use the language English. 44 00:02:45,120 --> 00:02:49,200 Of course, I can add multiple if I want. 45 00:02:49,200 --> 00:02:53,400 And next I can also add all the statements that apply to my file. 46 00:02:53,400 --> 00:02:57,560 So, I will add various statements here, like: 47 00:02:57,560 --> 00:02:59,240 Creator, 48 00:02:59,240 --> 00:03:01,920 The time when the file was created, 49 00:03:01,920 --> 00:03:02,840 The source, 50 00:03:02,840 --> 00:03:05,000 And the copyright license. 51 00:03:05,000 --> 00:03:24,960 The basic structured data that every file needs on Wikimedia Commons. 52 00:03:24,960 --> 00:03:28,320 I have filled in my entire schema. 53 00:03:28,320 --> 00:03:32,280 So, you can see that I have all the default statements 54 00:03:32,280 --> 00:03:35,520 that need to be present in the structured data of a file. 55 00:03:35,520 --> 00:03:39,680 The Creator statement, Inception, Source of the file, 56 00:03:39,680 --> 00:03:43,440 the Copyright and License statements. 57 00:03:43,440 --> 00:03:46,640 And I also added Depicts statements, 58 00:03:46,640 --> 00:03:51,840 with the taxa that are being depicted in the files. 59 00:03:51,840 --> 00:03:56,260 I can then also use the "Preview" tab in my project... 60 00:03:56,260 --> 00:03:58,800 ... to see what my edits will look like. 61 00:03:58,800 --> 00:04:03,840 So you see that all this information is displaying here. 62 00:04:03,840 --> 00:04:06,160 And I can doublecheck what that looks like. 63 00:04:06,160 --> 00:04:10,240 It is always a good idea to just do a first small upload 64 00:04:10,240 --> 00:04:11,280 with just a few files,... 65 00:04:11,280 --> 00:04:13,740 ... to check them on Wikimedia Commons as well... 66 00:04:13,740 --> 00:04:18,920 ... before you start doing a really big upload.