File:Similarity analysis and clustering of modules.pdf
Original file (2,133 × 148,614 pixels, file size: 429 KB, MIME type: application/pdf)
Summary
Description | Similarity analysis and clustering of modules.pdf |
English: This report is part of the Abstract_Wikipedia data science project (Abstract_Wikipedia/Data). The final project is hosted at abstract-wiki-ds.toolforge.org, and the code is on GitHub (github.com/wikimedia/abstract-wikipedia-data-science).
The report tests various clustering algorithms and analyzes their results to find a suitable match for our task: determining which modules are similar and therefore possible candidates for merging. It also contains a brief literature review of code similarity detection and a list of candidate algorithms that could improve the clustering. |
Date | |
Source | Own work |
Author | Aisha Khatun |
Licensing
- You are free:
  - to share – to copy, distribute and transmit the work
  - to remix – to adapt the work
- Under the following conditions:
  - attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
  - share alike – If you remix, transform, or build upon the material, you must distribute your contributions under the same or compatible license as the original.
File history
Date/Time | Dimensions | User | Comment |
---|---|---|---|
07:02, 8 March 2021 (current) | 2,133 × 148,614 (429 KB) | Aisha Khatun | Uploaded own work with UploadWizard |
Metadata
Software used | Chromium |
---|---|
Conversion program | Skia/PDF m88 |
Encrypted | no |
Page size | 1024.08 x 71335.9 pts |
Version of PDF format | 1.4 |