File:Zipf-chin-1 Chinese texts - Red Mansion, Pentateuch, Voice of America.svg
Original file (SVG file, nominally 512 × 504 pixels, file size: 908 KB)
Summary
Description |
English: Zipf's law plot (frequency as a function of frequency rank) for the words in five texts in the Chinese (Mandarin) language. The texts and the word frequency files are:
In all these plots, each character (syllable, logogram) is treated as a separate word. In the first four plots, the Chinese characters of the original text were mapped 1:1 from GB (Guobiao) encoding to pinyin with tone marks and disambiguating suffixes '.1', '.2', etc., so as to distinguish characters with the same pinyin, e.g. 'zuo4', 'zuo4.1', 'zuo4.2'. In the last plot, the original file was a version transliterated by Ocrat.com, in pinyin with tone marks but without disambiguating suffixes, e.g. 'zuo4', so that the same pinyin word may represent two or more different characters. The word frequency files '*/*/*/gud.wfr' are available at the UNICAMP website. The original annotated full texts, before truncation/filtering, are in the companion files '*/*/org/main.src'. The truncated/filtered texts (one word per line, without punctuation) are in '*/*/*/gud.tlw'. |
Date | |
Source | Own work |
Author | Jorge Stolfi |
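The processing described in the summary above (a 1:1 character-to-pinyin mapping with disambiguating suffixes, followed by a rank/frequency count) can be illustrated with a minimal Python sketch. This is not the author's actual tooling. The function names, the tiny `pinyin_of` table, and the rule that suffixes are assigned in order of first appearance are all assumptions made for illustration.

```python
from collections import Counter


def disambiguate(chars, pinyin_of):
    """Map each character 1:1 to a pinyin token.

    Characters sharing the same pinyin get suffixes '.1', '.2', ...
    ASSUMPTION: suffixes are assigned in order of first appearance;
    the original files may have used a different rule.
    """
    token_of = {}      # character -> its unique token
    seen = Counter()   # pinyin -> number of distinct characters seen so far
    tokens = []
    for ch in chars:
        if ch not in token_of:
            base = pinyin_of[ch]
            n = seen[base]
            token_of[ch] = base if n == 0 else f"{base}.{n}"
            seen[base] += 1
        tokens.append(token_of[ch])
    return tokens


def rank_frequency(tokens):
    """Return (rank, count) pairs, with rank 1 for the most frequent token."""
    counts = sorted(Counter(tokens).values(), reverse=True)
    return list(enumerate(counts, start=1))


# Hypothetical three-character table; all three characters are read 'zuo4'.
pinyin_of = {"作": "zuo4", "坐": "zuo4", "做": "zuo4"}
tokens = disambiguate("作坐做作", pinyin_of)
print(tokens)                  # ['zuo4', 'zuo4.1', 'zuo4.2', 'zuo4']
print(rank_frequency(tokens))  # [(1, 2), (2, 1), (3, 1)]
```

Plotting the resulting pairs on log-log axes gives a curve of the kind shown in the figure. Without the suffixes, as in the last plot, the three characters above would collapse into a single token with count 4.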
Licensing
- You are free:
- to share – to copy, distribute and transmit the work
- to remix – to adapt the work
- Under the following conditions:
- attribution – You must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use.
- share alike – If you remix, transform, or build upon the material, you must distribute your contributions under the same or compatible license as the original.
File history
Date/Time | Dimensions | User | Comment |
---|---|---|---|
21:10, 15 May 2023 (current) | 512 × 504 (908 KB) | Jorge Stolfi | Rebuilt the file with small changes in dataset, colors |
18:21, 9 May 2023 | 512 × 504 (908 KB) | Jorge Stolfi | Uploaded own work with UploadWizard |
Metadata
Short title | Gnuplot |
---|---|
Image title | Produced by GNUPLOT 5.4 patchlevel 2 |
Width | 100% |
Height | 100% |