On this page you can find bulk downloads of Linked Open Data sets I have prepared for convenience.
GND has been cleaned from anonymous resources (invalid/deleted/forwarded identifiers like 4073025-6).
Also ' @'
sequences
(used for indicating a mechanical word order for sorting by RAK rules)
has been replaced by proper Unicode u0098
/ u009c
pair
and denoted by a new private W3C language tag de-DE-x-rak
.
Example: "Eine @verhängnisvolle Affäre"
is converted to
"<u0098>Eine <u009c>verhängnisvolle Affäre"@de-DE-x-rak
For more info see W3C language tags
20130424-gnd.nt.gz (897564362 bytes) | |
20130424-gnd.ttl.gz (584462111 bytes) |
A leading '@'
or a ' @'
sequence (used for indicating a mechanical word order)
has been replaced by proper Unicode u0098
/ u009c
pair and denoted by a new private
W3C language tag x-viaf
.
Example: "@Bàn, Kàroly"
is converted to "Bàn, Kàroly"
"Museum of Art & History @The McPherson Center"
is converted to
"<u0098>Museum of Art & History <u009c> The McPherson Center"@x-viaf
For more info see W3C language tags
German Umlauts may be broken like in the original file. This is due to missing PICA character set
conversion to Unicode UTF-8. Example: OsnabrÓck
instead of Osnabrück
Ntriples
20130417-viaf-1.nt.gz (847119146 bytes) | |
20130417-viaf-2.nt.gz (844588750 bytes) | |
20130417-viaf-3.nt.gz (844407933 bytes) | |
20130417-viaf-4.nt.gz (844444219 bytes) |
Turtle
20130417-viaf-1.ttl.gz (514252565 bytes) | |
20130417-viaf-2.ttl.gz (513894175 bytes) | |
20130417-viaf-3.ttl.gz (514651218 bytes) | |
20130417-viaf-4.ttl.gz (515120687 bytes) |
Some statistical notes about conversion (on a desktop PC Fujitsu Esprimo P-400):
GND Turtle: 9799875 resource identifiers, 35 minutes, 3 seconds and 597 milliseconds, 4622421855 = 4,30 GB. 2,096 MB/s
GND Ntriples: 101494227 resource identifiers, 40 minutes, 58 seconds and 990 milliseconds, 12046053787 = 11,22 GB. 4,672 MB/s
VIAF Turtle: 125698959 resource identifiers, 1 hour, 7 minutes, 50 seconds and 564 milliseconds, 25819455706 = 24,05 GB. 30.879,986 rps = 6,049 MB/s
VIAF Ntriples: 127197149 resouce identifiers, 1 hour, 7 minutes, 41 seconds and 506 milliseconds, 60366730574 = 56,22 GB. 31.317,73 rps = 14,175 MB/s