Package org.eclipse.rdf4j.rio.hdt
Parser/writer for the HDT v1.0 format.
Unfortunately the draft specification is not entirely clear and probably slightly out of date, since the open source
reference implementation HDT-It seems to implement a slightly different version. This parser tries to be compatible
with HDT-It 1.0.
File structure:
+---------------------+ | Global | | Header | | Dictionary (Shared) | | Dictionary (S) | | Dictionary (P) | | Dictionary (O) | | Triples | +---------------------+
General structure for Global, Header, Dictionary and Triples
These part all starts with$HDT
, followed by a byte indicating the type of the part, the format, and
optionally one or more key=value;
properties.
Then a NULL
byte, followed by the 16-bit CRC ($HDT
and NULL
included)
+------+------+--------+------+------------+------+-------+ | $HDT | type | format | NULL | key=value; | NULL | CRC16 | +------+------+--------+------+------------+------+-------+The
format
varies slightly: depending on the section, it can either be a string or a URI.-
Class Summary Class Description CRC16 CRC16-ANSI checksumCRC32 CRC32C checksumCRC8 CRC8-CCIT checksumHDTArray HDT ArrayHDTArrayFactory HDT Array factory.HDTArrayLog64 Log64HDTBitmap HDTDictionary HDT Dictionary Part.HDTDictionarySection HDT DictionarySection part.HDTDictionarySectionFactory HDT DictionarySection factory.HDTDictionarySectionPFC HDT DictionarySection Plain Front Coding.HDTGlobal HDT Global Part.HDTHeader HDT Header Part.HDTParser RDF parser for HDT v1.0 files.HDTParserFactory AnRDFParserFactory
for HDT parsers.HDTPart Helper class for different HDT parts.HDTTriples HDT Triples Part.HDTTriplesSection HDT TriplesSection part.HDTTriplesSectionBitmap HDT Triples section.HDTTriplesSectionFactory HDT Array factory.VByte Variable byte encoding for numbers. -
Enum Summary Enum Description HDTArray.Type HDTDictionarySection.Type HDTPart.Type HDTTriples.Order