-
Notifications
You must be signed in to change notification settings - Fork 66
Open
Description
I'm trying to merge two HDT files using hdtCat.sh. Each file has more than 13 billion triples:
- file 1 has
13736601325triples - file 2 has
13827925785triples
After about 25 hours I get this error:
Exception in thread “main” java.lang.ArrayIndexOutOfBoundsException: Index -4 out of bounds for length 29
at org.rdfhdt.hdt.util.disk.LongArrayDisk.get(LongArrayDisk.java:116)
at org.rdfhdt.hdt.dictionary.impl.utilCat.CatMappingBack.set(CatMappingBack.java:77)
at org.rdfhdt.hdt.dictionary.impl.FourSectionDictionaryCat.cat(FourSectionDictionaryCat.java:244)
at org.rdfhdt.hdt.hdt.impl.HDTImpl.cat(HDTImpl.java:486)
at org.rdfhdt.hdt.hdt.HDTManagerImpl.doHDTCat(HDTManagerImpl.java:329)
at org.rdfhdt.hdt.hdt.HDTManager.catHDT(HDTManager.java:642)
at org.rdfhdt.hdt.tools.HDTCat.cat(HDTCat.java:82)
at org.rdfhdt.hdt.tools.HDTCat.execute(HDTCat.java:116)
at org.rdfhdt.hdt.tools.HDTCat.main(HDTCat.java:184)
I tried both v3.0.10 and v3.0.9 with the same result. I can provide these files, but each is about 170 GB. I haven't run into this issue with any smaller files.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels