Open
Description
I wanted to know the number of unique terms in my index, but the reported value was -1.
Steps:
IndexCollection -collection TrecCollection -input /home/hiemstra/Data/robust04/ -index lucene-index.robust04.pos+docvectors -threads 16 -storePositions -storeDocvectors
IndexReaderUtils -stats -index lucene-index.robust04.pos+docvectors/
Results:
Index statistics
----------------
documents: 528030
documents (non-empty): 528030
unique terms: -1
total terms: 174540872
It turns out that "Terms.size(): (...) may be unavailable (returns -1) for some Terms implementations such as MultiTerms where it cannot be efficiently computed."
I already solved this myself: I will add a pull request.
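The fix itself is not shown in this issue. For readers hitting the same -1, a common workaround is to fall back to enumerating the term dictionary whenever the stored count is unavailable. The sketch below illustrates only that pattern and uses the Java standard library alone. `TermDictionary`, `UniqueTermCounter`, and `withoutStoredSize` are hypothetical names invented for illustration; they are not part of Lucene or Anserini. In Lucene the corresponding calls would be `Terms.size()`, `Terms.iterator()`, and `TermsEnum.next()` (which returns null once the terms are exhausted).

```java
import java.util.Iterator;
import java.util.List;

// Hypothetical stand-in for a per-field term dictionary, modelled loosely on
// Lucene's Terms. It is NOT a Lucene or Anserini type.
interface TermDictionary {
    // Number of unique terms, or -1 when the count is not cheaply available.
    long size();

    // Iterates over the unique terms, yielding each term exactly once.
    Iterator<String> iterator();
}

final class UniqueTermCounter {
    private UniqueTermCounter() {}

    // Returns the stored count when there is one; otherwise counts by
    // walking the whole dictionary, which is linear in the number of terms.
    static long countUniqueTerms(TermDictionary terms) {
        long reported = terms.size();
        if (reported >= 0) {
            return reported;
        }
        long count = 0;
        Iterator<String> it = terms.iterator();
        while (it.hasNext()) {
            it.next();
            count++;
        }
        return count;
    }

    // Builds a dictionary that behaves like a merged multi-segment view:
    // size() always reports -1, so callers have to enumerate.
    static TermDictionary withoutStoredSize(List<String> uniqueTerms) {
        return new TermDictionary() {
            @Override
            public long size() {
                return -1;
            }

            @Override
            public Iterator<String> iterator() {
                return uniqueTerms.iterator();
            }
        };
    }
}
```

Enumeration touches every term, so on a large index it is noticeably slower than reading a stored statistic. That is presumably why the count is not computed eagerly for merged views.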