Redundant computation for exact nearest neighbors

When running the knnPerfTest multiple times with different quantization levels, I noticed that the true nearest neighbors are recomputed and cached to different files each time, which is not necessary. This is because the `indexPath` is used to calculate the hash key for caching the true nearest neighbors.

I think this is redundant because the true nearest neighbors should be index-agnostic.  Should we remove `indexPath` as a hash parameter ? 


https://github.com/mikemccand/luceneutil/blob/3184d644c5e270d80708640fef2ef43d0a603a99/src/main/knn/KnnGraphTester.java#L1025-L1029

	private int[][] getExactNN(Path docPath, Path indexPath, Path queryPath, int queryStartIndex) throws IOException, InterruptedException {
	// look in working directory for cached nn file
	String hash = Integer.toString(Objects.hash(docPath, indexPath, queryPath, numDocs, numQueryVectors, topK, similarityFunction.ordinal(), parentJoin, queryStartIndex, prefilter ? selectivity : 1f, prefilter ? randomSeed : 0f), 36);
	String nnFileName = "nn-" + hash + ".bin";
	Path nnPath = Paths.get(nnFileName);

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Redundant computation for exact nearest neighbors #391

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Redundant computation for exact nearest neighbors #391

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions