New feature: Report query latencies and index size by kwang2049 · Pull Request #5 · thakur-nandan/sprint

kwang2049 · 2023-02-11T23:11:01Z

This PR adds a new feature: The query latency details will be tracked and reported; the index size will be also reported.

Changes:

modified: examples/inference/distilsplade_max/beir_scifact/all_in_one.sh: Check out this example for the final demo report!
modified: sparse_retrieval/inference/aio.py: Add new arguments to search.run and evaluate.run;
modified: sparse_retrieval/inference/search.py:
- Add new argument output_latency to specify the output latency file.
- Add a LatencyReporter;
- Use the LatencyReporter to record each searcher.search/batch_search and report the latency details into a file called latency.tsv (under the same path of run.tsv);
  - Each line of latency.tsv is f"{qid}\t{word_length}\t{latency}\t{batch_size}\n"

modified: sparse_retrieval/inference/evaluate.py:

Add new argument latency_path and index_path as the paths to the stats source;
Add new argument bins to specify the binning query latencies wrt. how many word-length bins.

Summarize and report latency details:

  latency_info = {
    "latency": {
        "latency_avg": np.mean(latencies),
        "query_word_length_avg": np.mean(word_lengths),
        "binned": {
            "word_length_bins": word_length_bins.tolist(),
            "freqs": freqs.tolist(),
            "latencies_avg": binned_latencies_avg,
            "latencies_std": binned_latencies_std
        },
        "batch_size": np.mean(batch_sizes),
        "processor": get_processor_name()
    }
}

Report index size in MB.

modified: sparse_retrieval/inference/utils.py: Some new util functions to support the new features.

kwang2049 · 2023-02-11T23:26:03Z

This PR goes after the successor PR #4. Please first deal with #4 and then come back to this

This reverts commit f0c4600.

Now report query latencies and index size

39e9ec3

kwang2049 changed the title ~~Feature query latency and index size~~ New feature: Report query latencies and index size Feb 11, 2023

kwang2049 changed the base branch from main to BUG_ckpt_name_accepts_list_only February 11, 2023 23:11

kwang2049 requested a review from thakur-nandan February 11, 2023 23:25

kwang2049 added 6 commits February 12, 2023 00:47

correct variable name

f0c4600

Revert "correct variable name"

85d3515

This reverts commit f0c4600.

correct variable name

e0e9f21

new line at end

2ba2859

new line at end

bd219be

added overall std

12fece1

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New feature: Report query latencies and index size#5

New feature: Report query latencies and index size#5
kwang2049 wants to merge 7 commits intoBUG_ckpt_name_accepts_list_onlyfrom
FEATURE-query_latency_and_index_size

kwang2049 commented Feb 11, 2023 •

edited

Loading

Uh oh!

kwang2049 commented Feb 11, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

kwang2049 commented Feb 11, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kwang2049 commented Feb 11, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

kwang2049 commented Feb 11, 2023 •

edited

Loading