Same corpus, the other score is 0

``` python
from rank_bm25 import BM25Okapi

corpus = [
    "Hello there good man!",
    "It is quite windy in London",
    "How is the weather today?"
]

tokenized_corpus = [doc.split(" ") for doc in corpus]

bm25 = BM25Okapi(tokenized_corpus)

query = "windy London"
tokenized_query = query.split(" ")

doc_scores = bm25.get_scores(tokenized_query)

print(doc_scores)
```
> [0.         0.93729472 0.        ]


But with the third document commented out, every score becomes 0:
``` python
from rank_bm25 import BM25Okapi

corpus = [
    "Hello there good man!",
    "It is quite windy in London",
    # "How is the weather today?"
]

tokenized_corpus = [doc.split(" ") for doc in corpus]

bm25 = BM25Okapi(tokenized_corpus)

query = "windy London"
tokenized_query = query.split(" ")

doc_scores = bm25.get_scores(tokenized_query)

print(doc_scores)
```
> [0. 0.]

The only difference between the two runs is the number of documents in the corpus. This result looks incorrect to me, but I don't know why it happens.
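One possible explanation, sketched under the assumption that `BM25Okapi` uses the classic Okapi IDF term `log(N - n + 0.5) - log(n + 0.5)`, where `N` is the corpus size and `n` is the number of documents containing the term. The helper `okapi_idf` below is hypothetical and not part of `rank_bm25`; it only illustrates how the IDF of a term found in one document changes between a three-document and a two-document corpus.

``` python
import math


def okapi_idf(corpus_size: int, doc_freq: int) -> float:
    """Classic Okapi IDF (assumed formula): log(N - n + 0.5) - log(n + 0.5)."""
    return math.log(corpus_size - doc_freq + 0.5) - math.log(doc_freq + 0.5)


# "windy" and "London" each appear in exactly one document (n = 1).
idf_three_docs = okapi_idf(3, 1)  # log(2.5) - log(1.5), which is positive
idf_two_docs = okapi_idf(2, 1)    # log(1.5) - log(1.5), which is exactly zero

print(idf_three_docs, idf_two_docs)
```

If that assumption holds, each query term's contribution is multiplied by its IDF, so a term that appears in exactly half of the documents contributes nothing, whatever its term frequency. That would account for `[0. 0.]` in the two-document case.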