Fix multi-word search returning 0 results with JenaText by fvogel · Pull Request #1932 · NatLibFi/Skosmos

fvogel · 2026-02-12T21:22:20Z

Summary

Fixes #1930 — multi-word search queries (e.g. "Siamese cat", "Labrador retriever") return 0 results when using sparqlDialect "JenaText".

Root cause: LUCENE_ESCAPE_CHARS includes a space character, so createTextQueryCondition() escapes each space as "\ " (a backslash followed by a space), producing a single-token query like "Siamese\ cat*". Since StandardAnalyzer tokenizes labels into individual words, no indexed token ever contains a literal space, so the query never matches.
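
To make the mismatch concrete, here is a small Python sketch of the old behavior. It is not the Skosmos PHP code: the escape set shown is an assumption for illustration (only the leading space matters here), and simple_tokens() is a rough stand-in for an analyzer, not StandardAnalyzer itself.

```python
import re

# Assumed escape set, for illustration only; the leading space is the problem character.
OLD_ESCAPE_CHARS = ' +-&|!(){}[]^"~?:\\/'


def escape_term(term: str, escape_chars: str = OLD_ESCAPE_CHARS) -> str:
    """Prefix every character found in escape_chars with a backslash."""
    return "".join("\\" + ch if ch in escape_chars else ch for ch in term)


def simple_tokens(label: str) -> list:
    """Rough analyzer stand-in: lowercase, then split on non-word characters."""
    return [tok for tok in re.split(r"\W+", label.lower()) if tok]


escaped = escape_term("Siamese cat*")
tokens = simple_tokens("Siamese cat")

print(escaped)  # Siamese\ cat*   (one term containing an escaped space)
print(tokens)   # ['siamese', 'cat']   (no token contains a space)
```

The escaped query asks for a single term that starts with "Siamese cat", while the index only holds the separate tokens, so nothing can match.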

Fix:

Remove space from LUCENE_ESCAPE_CHARS
Split multi-word terms on whitespace and prefix each word with + (the Lucene "required" operator)
"Siamese cat*" becomes "+Siamese +cat*", so each word must match independently
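
The two steps above can be sketched in Python as follows. This is a hedged illustration, not the patch itself: build_required_query() and escape_word() are hypothetical names, the escape set is assumed, and leaving single-word terms without a + prefix is also an assumption, since the description only covers multi-word terms.

```python
# Assumed escape set with the space removed; illustration only.
NEW_ESCAPE_CHARS = '+-&|!(){}[]^"~?:\\/'


def escape_word(word: str) -> str:
    """Backslash-escape Lucene special characters, keeping * for wildcard expansion."""
    return "".join("\\" + ch if ch in NEW_ESCAPE_CHARS else ch for ch in word)


def build_required_query(term: str) -> str:
    """Split on whitespace and mark each word as required with a leading +.

    Assumption: single-word terms are returned without the + prefix.
    """
    words = term.split()  # splits on any run of whitespace
    escaped = [escape_word(w) for w in words]
    if len(escaped) < 2:
        return "".join(escaped)
    return " ".join("+" + w for w in escaped)


print(build_required_query("Siamese cat*"))  # +Siamese +cat*
print(build_required_query("Siamese*"))      # Siamese*
```

Because each word is now a separate required clause, the trailing wildcard applies only to the last word, matching the "+Siamese +cat*" form described above.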

Test plan

Single-word search still works: Siamese* → 1 result
Multi-word prefLabel search now works: Siamese cat* → 1 result (was 0)
Multi-word altLabel search now works: Domestic cat* → 1 result (was 0)
Wildcard expansion preserved: Lab* → 1 result (Labrador retriever)
Existing PHPUnit tests pass

Tested against Skosmos 3.1 with Fuseki 5.4.0 (StandardAnalyzer, default config).

Remove space from LUCENE_ESCAPE_CHARS and split multi-word queries into individual required Lucene terms using the '+' operator. Previously, spaces were escaped as '\ ', creating a single-token query like "Siamese\ cat*" that never matches any indexed term because StandardAnalyzer tokenizes labels into individual words. Now "Siamese cat*" becomes "+Siamese +cat*", requiring each word to match independently. Fixes NatLibFi#1930

sonarqubecloud · 2026-02-12T21:22:53Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

fvogel wants to merge 1 commit into NatLibFi:main from fvogel:fix/multiword-search-space-escape
