You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: x-pack/plugin/inference/src/main/java/org/elasticsearch/xpack/inference/highlight/SemanticTextHighlighter.java
Copy file name to clipboardExpand all lines: x-pack/plugin/inference/src/test/java/org/elasticsearch/xpack/inference/highlight/SemanticTextHighlighterTests.java
+24Lines changed: 24 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -200,6 +200,30 @@ public void testNoSemanticField() throws Exception {
Copy file name to clipboardExpand all lines: x-pack/plugin/inference/src/test/resources/org/elasticsearch/xpack/inference/highlight/queries.json
+4-1Lines changed: 4 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -399,6 +399,9 @@
399
399
"After the marshland between the river Seine and its slower 'dead arm' to its north was filled in from around the 10th century, Paris's cultural centre began to move to the Right Bank. In 1137, a new city marketplace (today's Les Halles) replaced the two smaller ones on the Île de la Cité and Place de Grève (Place de l'Hôtel de Ville). The latter location housed the headquarters of Paris's river trade corporation, an organisation that later became, unofficially (although formally in later years), Paris's first municipal government.\n\n\nIn the late 12th century, Philip Augustus extended the Louvre fortress to defend the city against river invasions from the west, gave the city its first walls between 1190 and 1215, rebuilt its bridges to either side of its central island, and paved its main thoroughfares. In 1190, he transformed Paris's former cathedral school into a student-teacher corporation that would become the University of Paris and would draw students from all of Europe.\n\n\nWith 200,000 inhabitants in 1328, Paris, then already the capital of France, was the most populous city of Europe. By comparison, London in 1300 had 80,000 inhabitants. By the early fourteenth century, so much filth had collected inside urban Europe that French and Italian cities were naming streets after human waste. In medieval Paris, several street names were inspired by merde, the French word for \"shit\".\n\n\n",
400
400
"In March 2001, Bertrand Delanoë became the first socialist mayor. He was re-elected in March 2008. In 2007, in an effort to reduce car traffic, he introduced the Vélib', a system which rents bicycles. Bertrand Delanoë also transformed a section of the highway along the Left Bank of the Seine into an urban promenade and park, the Promenade des Berges de la Seine, which he inaugurated in June 2013.\n\n\nIn 2007, President Nicolas Sarkozy launched the Grand Paris project, to integrate Paris more closely with the towns in the region around it. After many modifications, the new area, named the Metropolis of Grand Paris, with a population of 6.7 million, was created on 1 January 2016. In 2011, the City of Paris and the national government approved the plans for the Grand Paris Express, totalling 205 km (127 mi) of automated metro lines to connect Paris, the innermost three departments around Paris, airports and high-speed rail (TGV) stations, at an estimated cost of €35 billion. The system is scheduled to be completed by 2030.\n\n\nIn January 2015, Al-Qaeda in the Arabian Peninsula claimed attacks across the Paris region. 1.5 million people marched in Paris in a show of solidarity against terrorism and in support of freedom of speech. In November of the same year, terrorist attacks, claimed by ISIL, killed 130 people and injured more than 350.\n\n\n",
401
401
"Bal-musette is a style of French music and dance that first became popular in Paris in the 1870s and 1880s; by 1880 Paris had some 150 dance halls. Patrons danced the bourrée to the accompaniment of the cabrette (a bellows-blown bagpipe locally called a \"musette\") and often the vielle à roue (hurdy-gurdy) in the cafés and bars of the city. Parisian and Italian musicians who played the accordion adopted the style and established themselves in Auvergnat bars, and Paris became a major centre for jazz and still attracts jazz musicians from all around the world to its clubs and cafés.\n\n\nParis is the spiritual home of gypsy jazz in particular, and many of the Parisian jazzmen who developed in the first half of the 20th century began by playing Bal-musette in the city. Django Reinhardt rose to fame in Paris, having moved to the 18th arrondissement in a caravan as a young boy, and performed with violinist Stéphane Grappelli and their Quintette du Hot Club de France in the 1930s and 1940s.\n\n\nImmediately after the War the Saint-Germain-des-Pres quarter and the nearby Saint-Michel quarter became home to many small jazz clubs, including the Caveau des Lorientais, the Club Saint-Germain, the Rose Rouge, the Vieux-Colombier, and the most famous, Le Tabou. They introduced Parisians to the music of Claude Luter, Boris Vian, Sydney Bechet, Mezz Mezzrow, and Henri Salvador. "
402
+
],
403
+
"expected_with_similarity_threshold": [
404
+
"\nParis (.mw-parser-output .IPA-label-small{font-size:85%}.mw-parser-output .references .IPA-label-small,.mw-parser-output .infobox .IPA-label-small,.mw-parser-output .navbox .IPA-label-small{font-size:100%}French pronunciation: ⓘ) is the capital and largest city of France. With an estimated population of 2,102,650 residents in January 2023 in an area of more than 105 km2 (41 sq mi), Paris is the fourth-largest city in the European Union and the 30th most densely populated city in the world in 2022. Since the 17th century, Paris has been one of the world's major centres of finance, diplomacy, commerce, culture, fashion, and gastronomy. Because of its leading role in the arts and sciences and its early adaptation of extensive street lighting, it became known as the City of Light in the 19th century.\n\n\nThe City of Paris is the centre of the Île-de-France region, or Paris Region, with an official estimated population of 12,271,794 inhabitants in January 2023, or about 19% of the population of France. The Paris Region had a nominal GDP of €765 billion (US$1.064 trillion when adjusted for PPP) in 2021, the highest in the European Union. According to the Economist Intelligence Unit Worldwide Cost of Living Survey, in 2022, Paris was the city with the ninth-highest cost of living in the world.\n\n\n"
402
405
]
403
406
},
404
407
"sparse_vector_1": {
@@ -464,4 +467,4 @@
464
467
"Diderot and D'Alembert published their Encyclopédie in 1751, before the Montgolfier Brothers launched the first manned flight in a hot air balloon on 21 November 1783. Paris was the financial capital of continental Europe, as well the primary European centre for book publishing, fashion and the manufacture of fine furniture and luxury goods. On 22 October 1797, Paris was also the site of the first parachute jump in history, by Garnerin.\n\n\nIn the summer of 1789, Paris became the centre stage of the French Revolution. On 14 July, a mob seized the arsenal at the Invalides, acquiring thousands of guns, with which it stormed the Bastille, a principal symbol of royal authority. The first independent Paris Commune, or city council, met in the Hôtel de Ville and elected a Mayor, the astronomer Jean Sylvain Bailly, on 15 July.\n\n\nLouis XVI and the royal family were brought to Paris and incarcerated in the Tuileries Palace. In 1793, as the revolution turned increasingly radical, the king, queen and mayor were beheaded by guillotine in the Reign of Terror, along with more than 16,000 others throughout France. The property of the aristocracy and the church was nationalised, and the city's churches were closed, sold or demolished. A succession of revolutionary factions ruled Paris until 9 November 1799 (coup d'état du 18 brumaire), when Napoleon Bonaparte seized power as First Consul.\n\n\n"
Copy file name to clipboardExpand all lines: x-pack/plugin/inference/src/yamlRestTest/resources/rest-api-spec/test/inference/90_semantic_text_highlighter.yml
+69-1Lines changed: 69 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -98,7 +98,6 @@ setup:
98
98
title: "Elasticsearch"
99
99
body: [ "ElasticSearch is an open source, distributed, RESTful, search engine which is built on top of Lucene internally and enjoys all the features it provides.", "You Know, for Search!" ]
- match: { hits.hits.0.highlight.bbq_hnsw_field.0: "ElasticSearch is an open source, distributed, RESTful, search engine which is built on top of Lucene internally and enjoys all the features it provides." }
reason: semantic highlighter fix for knn with similarity
678
+
679
+
- do:
680
+
index:
681
+
index: test-dense-index
682
+
id: doc_1
683
+
body:
684
+
body: [ "ElasticSearch is an open source, distributed, RESTful, search engine which is built on top of Lucene internally and enjoys all the features it provides.", "You Know, for Search!", "For a moment, nothing happened. Then, after a second or so, nothing continued to happen." ]
685
+
- do:
686
+
index:
687
+
index: test-dense-index
688
+
id: doc_2
689
+
body:
690
+
body: [ "Nothing travels faster than the speed of light with the possible exception of bad news, which obeys its own special laws."]
691
+
refresh: true
692
+
693
+
- do:
694
+
search:
695
+
index: test-dense-index
696
+
body:
697
+
query:
698
+
match_all: { }
699
+
highlight:
700
+
fields:
701
+
body:
702
+
type: "semantic"
703
+
number_of_fragments: 1
704
+
705
+
- match: { hits.total.value: 2 }
706
+
707
+
- match: { hits.hits.0._id: "doc_1" }
708
+
- length: { hits.hits.0.highlight: 1 }
709
+
- length: { hits.hits.0.highlight.body: 1 }
710
+
- match: { hits.hits.0.highlight.body.0: "ElasticSearch is an open source, distributed, RESTful, search engine which is built on top of Lucene internally and enjoys all the features it provides." }
711
+
712
+
- match: { hits.hits.1._id: "doc_2" }
713
+
- length: { hits.hits.1.highlight: 1 }
714
+
- length: { hits.hits.1.highlight.body: 1 }
715
+
- match: { hits.hits.1.highlight.body.0: "Nothing travels faster than the speed of light with the possible exception of bad news, which obeys its own special laws." }
716
+
717
+
- do:
718
+
search:
719
+
index: test-dense-index
720
+
body:
721
+
query:
722
+
knn:
723
+
field: "body"
724
+
query_vector_builder:
725
+
text_embedding:
726
+
model_text: "What is Elasticsearch?"
727
+
k: 10
728
+
num_candidates: 10
729
+
similarity: 0.9977
730
+
highlight:
731
+
fields:
732
+
body:
733
+
type: "semantic"
734
+
number_of_fragments: 3
735
+
736
+
- match: { hits.total.value: 1 }
737
+
- match: { hits.hits.0._id: "doc_1" }
738
+
- length: { hits.hits.0.highlight.body: 3 }
739
+
- match: { hits.hits.0.highlight.body.0: "ElasticSearch is an open source, distributed, RESTful, search engine which is built on top of Lucene internally and enjoys all the features it provides." }
740
+
- match: { hits.hits.0.highlight.body.1: "You Know, for Search!" }
741
+
- match: { hits.hits.0.highlight.body.2: "For a moment, nothing happened. Then, after a second or so, nothing continued to happen."}
Copy file name to clipboardExpand all lines: x-pack/plugin/inference/src/yamlRestTest/resources/rest-api-spec/test/inference/90_semantic_text_highlighter_bwc.yml
- match: { hits.hits.0.highlight.bbq_hnsw_field.0: "ElasticSearch is an open source, distributed, RESTful, search engine which is built on top of Lucene internally and enjoys all the features it provides." }
reason: semantic highlighter fix for knn with similarity
657
+
658
+
- do:
659
+
index:
660
+
index: test-dense-index
661
+
id: doc_1
662
+
body:
663
+
body: [ "ElasticSearch is an open source, distributed, RESTful, search engine which is built on top of Lucene internally and enjoys all the features it provides.", "You Know, for Search!", "For a moment, nothing happened. Then, after a second or so, nothing continued to happen." ]
664
+
- do:
665
+
index:
666
+
index: test-dense-index
667
+
id: doc_2
668
+
body:
669
+
body: [ "Nothing travels faster than the speed of light with the possible exception of bad news, which obeys its own special laws."]
670
+
refresh: true
671
+
672
+
- do:
673
+
search:
674
+
index: test-dense-index
675
+
body:
676
+
query:
677
+
match_all: { }
678
+
highlight:
679
+
fields:
680
+
body:
681
+
type: "semantic"
682
+
number_of_fragments: 1
683
+
684
+
- match: { hits.total.value: 2 }
685
+
686
+
- match: { hits.hits.0._id: "doc_1" }
687
+
- length: { hits.hits.0.highlight: 1 }
688
+
- length: { hits.hits.0.highlight.body: 1 }
689
+
- match: { hits.hits.0.highlight.body.0: "ElasticSearch is an open source, distributed, RESTful, search engine which is built on top of Lucene internally and enjoys all the features it provides." }
652
690
691
+
- match: { hits.hits.1._id: "doc_2" }
692
+
- length: { hits.hits.1.highlight: 1 }
693
+
- length: { hits.hits.1.highlight.body: 1 }
694
+
- match: { hits.hits.1.highlight.body.0: "Nothing travels faster than the speed of light with the possible exception of bad news, which obeys its own special laws." }
653
695
696
+
- do:
697
+
search:
698
+
index: test-dense-index
699
+
body:
700
+
query:
701
+
knn:
702
+
field: "body"
703
+
query_vector_builder:
704
+
text_embedding:
705
+
model_text: "What is Elasticsearch?"
706
+
k: 10
707
+
num_candidates: 10
708
+
similarity: 0.9977
709
+
highlight:
710
+
fields:
711
+
body:
712
+
type: "semantic"
713
+
number_of_fragments: 3
714
+
715
+
- match: { hits.total.value: 1 }
716
+
- match: { hits.hits.0._id: "doc_1" }
717
+
- length: { hits.hits.0.highlight.body: 3 }
718
+
- match: { hits.hits.0.highlight.body.0: "ElasticSearch is an open source, distributed, RESTful, search engine which is built on top of Lucene internally and enjoys all the features it provides." }
719
+
- match: { hits.hits.0.highlight.body.1: "You Know, for Search!" }
720
+
- match: { hits.hits.0.highlight.body.2: "For a moment, nothing happened. Then, after a second or so, nothing continued to happen."}
0 commit comments