@@ -9,7 +9,7 @@ some with additional variants. For a list of supported languages, see the
9
9
<<analysis-stemmer-tokenfilter-language-parm,`language`>> parameter.
10
10
11
11
When not customized, the filter uses the
12
- http ://snowball.tartarus .org/algorithms/porter/stemmer.html[porter stemming
12
+ https ://snowballstem .org/algorithms/porter/stemmer.html[porter stemming
13
13
algorithm] for English.
14
14
15
15
[[analysis-stemmer-tokenfilter-analyze-ex]]
@@ -112,17 +112,17 @@ Language-dependent stemming algorithm used to stem tokens. If both this and the
112
112
.Valid values for `language`
113
113
====
114
114
Valid values are sorted by language. Defaults to
115
- http ://snowball.tartarus .org/algorithms/porter/stemmer.html[*`english`*].
115
+ https ://snowballstem .org/algorithms/porter/stemmer.html[*`english`*].
116
116
Recommended algorithms are *bolded*.
117
117
118
118
Arabic::
119
119
{lucene-analysis-docs}/ar/ArabicStemmer.html[*`arabic`*]
120
120
121
121
Armenian::
122
- http ://snowball.tartarus .org/algorithms/armenian/stemmer.html[*`armenian`*]
122
+ https ://snowballstem .org/algorithms/armenian/stemmer.html[*`armenian`*]
123
123
124
124
Basque::
125
- http ://snowball.tartarus .org/algorithms/basque/stemmer.html[*`basque`*]
125
+ https ://snowballstem .org/algorithms/basque/stemmer.html[*`basque`*]
126
126
127
127
Bengali::
128
128
https://www.tandfonline.com/doi/abs/10.1080/02564602.1993.11437284[*`bengali`*]
@@ -134,36 +134,36 @@ Bulgarian::
134
134
http://members.unine.ch/jacques.savoy/Papers/BUIR.pdf[*`bulgarian`*]
135
135
136
136
Catalan::
137
- http ://snowball.tartarus .org/algorithms/catalan/stemmer.html[*`catalan`*]
137
+ https ://snowballstem .org/algorithms/catalan/stemmer.html[*`catalan`*]
138
138
139
139
Czech::
140
140
https://dl.acm.org/doi/10.1016/j.ipm.2009.06.001[*`czech`*]
141
141
142
142
Danish::
143
- http ://snowball.tartarus .org/algorithms/danish/stemmer.html[*`danish`*]
143
+ https ://snowballstem .org/algorithms/danish/stemmer.html[*`danish`*]
144
144
145
145
Dutch::
146
- http ://snowball.tartarus .org/algorithms/dutch/stemmer.html[*`dutch`*],
147
- http ://snowball.tartarus .org/algorithms/kraaij_pohlmann/stemmer.html[`dutch_kp`]
146
+ https ://snowballstem .org/algorithms/dutch/stemmer.html[*`dutch`*],
147
+ https ://snowballstem .org/algorithms/kraaij_pohlmann/stemmer.html[`dutch_kp`]
148
148
149
149
English::
150
- http ://snowball.tartarus .org/algorithms/porter/stemmer.html[*`english`*],
150
+ https ://snowballstem .org/algorithms/porter/stemmer.html[*`english`*],
151
151
https://ciir.cs.umass.edu/pubfiles/ir-35.pdf[`light_english`],
152
- http ://snowball.tartarus .org/algorithms/lovins/stemmer.html[`lovins`],
152
+ https ://snowballstem .org/algorithms/lovins/stemmer.html[`lovins`],
153
153
https://www.researchgate.net/publication/220433848_How_effective_is_suffixing[`minimal_english`],
154
- http ://snowball.tartarus .org/algorithms/english/stemmer.html[`porter2`],
154
+ https ://snowballstem .org/algorithms/english/stemmer.html[`porter2`],
155
155
{lucene-analysis-docs}/en/EnglishPossessiveFilter.html[`possessive_english`]
156
156
157
157
Estonian::
158
158
https://lucene.apache.org/core/{lucene_version_path}/analyzers-common/org/tartarus/snowball/ext/EstonianStemmer.html[*`estonian`*]
159
159
160
160
Finnish::
161
- http ://snowball.tartarus .org/algorithms/finnish/stemmer.html[*`finnish`*],
161
+ https ://snowballstem .org/algorithms/finnish/stemmer.html[*`finnish`*],
162
162
http://clef.isti.cnr.it/2003/WN_web/22.pdf[`light_finnish`]
163
163
164
164
French::
165
165
https://dl.acm.org/citation.cfm?id=1141523[*`light_french`*],
166
- http ://snowball.tartarus .org/algorithms/french/stemmer.html[`french`],
166
+ https ://snowballstem .org/algorithms/french/stemmer.html[`french`],
167
167
https://dl.acm.org/citation.cfm?id=318984[`minimal_french`]
168
168
169
169
Galician::
@@ -172,8 +172,8 @@ http://bvg.udc.es/recursos_lingua/stemming.jsp[`minimal_galician`] (Plural step
172
172
173
173
German::
174
174
https://dl.acm.org/citation.cfm?id=1141523[*`light_german`*],
175
- http ://snowball.tartarus .org/algorithms/german/stemmer.html[`german`],
176
- http ://snowball.tartarus .org/algorithms/german2/stemmer.html[`german2`],
175
+ https ://snowballstem .org/algorithms/german/stemmer.html[`german`],
176
+ https ://snowballstem .org/algorithms/german2/stemmer.html[`german2`],
177
177
http://members.unine.ch/jacques.savoy/clef/morpho.pdf[`minimal_german`]
178
178
179
179
Greek::
@@ -183,18 +183,18 @@ Hindi::
183
183
http://computing.open.ac.uk/Sites/EACLSouthAsia/Papers/p6-Ramanathan.pdf[*`hindi`*]
184
184
185
185
Hungarian::
186
- http ://snowball.tartarus .org/algorithms/hungarian/stemmer.html[*`hungarian`*],
186
+ https ://snowballstem .org/algorithms/hungarian/stemmer.html[*`hungarian`*],
187
187
https://dl.acm.org/citation.cfm?id=1141523&dl=ACM&coll=DL&CFID=179095584&CFTOKEN=80067181[`light_hungarian`]
188
188
189
189
Indonesian::
190
190
http://www.illc.uva.nl/Publications/ResearchReports/MoL-2003-02.text.pdf[*`indonesian`*]
191
191
192
192
Irish::
193
- http ://snowball.tartarus. org/otherapps/oregan/intro.html [*`irish`*]
193
+ https ://snowballstem. org/otherapps/oregan/[*`irish`*]
194
194
195
195
Italian::
196
196
https://www.ercim.eu/publication/ws-proceedings/CLEF2/savoy.pdf[*`light_italian`*],
197
- http ://snowball.tartarus .org/algorithms/italian/stemmer.html[`italian`]
197
+ https ://snowballstem .org/algorithms/italian/stemmer.html[`italian`]
198
198
199
199
Kurdish (Sorani)::
200
200
{lucene-analysis-docs}/ckb/SoraniStemmer.html[*`sorani`*]
@@ -206,7 +206,7 @@ Lithuanian::
206
206
https://svn.apache.org/viewvc/lucene/dev/branches/lucene_solr_5_3/lucene/analysis/common/src/java/org/apache/lucene/analysis/lt/stem_ISO_8859_1.sbl?view=markup[*`lithuanian`*]
207
207
208
208
Norwegian (Bokmål)::
209
- http ://snowball.tartarus .org/algorithms/norwegian/stemmer.html[*`norwegian`*],
209
+ https ://snowballstem .org/algorithms/norwegian/stemmer.html[*`norwegian`*],
210
210
{lucene-analysis-docs}/no/NorwegianLightStemmer.html[*`light_norwegian`*],
211
211
{lucene-analysis-docs}/no/NorwegianMinimalStemmer.html[`minimal_norwegian`]
212
212
@@ -217,26 +217,26 @@ Norwegian (Nynorsk)::
217
217
Portuguese::
218
218
https://dl.acm.org/citation.cfm?id=1141523&dl=ACM&coll=DL&CFID=179095584&CFTOKEN=80067181[*`light_portuguese`*],
219
219
pass:macros[http://www.inf.ufrgs.br/~buriol/papers/Orengo_CLEF07.pdf[`minimal_portuguese`\]],
220
- http ://snowball.tartarus .org/algorithms/portuguese/stemmer.html[`portuguese`],
220
+ https ://snowballstem .org/algorithms/portuguese/stemmer.html[`portuguese`],
221
221
https://www.inf.ufrgs.br/\~viviane/rslp/index.htm[`portuguese_rslp`]
222
222
223
223
Romanian::
224
- http ://snowball.tartarus .org/algorithms/romanian/stemmer.html[*`romanian`*]
224
+ https ://snowballstem .org/algorithms/romanian/stemmer.html[*`romanian`*]
225
225
226
226
Russian::
227
- http ://snowball.tartarus .org/algorithms/russian/stemmer.html[*`russian`*],
227
+ https ://snowballstem .org/algorithms/russian/stemmer.html[*`russian`*],
228
228
https://doc.rero.ch/lm.php?url=1000%2C43%2C4%2C20091209094227-CA%2FDolamic_Ljiljana_-_Indexing_and_Searching_Strategies_for_the_Russian_20091209.pdf[`light_russian`]
229
229
230
230
Spanish::
231
231
https://www.ercim.eu/publication/ws-proceedings/CLEF2/savoy.pdf[*`light_spanish`*],
232
- http ://snowball.tartarus .org/algorithms/spanish/stemmer.html[`spanish`]
232
+ https ://snowballstem .org/algorithms/spanish/stemmer.html[`spanish`]
233
233
234
234
Swedish::
235
- http ://snowball.tartarus .org/algorithms/swedish/stemmer.html[*`swedish`*],
235
+ https ://snowballstem .org/algorithms/swedish/stemmer.html[*`swedish`*],
236
236
http://clef.isti.cnr.it/2003/WN_web/22.pdf[`light_swedish`]
237
237
238
238
Turkish::
239
- http ://snowball.tartarus .org/algorithms/turkish/stemmer.html[*`turkish`*]
239
+ https ://snowballstem .org/algorithms/turkish/stemmer.html[*`turkish`*]
240
240
====
241
241
242
242
`name`::
0 commit comments