|
38 | 38 | catchall "text" field, and use that for searching. |
39 | 39 | --> |
40 | 40 |
|
41 | | -<schema name="default-config" version="1.6"> |
| 41 | +<schema name="default-config" version="1.7"> |
42 | 42 | <!-- attribute "name" is the name of this schema and is only used for display purposes. |
43 | | - version="x.y" is Solr's version number for the schema syntax and |
| 43 | + version="x.y" is Solr's version number for the schema syntax and |
44 | 44 | semantics. It should not normally be changed by applications. |
45 | 45 |
|
46 | | - 1.0: multiValued attribute did not exist, all fields are multiValued |
| 46 | + 1.0: multiValued attribute did not exist, all fields are multiValued |
47 | 47 | by nature |
48 | | - 1.1: multiValued attribute introduced, false by default |
49 | | - 1.2: omitTermFreqAndPositions attribute introduced, true by default |
| 48 | + 1.1: multiValued attribute introduced, false by default |
| 49 | + 1.2: omitTermFreqAndPositions attribute introduced, true by default |
50 | 50 | except for text fields. |
51 | 51 | 1.3: removed optional field compress feature |
52 | 52 | 1.4: autoGeneratePhraseQueries attribute introduced to drive QueryParser |
53 | | - behavior when a single string produces multiple tokens. Defaults |
| 53 | + behavior when a single string produces multiple tokens. Defaults |
54 | 54 | to off for version >= 1.4 |
55 | | - 1.5: omitNorms defaults to true for primitive field types |
| 55 | + 1.5: omitNorms defaults to true for primitive field types |
56 | 56 | (int, float, boolean, string...) |
57 | 57 | 1.6: useDocValuesAsStored defaults to true. |
| 58 | + 1.7: docValues defaults to true, uninvertible defaults to false. |
58 | 59 | --> |
59 | 60 |
|
60 | 61 | <!-- Valid attributes for fields: |
61 | 62 | name: mandatory - the name for the field |
62 | | - type: mandatory - the name of a field type from the |
| 63 | + type: mandatory - the name of a field type from the |
63 | 64 | fieldTypes section |
64 | 65 | indexed: true if this field should be indexed (searchable or sortable) |
65 | 66 | stored: true if this field should be retrievable |
66 | 67 | docValues: true if this field should have doc values. Doc Values is |
67 | 68 | recommended (required, if you are using *Point fields) for faceting, |
68 | 69 | grouping, sorting and function queries. Doc Values will make the index |
69 | | - faster to load, more NRT-friendly and more memory-efficient. |
70 | | - They are currently only supported by StrField, UUIDField, all |
| 70 | + faster to load, more NRT-friendly and more memory-efficient. |
| 71 | + They are currently only supported by StrField, UUIDField, all |
71 | 72 | *PointFields, and depending on the field type, they might require |
72 | 73 | the field to be single-valued, be required or have a default value |
73 | 74 | (check the documentation of the field type you're interested in for |
|
82 | 83 | given field. |
83 | 84 | When using MoreLikeThis, fields used for similarity should be |
84 | 85 | stored for best performance. |
85 | | - termPositions: Store position information with the term vector. |
| 86 | + termPositions: Store position information with the term vector. |
86 | 87 | This will increase storage costs. |
87 | | - termOffsets: Store offset information with the term vector. This |
| 88 | + termOffsets: Store offset information with the term vector. This |
88 | 89 | will increase storage costs. |
89 | 90 | required: The field is required. It will throw an error if the |
90 | 91 | value does not exist |
|
102 | 103 | <!-- In this _default configset, only four fields are pre-declared: |
103 | 104 | id, _version_, and _text_ and _root_. All other fields will be type guessed and added via the |
104 | 105 | "add-unknown-fields-to-the-schema" update request processor chain declared in solrconfig.xml. |
105 | | - |
106 | | - Note that many dynamic fields are also defined - you can use them to specify a |
| 106 | +
|
| 107 | + Note that many dynamic fields are also defined - you can use them to specify a |
107 | 108 | field's type via field naming conventions - see below. |
108 | | - |
| 109 | +
|
109 | 110 | WARNING: The _text_ catch-all field will significantly increase your index size. |
110 | 111 | If you don't need it, consider removing it and the corresponding copyField directive." |
111 | 112 | --> |
|
115 | 116 | <field name="_version_" type="plong" indexed="false" stored="false"/> |
116 | 117 | <field name="_root_" type="string" indexed="true" stored="false" docValues="false" /> |
117 | 118 |
|
118 | | - |
119 | | - |
120 | | - |
121 | | - |
122 | | -<!-- Start: Dataverse-specific --> |
123 | | - |
| 119 | + |
| 120 | + |
| 121 | + |
| 122 | + |
| 123 | +<!-- Start: Dataverse-specific --> |
| 124 | + |
124 | 125 | <!-- catchall field, containing all other searchable text fields (implemented |
125 | 126 | via copyField further on in this schema --> |
126 | 127 | <!-- Dataverse solr 7.3.0: for some reason the old text wasn't working so switched to _text_ for copyfields --> |
|
216 | 217 | <!-- https://redmine.hmdc.harvard.edu/issues/3482 --> |
217 | 218 | <!-- 'Sorting can be done on the "score" of the document, or on any multiValued="false" indexed="true" field provided that field is either non-tokenized (ie: has no Analyzer) or uses an Analyzer that only produces a single Term (ie: uses the KeywordTokenizer)' http://wiki.apache.org/solr/CommonQueryParameters#sort --> |
218 | 219 | <!-- http://stackoverflow.com/questions/13360706/solr-4-0-alphabetical-sorting-trouble/13361226#13361226 --> |
219 | | - <field name="nameSort" type="alphaOnlySort" indexed="true" stored="true"/> |
| 220 | + <field name="nameSort" type="string" indexed="true" stored="true"/> |
220 | 221 |
|
221 | 222 | <field name="dateSort" type="pdate" indexed="true" stored="true"/> |
222 | 223 |
|
|
785 | 786 | <filter class="solr.TrimFilterFactory" /> |
786 | 787 | <!-- The PatternReplaceFilter gives you the flexibility to use |
787 | 788 | Java Regular expression to replace any sequence of characters |
788 | | - matching a pattern with an arbitrary replacement string, |
| 789 | + matching a pattern with an arbitrary replacement string, |
789 | 790 | which may include back references to portions of the original |
790 | 791 | string matched by the pattern. |
791 | 792 |
|
|
798 | 799 | <!-- https://redmine.hmdc.harvard.edu/issues/3482#note-11 --> |
799 | 800 | <!-- <filter class="solr.PatternReplaceFilterFactory" pattern="([^a-z])" replacement="" replace="all" /> --> |
800 | 801 | </analyzer> |
801 | | - </fieldType> |
802 | | - |
| 802 | + </fieldType> |
| 803 | + |
803 | 804 | <!-- The StrField type is not analyzed, but indexed/stored verbatim. --> |
804 | 805 | <fieldType name="string" class="solr.StrField" sortMissingLast="true" docValues="true" /> |
805 | 806 | <fieldType name="strings" class="solr.StrField" sortMissingLast="true" multiValued="true" docValues="true" /> |
|
0 commit comments