The performance of Solr indexing has been significantly improved, particularly for datasets with many files.
A new dataverse.solr.min-files-to-use-proxy MicroProfile setting can be used to further improve performance and lower memory requirements for datasets with many files (e.g. 500+). It defaults to Integer.MAX_VALUE, which disables the new functionality.
doc/sphinx-guides/source/installation/config.rst (11 additions, 0 deletions)
@@ -2689,6 +2689,17 @@ when using it to configure your core name!

Can also be set via *MicroProfile Config API* sources, e.g. the environment variable ``DATAVERSE_SOLR_PATH``.

dataverse.solr.min-files-to-use-proxy
+++++++++++++++++++++++++++++++++++++

Specifies when to use a smaller datafile proxy object for the purposes of dataset indexing. This can lower memory requirements
and improve performance when reindexing large datasets (e.g. those with hundreds or thousands of files). (Creating the proxy may slightly slow indexing datasets with only a few files.)

This setting is the minimum number of files a dataset must have for the datafile proxy to be used. By default, this is set to Integer.MAX_VALUE, which disables the proxy.
A value of around 1000 is recommended, but the optimal value may vary depending on the details of your installation.

Can also be set via *MicroProfile Config API* sources, e.g. the environment variable ``DATAVERSE_SOLR_MIN_FILES_TO_USE_PROXY``.
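
As a minimal sketch, assuming a deployment where the application server process inherits its environment from the shell that starts it, the suggested starting value could be set like this. The value 1000 is only the rough recommendation given above, not a tuned figure, and the exact behavior at the threshold boundary is not specified here.

```shell
# Assumption: the application server reads this variable at startup,
# so it must be set before the server is (re)started.
# Use the datafile proxy for datasets with roughly 1000 files or more.
export DATAVERSE_SOLR_MIN_FILES_TO_USE_PROXY=1000
```

Leaving the variable unset keeps the default, which disables the proxy.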