This minor release addresses the following items:
- Can now provide a secondary query when reading documents from MarkLogic. This is supported via the following new options:
  - spark.marklogic.read.secondaryUris.invoke
  - spark.marklogic.read.secondaryUris.javascript
  - spark.marklogic.read.secondaryUris.javascriptFile
  - spark.marklogic.read.secondaryUris.xquery
  - spark.marklogic.read.secondaryUris.xqueryFile
  - spark.marklogic.read.secondaryUris.vars.
- Can now provide a prompt when generating an embedding via the new spark.marklogic.write.embedder.prompt option.
- Can now encode vectors in documents when generating embeddings via the new spark.marklogic.write.embedder.base64encode option.
- Fixed a bug where classifying text and generating embeddings did not work when data was read from a structured data source such as JDBC or a delimited text file.
- Fixed a bug where a document with a URI containing multiple colons could not be read from MarkLogic and written to a file.
- Fixed a bug where URIs were incorrectly modified when documents were written as entries in a zip file. URIs are now used as the zip entry name.
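
As a rough illustration of how the new options might be supplied from PySpark, the sketch below builds two option maps. Only the option names come from these notes. The module path, collection name, variable name, prompt text, and the "true" value for base64encode are hypothetical placeholders, and treating the trailing-dot vars. option as a prefix followed by a variable name is an assumption. Other embedder configuration that a real job would need is omitted.

```python
def secondary_uris_read_options(client_uri: str) -> dict:
    """Read options with a secondary query. All values are placeholders."""
    return {
        "spark.marklogic.client.uri": client_uri,
        # Primary document selection; the collection name is hypothetical.
        "spark.marklogic.read.documents.collections": "orders",
        # New option: invoke a module as the secondary query (path is hypothetical).
        "spark.marklogic.read.secondaryUris.invoke": "/find-related-uris.sjs",
        # Assumption: the text after "vars." names a variable passed to the query.
        "spark.marklogic.read.secondaryUris.vars.relatedCollection": "customers",
    }


def embedder_write_options(client_uri: str) -> dict:
    """Write options using the new embedder settings. Values are placeholders."""
    return {
        "spark.marklogic.client.uri": client_uri,
        # New option: a prompt used when generating an embedding.
        "spark.marklogic.write.embedder.prompt": "Represent this passage for retrieval: ",
        # New option: encode vectors in documents ("true" is an assumed value).
        "spark.marklogic.write.embedder.base64encode": "true",
    }


def read_documents(spark, client_uri: str):
    """Load documents with the options above; needs a SparkSession with the connector."""
    options = secondary_uris_read_options(client_uri)
    return spark.read.format("marklogic").options(**options).load()
```

Keeping the options in plain dictionaries makes it easy to inspect or log the configuration before a Spark session is created. read_documents is only meaningful when the connector jar is on the Spark classpath.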