adding sourceUrl as deeplink for original of document source when using integrated vectorization #2019

@cforce

Description

[X] Feature Request

Use storage/data lake blob metadata as input for search index columns, both existing columns and custom mappings. When a client uploads a blob, its metadata, including the source URL, should be available as input to integrated vectorization. The source URL should then be propagated to the search index and used for citation links, so that users are directed to the original source rather than to the blob storage, which may serve only as a pull adapter (an intermediate copy of content that lives elsewhere).

Key elements:

  • sourceURL: A deep link URI used for citation purposes.
  • lastUpdateDate: The date of creation or last update (independent of blob creation), used for document retrieval, where the most recent documents carry a higher weight/order.
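
One possible way to give recent documents more weight is a freshness function in an Azure AI Search scoring profile. The sketch below only builds the JSON fragment as a Python dict. The profile name `recentFirst`, the boost value, and the one-year window are illustrative assumptions, not part of this request, and the approach assumes `lastUpdateDate` is indexed as a filterable `Edm.DateTimeOffset` field.

```python
import json


def freshness_scoring_profile(field_name="lastUpdateDate", boost=2.0, window="P365D"):
    """Build a scoring-profile fragment that boosts recently updated documents.

    The profile name, boost, and window are hypothetical defaults chosen for
    illustration; `window` is an ISO 8601 duration.
    """
    return {
        "name": "recentFirst",  # hypothetical profile name
        "functions": [
            {
                "type": "freshness",
                "fieldName": field_name,
                "boost": boost,
                "interpolation": "linear",
                "freshness": {"boostingDuration": window},
            }
        ],
    }


if __name__ == "__main__":
    # Print the fragment so it can be pasted into an index definition for review.
    print(json.dumps(freshness_scoring_profile(), indent=2))
```

Sorting results by `lastUpdateDate` at query time would be a simpler alternative if a hard ordering is wanted instead of a weighted boost.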

This approach allows blob metadata in each blob container to be mapped to index columns and reused in the UI for links and other citation contexts.
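
A minimal sketch of the idea, assuming the blob carries user-defined metadata keys named `sourceUrl` and `lastUpdateDate` (the spelling follows the issue title) and that the index has fields with the same names. The helper names `build_field_mappings` and `citation_link` are hypothetical. The mapping entries follow the `sourceFieldName`/`targetFieldName` shape of indexer field mappings. Whether integrated vectorization carries these fields through to each chunk depends on the skillset's index projections, which this sketch does not cover.

```python
def build_field_mappings(metadata_keys):
    """Map each blob metadata key to an index field of the same name.

    Returns a list shaped like an indexer's `fieldMappings` array.
    """
    return [
        {"sourceFieldName": key, "targetFieldName": key}
        for key in metadata_keys
    ]


def citation_link(search_doc, blob_url):
    """Prefer the original source deep link; fall back to the blob URL.

    `search_doc` is a dict-like search result; `blob_url` is the link the UI
    would otherwise show for the citation.
    """
    return search_doc.get("sourceUrl") or blob_url


if __name__ == "__main__":
    print(build_field_mappings(["sourceUrl", "lastUpdateDate"]))
    doc = {"sourceUrl": "https://example.com/handbook#section-3"}  # made-up URL
    print(citation_link(doc, "https://example.blob.core.windows.net/content/handbook.pdf"))
```

Falling back to the blob URL keeps citations working for documents that were uploaded without a `sourceUrl` metadata value.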

Labels: enhancement, ingestion
