Skip to content

Add Elasticsearch / OpenSearch as a Source Connector #415

@metalshanked

Description

@metalshanked

Is your feature request related to a problem?

OLake is excellent for replicating transactional databases to Apache Iceberg. However, a massive amount of valuable business data (logs, metrics, application search data) resides in Elasticsearch and OpenSearch clusters. Currently, there is no native way to use OLake to replicate this data into our data lakehouse.

Describe the solution you'd like

I propose the development of a new source connector for Elasticsearch and OpenSearch.

This connector should support:

  • Full Refresh Sync: For the initial load of an index.
  • Incremental Sync: Crucial for time-series data, likely using a timestamp or sorting key as a cursor to efficiently pull new documents.
  • Configuration in the UI: Just like the existing Postgres and MySQL sources, allowing users to specify the host, credentials, and index name.

Metadata

Metadata

Assignees

Labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions