Labels
SWoC26 (Social Winter of Code), feature-request, good second issue (after you have solved good first issue), help wanted (Extra attention is needed)
Description
Is your feature request related to a problem?
OLake is excellent for replicating transactional databases to Apache Iceberg. However, a large amount of valuable business data (logs, metrics, application search data) resides in Elasticsearch and OpenSearch clusters, and there is currently no native way to use OLake to replicate this data into our data lakehouse.
Describe the solution you'd like
I propose the development of a new source connector for Elasticsearch and OpenSearch.
This connector should support:
- Full Refresh Sync: For the initial load of an index.
- Incremental Sync: Crucial for time-series data. The connector would likely use a timestamp or other sortable field as a cursor, so that each run pulls only the documents added since the previous run.
- Configuration in the UI: Just like the existing Postgres and MySQL sources, allowing users to specify the host, credentials, and index name.
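To make the two sync modes above more concrete, here is a rough sketch of how they could share one code path. It is only an illustration, not a proposal for the actual OLake implementation. The function names (`build_search_body`, `pull_new_documents`), the injected `search` callable, and the tiebreaker field are all hypothetical. The request body uses the standard search DSL (`range` query, `sort`, `search_after`), and the sketch assumes the cursor field is a top-level field in each document and that the tiebreaker is a unique, sortable field.

```python
from typing import Any, Callable, Dict, List, Optional, Tuple

# Hypothetical: any callable that sends a search body to one index
# and returns the parsed JSON response (e.g. a thin client wrapper).
SearchFn = Callable[[Dict[str, Any]], Dict[str, Any]]


def build_search_body(
    cursor_field: str,
    tiebreaker_field: str,
    last_cursor: Optional[Any] = None,
    page_size: int = 1000,
    search_after: Optional[List[Any]] = None,
) -> Dict[str, Any]:
    """Build one page request. No cursor means a full refresh."""
    if last_cursor is None:
        query: Dict[str, Any] = {"match_all": {}}
    else:
        # Incremental: only documents strictly newer than the saved cursor.
        query = {"range": {cursor_field: {"gt": last_cursor}}}
    body: Dict[str, Any] = {
        "size": page_size,
        "query": query,
        # The tiebreaker keeps paging stable when cursor values repeat.
        "sort": [{cursor_field: "asc"}, {tiebreaker_field: "asc"}],
    }
    if search_after is not None:
        body["search_after"] = search_after
    return body


def pull_new_documents(
    search: SearchFn,
    cursor_field: str,
    tiebreaker_field: str,
    last_cursor: Optional[Any] = None,
    page_size: int = 1000,
) -> Tuple[List[Dict[str, Any]], Optional[Any]]:
    """Page through matching documents; return them with the new cursor."""
    docs: List[Dict[str, Any]] = []
    new_cursor = last_cursor
    search_after: Optional[List[Any]] = None
    while True:
        body = build_search_body(
            cursor_field, tiebreaker_field, last_cursor, page_size, search_after
        )
        hits = search(body)["hits"]["hits"]
        if not hits:
            break
        docs.extend(hit["_source"] for hit in hits)
        # Resume the next page after the last hit's sort values.
        search_after = hits[-1]["sort"]
        new_cursor = hits[-1]["_source"][cursor_field]
        if len(hits) < page_size:
            break
    return docs, new_cursor
```

A first run with `last_cursor=None` behaves as a full refresh. Later runs pass the cursor returned by the previous run. One caveat of the strict `gt` filter: documents that arrive late with a timestamp equal to the saved cursor would be skipped, so a real connector would need a policy for that case.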