Skip to content

Commit 778fe0e

Browse files
committed
Refine Airbyte-Couchbase integration tutorial by clarifying sync interval details and updating timestamp fields in examples. Adjusted terminology for consistency and improved accuracy in data extraction references.
1 parent 5a7b93c commit 778fe0e

File tree

1 file changed

+10
-10
lines changed

1 file changed

+10
-10
lines changed

tutorial/markdown/connectors/airbyte/airbyte-couchbase-integration.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -30,7 +30,7 @@ Airbyte is an open-source data integration platform that enables you to move dat
3030
- **Data ingestion**: Load data from SaaS applications, databases, or APIs into Couchbase
3131
- **Change data capture**: Track and replicate document changes with periodic syncs
3232

33-
> **Note**: Airbyte is designed for batch/periodic data synchronization (typically 5-60 minute intervals), not sub-second real-time change tracking. For true real-time CDC, consider Couchbase's built-in XDCR or Eventing services.
33+
> **Note**: Airbyte is designed for batch/periodic data synchronization, not sub-second real-time change tracking. Airbyte Cloud supports minimum 60-minute intervals (hourly syncs); self-hosted deployments may support more frequent syncs with configuration. For true real-time CDC, consider Couchbase's built-in XDCR or Eventing services.
3434
3535
This tutorial will guide you through setting up Airbyte with Couchbase Capella (cloud-hosted) as both source and destination, covering configuration, sync modes, common patterns, and best practices.
3636

@@ -367,7 +367,7 @@ All documents written to Couchbase by Airbyte follow this structure:
367367
"id": "stream_name::key_value",
368368
"type": "airbyte_record",
369369
"stream": "source_stream_name",
370-
"emitted_at": 1642526400000,
370+
"_airbyte_extracted_at": 1642526400000,
371371
"data": {
372372
// Original record data from source
373373
},
@@ -380,7 +380,7 @@ All documents written to Couchbase by Airbyte follow this structure:
380380
- `id`: Composite document ID (based on sync mode and primary key)
381381
- `type`: Always "airbyte_record"
382382
- `stream`: Name of the source stream
383-
- `emitted_at`: Unix timestamp (milliseconds) when record was synced
383+
- `_airbyte_extracted_at`: Unix timestamp (milliseconds) when Airbyte extracted the record from source
384384
- `data`: The actual record data from the source
385385
- `_ab_sync_mode`: Which sync mode was used
386386
- `namespace`: Optional logical grouping
@@ -555,9 +555,9 @@ After the sync completes:
555555
Example query to check synced data:
556556
```sql
557557
SELECT COUNT(*) as doc_count,
558-
MIN(emitted_at) as first_sync,
559-
MAX(emitted_at) as last_sync
560-
FROM `analytics`.`_default`.`travel-sample.inventory.airline`
558+
MIN(_airbyte_extracted_at) as first_sync,
559+
MAX(_airbyte_extracted_at) as last_sync
560+
FROM `staging`.`_default`.`travel-sample.inventory.airline`
561561
WHERE type = 'airbyte_record'
562562
```
563563

@@ -1047,9 +1047,9 @@ Monitor sync health in the Airbyte UI:
10471047
```sql
10481048
-- Monitor recent Airbyte writes
10491049
SELECT COUNT(*) as recent_writes,
1050-
MAX(emitted_at) as last_sync
1050+
MAX(_airbyte_extracted_at) as last_sync
10511051
FROM `bucket`.`scope`.`collection`
1052-
WHERE emitted_at > (UNIX_MILLIS() - 3600000) -- Last hour
1052+
WHERE _airbyte_extracted_at > (UNIX_MILLIS() - 3600000) -- Last hour
10531053
AND type = 'airbyte_record'
10541054
```
10551055

@@ -1350,10 +1350,10 @@ SELECT COUNT(*) FROM `bucket`.`scope`.`collection`
13501350
WHERE type = 'airbyte_record'
13511351

13521352
-- View recent syncs
1353-
SELECT stream, emitted_at, data
1353+
SELECT stream, _airbyte_extracted_at, data
13541354
FROM `bucket`.`scope`.`collection`
13551355
WHERE type = 'airbyte_record'
1356-
ORDER BY emitted_at DESC
1356+
ORDER BY _airbyte_extracted_at DESC
13571357
LIMIT 10
13581358

13591359
-- Check for data quality issues

0 commit comments

Comments
 (0)