Dedup ID on Pinecone DB #59178
Unanswered
ranggaaldosas
asked this question in
Source Python CDK
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I’m currently using the Pinecone connector in Airbyte with Incremental | Append + Deduped mode, and I’ve set cve_id as the primary key. However, I’m still seeing duplicate vectors in my index for the same CVE. Each update is creatin/upserting a new vector instead of update the existing one.
When I inspect the index, I can see that the connector is using the Airbyte-generated record IDs as the Pinecone vector IDs, for example:
Instead of upserting on cve_id, it’s assigning those random UUIDs (632e5bda-…, 01784779-…) as the vector IDs. Can you please advise:
Thanks for your help!
Beta Was this translation helpful? Give feedback.
All reactions