-
Notifications
You must be signed in to change notification settings - Fork 10
Open
Description
Hi,
I'm Matteo Grolla from Sourcesense, Lucidwork's partner in Italy.
I'm developing a custom connector for a customer but I have questions about error management, Robert Lucarini suggested to post my questions here.
Let's use random-content-incremental for our discussion and let's focus on the fetch method
What I've noticed is:
- if an exception is thrown inside generateRandom the framework restarts the crawl from previous checkpoint (or the beginning if it was the first)
How can I terminate the crawl marking it as failed?
I'd like that next time I restart the crawl it proceeds from last saved checkpoint - if an exception is thrown inside emitDocument the framework logs the error and proceeds with the crawl.
Will this document be recrawled? When? Can we control this?
Thanks a lot
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels