-
Notifications
You must be signed in to change notification settings - Fork 5.1k
Description
Hi @pamelafox , @srbalakr
Previously I am using Azure Document intelligence and Sentence Text splitter(prepdocs.py file- setup_list_file_strategy method) for reading the data and converting it into chunks and embeddings. It is taking huge amount of time for indexing data.
I can see IntegratedVectorizerStrategy( https://github.com/Azure-Samples/azure-search-openai-demo/blob/main/app/backend/prepdocslib/integratedvectorizerstrategy.py) file in the updated Repo which will serves the same purpose with the help of Skillsets and Indexer.
I would like to know what is the exact difference between them and does the latest IntegratedVectorizerStrategy approach will consume less time for indexing purpose or does it have any other advantages compared to previous approach . Which approach has more benefits and does the usage of both approaches depends on any particular things.
Thankyou so much