You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: AGENTS.md
+22-3Lines changed: 22 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -17,9 +17,28 @@ If necessary, edit this file to ensure it accurately reflects the current state
17
17
* app/backend/approaches/prompts/chat_query_rewrite.prompty: Prompt used to rewrite the query based off search history into a better search query
18
18
* app/backend/approaches/prompts/chat_query_rewrite_tools.json: Tools used by the query rewriting prompt
19
19
* app/backend/approaches/prompts/chat_answer_question.prompty: Prompt used by the Chat approach to actually answer the question based off sources
20
-
* app/backend/prepdocslib/cloudingestionstrategy.py: Builds the Azure AI Search indexer and skillset for the cloud ingestion pipeline
21
-
* app/backend/prepdocslib/pdfparser.py: Uses Azure Document Intelligence to emit page text plus figure placeholders
22
-
* app/backend/prepdocslib/figureprocessor.py: Shared helper that generates figure descriptions for both local ingestion and the cloud figure-processor skill
20
+
* app/backend/prepdocslib: Contains the document ingestion library used by both local and cloud ingestion
21
+
* app/backend/prepdocslib/blobmanager.py: Manages uploads to Azure Blob Storage
22
+
* app/backend/prepdocslib/cloudingestionstrategy.py: Builds the Azure AI Search indexer and skillset for the cloud ingestion pipeline
* app/backend/prepdocslib/page.py: Data classes for pages, images, and chunks
34
+
* app/backend/prepdocslib/parser.py: Base parser interface
35
+
* app/backend/prepdocslib/pdfparser.py: Parses PDFs using Azure Document Intelligence or local parser
36
+
* app/backend/prepdocslib/searchmanager.py: Manages Azure AI Search index creation and updates
37
+
* app/backend/prepdocslib/servicesetup.py: Shared service setup helpers for OpenAI, embeddings, blob storage, etc.
38
+
* app/backend/prepdocslib/strategy.py: Base strategy interface for document ingestion
39
+
* app/backend/prepdocslib/textparser.py: Parses plain text and markdown files
40
+
* app/backend/prepdocslib/textprocessor.py: Processes text chunks for cloud ingestion (merges figures, generates embeddings)
41
+
* app/backend/prepdocslib/textsplitter.py: Splits text into chunks using different strategies
23
42
* app/backend/app.py: The main entry point for the backend application.
24
43
* app/functions: Azure Functions used for cloud ingestion custom skills (document extraction, figure processing, text processing). Each function bundles a synchronized copy of `prepdocslib`; run `python scripts/copy_prepdocslib.py` to refresh the local copies if you modify the library.
25
44
* app/frontend: Contains the React frontend code, built with TypeScript, built with vite.
0 commit comments