2024-02-27: HTML parsing via Azure Document Intelligence

@pamelafox pamelafox released this 27 Feb 20:45
· 393 commits to main since this release
a9617cd

We updated prepdocs.py so that HTML files are now processed by Azure Document Intelligence. Here's a stream demonstrating ingestion of HTML docs. To use it, update to the latest version and put HTML files in the data/ folder; prepdocs.py will pick them up.
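Before running ingestion, it can help to confirm which HTML files are actually sitting in data/. The snippet below is a minimal sketch for that check only. It is not part of prepdocs.py and does not call Azure Document Intelligence. The helper name `find_html_files`, the recursive search, and the `.html` extension filter are assumptions made for illustration.

```python
from pathlib import Path


def find_html_files(data_dir="data"):
    """Return a sorted list of .html files found under data_dir.

    Hypothetical helper: the recursive walk and the ".html" filter are
    assumptions, not a description of how prepdocs.py selects files.
    """
    root = Path(data_dir)
    return sorted(
        p for p in root.rglob("*")
        if p.is_file() and p.suffix.lower() == ".html"
    )


if __name__ == "__main__":
    # List the HTML files that are present in the data/ folder.
    for path in find_html_files():
        print(path)
```

If the list is empty, check that the files were copied into data/ and not into a sibling folder.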

What's Changed

Full Changelog: 2024-02-23...2024-02-27