Adds MS Word .doc support to the llm-dataset-converter library.
- antiword available on PATH
  - Debian/Ubuntu: sudo apt install antiword
  - Windows: Softpedia
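Before installing the plugin, it can help to confirm that antiword really is reachable from Python. The following is a minimal sketch, not the plugin's own code: the helper names `antiword_path` and `doc_to_text` are hypothetical, and it assumes only that running `antiword <file>` prints the document text to standard output.

```python
import shutil
import subprocess


def antiword_path():
    """Return the full path of the antiword executable, or None if it is not on PATH."""
    return shutil.which("antiword")


def doc_to_text(doc_file):
    """Hypothetical helper: convert a .doc file to plain text by calling antiword."""
    exe = antiword_path()
    if exe is None:
        raise RuntimeError("antiword was not found on PATH")
    # antiword writes the extracted text to stdout; check=True raises on a non-zero exit code
    result = subprocess.run([exe, doc_file], capture_output=True, check=True)
    return result.stdout.decode("utf-8", errors="replace")


if __name__ == "__main__":
    print("antiword found at:", antiword_path())
```

If the script reports `None`, install antiword as described above or adjust PATH before using the plugin.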
pip install git+https://github.com/waikato-llm/llm-dataset-converter.git
pip install git+https://github.com/waikato-llm/ldc-doc.git

See here for an overview of all plugins.