v0.3.0
This release adds support for PGvector Vector DB, speech-in speech-out support using RIVA and RAG observability tooling. This release also adds a dedicated example for RAG pipeline using only models from NVIDIA AI Foundation and one example demonstrating query decomposition. Detailed changes are listed below:
Added
- New dedicated example showcasing Nvidia AI Playground based models using Langchain connectors.
- New example demonstrating query decomposition.
- Support for using PG Vector as a vector database in the developer rag canonical example.
- Support for using Speech-in Speech-out interface in the sample frontend leveraging RIVA Skills.
- New tool showcasing RAG observability support.
- Support for on-prem deployment of TRTLLM based nemotron models.
Changed
- Upgraded Langchain and llamaindex dependencies for all container.
- Restructured README files for better intuitiveness.
- Added provision to plug in multiple examples using a common base class.
- Changed
minioservice's port to9010from9000in docker based deployment. - Moved
evaluationdirectory from top level to undertoolsand created a dedicated compose file. - Added an experimental directory for plugging in experimental features.
- Modified notebooks to use TRTLLM and Nvidia AI foundation based connectors from langchain.
- Changed
ai-playgroundmodel engine name tonv-ai-foundationin configurations.