Skip to content

KameniAlexNea/medivocate

Repository files navigation

An AI-driven platform empowering users with trustworthy, personalized history guidance to combat misinformation and promote equitable history.

Deployed on HF Space

📦 ./
├── 📁 docs/
├── 📁 src/
│   ├── 📁 ocr/
│   ├── 📁 preprocessing/
│   ├── 📁 chunking/
│   ├── 📁 vector_store/
│   ├── 📁 rag_pipeline/
│   ├── 📁 llm_integration/
│   └── 📁 prompt_engineering/
├── 📁 tests/
│   ├── 📁 unit/
│   └── 📁 integration/
├── 📁 examples/
├── 📁 notebooks/
├── 📁 config/
├── 📄 README.md
├── 📄 CONTRIBUTING.md
├── 📄 requirements.txt
├── 📄 .gitignore
└── 📄 LICENSE

Description des Dossiers et Fichiers

  1. docs/

    • Contient la documentation générale du projet.
    • Exemple : Guide de démarrage rapide, architecture du projet, et spécifications techniques.
  2. src/

    • Dossier principal contenant le code source organisé par modules.
    • Sous-dossiers :
      • ocr/ : Module pour l'extraction de texte à partir de documents.
      • preprocessing/ : Pipelines de nettoyage et de standardisation des documents.
      • chunking/ : Méthodes pour diviser les documents en chunks exploitables.
      • vector_store/ : Intégration de bases de données vectorielles.
      • rag_pipeline/ : Implémentation du pipeline RAG (Retrieval-Augmented Generation).
      • llm_integration/ : Gestion des modèles LLM pour la génération de réponses.
      • prompt_engineering/ : Modules pour reformuler et optimiser les requêtes.
  3. tests/

    • unit/ : Tests unitaires pour chaque module.
    • integration/ : Tests d’intégration entre plusieurs modules.
  4. examples/

    • Contient des exemples fonctionnels démontrant l'utilisation des principaux modules.
  5. notebooks/

    • Jupyter notebooks pour des expérimentations ou des démonstrations rapides.
  6. config/

    • Fichiers de configuration pour les bibliothèques, les pipelines, ou les environnements.

Some recommendations

Groq

While Groq's request limit is noted, a brief explanation of what Groq offers in terms of LLM integration would help. For instance: “Groq provides access to high-performance LLM APIs with free-tier support for RAG applications. Ideal for quick prototyping and testing.”

LangChain

LangChain is a framework for developing applications powered by large language models (LLMs).

For these applications, LangChain simplifies the entire application lifecycle:

  • Open-source libraries : Build your applications using LangChain's open-source components and third-party integrations. Use LangGraph to build stateful agents with first-class streaming and human-in-the-loop support.
  • Productionization : Inspect, monitor, and evaluate your apps with LangSmith so that you can constantly optimize and deploy with confidence.
  • Deployment : Turn your LangGraph applications into production-ready APIs and Assistants with LangGraph Platform.

LangSmith

LangSmith helps your team debug, evaluate, and monitor your language models and intelligent agents. It works with any LLM Application, including a native integration with the LangChain Python and LangChain JS open source libraries.

LangSmith is developed and maintained by LangChain, the company behind the LangChain framework.

Dataset

🤗 Datasets is a lightweight library providing two main features:

  • one-line dataloaders for many public datasets : one-liners to download and pre-process any of the number of datasets major public datasets (image datasets, audio datasets, text datasets in 467 languages and dialects, etc.) provided on the HuggingFace Datasets Hub. With a simple command like squad_dataset = load_dataset("squad"), get any of these datasets ready to use in a dataloader for training/evaluating a ML model (Numpy/Pandas/PyTorch/TensorFlow/JAX),
  • efficient data pre-processing : simple, fast and reproducible data pre-processing for the public datasets as well as your own local datasets in CSV, JSON, text, PNG, JPEG, WAV, MP3, Parquet, etc. With simple commands like processed_dataset = dataset.map(process_example), efficiently prepare the dataset for inspection and ML model evaluation and training.

Search Index : Hugging Face + Chroma, BM25

About

An AI-powered platform exploring African history, culture, and traditional medicine, fostering understanding and appreciation of the continent's rich heritage.

Topics

Resources

License

Contributing

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages