An AI-driven platform empowering users with trustworthy, personalized history guidance to combat misinformation and promote equitable history.
Deployed on a Hugging Face Space.
📦 ./
├── 📁 docs/
├── 📁 src/
│ ├── 📁 ocr/
│ ├── 📁 preprocessing/
│ ├── 📁 chunking/
│ ├── 📁 vector_store/
│ ├── 📁 rag_pipeline/
│ ├── 📁 llm_integration/
│ └── 📁 prompt_engineering/
├── 📁 tests/
│ ├── 📁 unit/
│ └── 📁 integration/
├── 📁 examples/
├── 📁 notebooks/
├── 📁 config/
├── 📄 README.md
├── 📄 CONTRIBUTING.md
├── 📄 requirements.txt
├── 📄 .gitignore
└── 📄 LICENSE
- docs/ — General project documentation.
  - Examples: quick-start guide, project architecture, and technical specifications.
- src/ — Main directory containing the source code, organized by module.
  - Subdirectories:
    - ocr/: Text extraction from documents.
    - preprocessing/: Document cleaning and standardization pipelines.
    - chunking/: Methods for splitting documents into usable chunks.
    - vector_store/: Vector database integration.
    - rag_pipeline/: Implementation of the RAG (Retrieval-Augmented Generation) pipeline.
    - llm_integration/: Management of LLM models for answer generation.
    - prompt_engineering/: Modules for reformulating and optimizing queries.
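As an illustration of what the chunking/ module might expose, here is a minimal sliding-window splitter. This is a sketch: the function name `chunk_text` and its parameters are illustrative, not the project's actual API.

```python
def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into overlapping character chunks (illustrative helper)."""
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    chunks = []
    step = chunk_size - overlap  # advance by chunk_size minus the overlap
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
    return chunks

# Example: small sizes so the overlap is easy to see
pieces = chunk_text("abcdefghijklmnop", chunk_size=10, overlap=3)
```

Overlapping chunks help the retriever keep context that would otherwise be cut at a chunk boundary.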
- tests/
  - unit/: Unit tests for each module.
  - integration/: Integration tests across multiple modules.
- examples/ — Working examples demonstrating how to use the main modules.
- notebooks/ — Jupyter notebooks for quick experiments and demonstrations.
- config/ — Configuration files for libraries, pipelines, and environments.
Groq provides access to high-performance LLM APIs with a free tier (subject to request limits) that is sufficient for RAG applications, making it well suited to quick prototyping and testing.
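A hedged sketch of how the llm_integration/ module might call Groq's OpenAI-style chat API. The `build_messages` helper and the model name are assumptions for illustration; the network call is guarded so the snippet runs without credentials.

```python
import os

def build_messages(question: str, context: str) -> list[dict]:
    """Assemble a chat payload grounding the answer in retrieved context (illustrative helper)."""
    return [
        {"role": "system", "content": "Answer using only the provided context."},
        {"role": "user", "content": f"Context:\n{context}\n\nQuestion: {question}"},
    ]

messages = build_messages("When did the event occur?", "…retrieved passage…")

# Actual call requires a GROQ_API_KEY; the model name is an assumption.
if os.environ.get("GROQ_API_KEY"):
    from groq import Groq  # pip install groq
    client = Groq()
    reply = client.chat.completions.create(
        model="llama-3.1-8b-instant",
        messages=messages,
    )
    print(reply.choices[0].message.content)
```

Keeping prompt assembly separate from the API call makes it easy to unit-test without hitting the rate limit.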
LangChain is a framework for developing applications powered by large language models (LLMs).
For these applications, LangChain simplifies the entire application lifecycle:
- Open-source libraries: Build your applications using LangChain's open-source components and third-party integrations. Use LangGraph to build stateful agents with first-class streaming and human-in-the-loop support.
- Productionization: Inspect, monitor, and evaluate your apps with LangSmith so that you can constantly optimize and deploy with confidence.
- Deployment: Turn your LangGraph applications into production-ready APIs and Assistants with LangGraph Platform.
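To make the retrieval-augmented pattern these tools support concrete, here is a dependency-free sketch of the retrieval step. A toy bag-of-words similarity stands in for real embeddings; in the repository this logic would live behind the vector_store/ and rag_pipeline/ modules, typically via LangChain retrievers.

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    """Toy bag-of-words 'embedding' standing in for a real embedding model."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def retrieve(query: str, docs: list[str], k: int = 1) -> list[str]:
    """Return the k documents most similar to the query."""
    q = embed(query)
    ranked = sorted(docs, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:k]

docs = [
    "The French Revolution began in 1789.",
    "The printing press was invented by Gutenberg.",
]
top = retrieve("When did the French Revolution start?", docs)
```

The retrieved passages are then inserted into the LLM prompt, which is the step the generation side of the pipeline handles.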
LangSmith helps your team debug, evaluate, and monitor your language models and intelligent agents. It works with any LLM Application, including a native integration with the LangChain Python and LangChain JS open source libraries.
LangSmith is developed and maintained by LangChain, the company behind the LangChain framework.
🤗 Datasets is a lightweight library providing two main features:
- One-line dataloaders for many public datasets: one-liners to download and pre-process any of the major public datasets (image datasets, audio datasets, text datasets in 467 languages and dialects, etc.) provided on the HuggingFace Datasets Hub. With a simple command like `squad_dataset = load_dataset("squad")`, get any of these datasets ready to use in a dataloader for training/evaluating an ML model (NumPy/Pandas/PyTorch/TensorFlow/JAX).
- Efficient data pre-processing: simple, fast and reproducible data pre-processing for the public datasets as well as your own local datasets in CSV, JSON, text, PNG, JPEG, WAV, MP3, Parquet, etc. With simple commands like `processed_dataset = dataset.map(process_example)`, efficiently prepare the dataset for inspection and ML model evaluation and training.