Fine-Tuning Vision Language Models to Understand Medical Handwriting

alxfgh/Decipher-Doctor-VLM

Doctor Handwriting Reader

Overview

This project fine-tunes Vision Language Models (VLMs) on doctor handwriting to improve the readability and interpretation of medical prescriptions and notes. We're creating synthetic data to improve the model's performance, and we're making the model more accessible through the Decipher Doctor platform.

Features

  • Fine-tuning of Qwen2-VL for doctor handwriting recognition
  • Synthetic data generation for improved model performance
  • Open-source initiative to enhance medical communication
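
As a rough illustration of the synthetic data idea, the sketch below generates paired text samples: a terse prescription line as a doctor might write it, and the plain-language transcription a model could be trained to produce. It is a minimal sketch, not the project's actual pipeline. The vocabularies, the `make_sample` and `make_dataset` helpers, and the output format are all assumptions made for this example, and rendering the text as handwriting-style images is left out.

```python
import random

# Hypothetical vocabularies for illustration only; not taken from the project.
DRUGS = ["amoxicillin", "ibuprofen", "metformin"]
DOSES_MG = [250, 500, 850]
# Common prescription abbreviations and their plain-language expansions.
FREQUENCIES = {
    "qd": "once daily",
    "bid": "twice daily",
    "tid": "three times daily",
}


def make_sample(rng):
    """Return one synthetic prescription line plus its transcription target."""
    drug = rng.choice(DRUGS)
    dose = rng.choice(DOSES_MG)
    abbrev = rng.choice(sorted(FREQUENCIES))
    return {
        # Text that would later be rendered as a handwriting-style image.
        "written": f"{drug} {dose}mg {abbrev}",
        # Plain-language text the model would be trained to output.
        "target": f"{drug} {dose} mg, {FREQUENCIES[abbrev]}",
    }


def make_dataset(n, seed=0):
    """Build n samples; a fixed seed keeps the dataset reproducible."""
    rng = random.Random(seed)
    return [make_sample(rng) for _ in range(n)]


if __name__ == "__main__":
    for sample in make_dataset(3):
        print(sample)
```

Seeding the generator means the same dataset can be regenerated later, which helps when comparing fine-tuning runs.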

Compute Resources

We're actively looking for compute resources to accelerate our synthetic data generation and model fine-tuning. If you can provide GPU time or other computational resources, please contact us.

Contact

Alexander Al-Feghali - alexander.al-feghali@mail.mcgill.ca

Website: Decipher.Doctor

Blog: Doctor Handwriting Reader

Acknowledgements

Getting Started

Prerequisites

  • Python 3.7+
  • PyTorch
  • Transformers library
  • Pillow
  • Matplotlib
  • NumPy
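
A quick way to confirm the prerequisites above are in place is to check whether each package can be located, without importing it. This is a minimal sketch and is not part of the repository. The import names (`torch` for PyTorch, `PIL` for Pillow) are the usual ones for these packages, and `missing_modules` and `python_ok` are hypothetical helper names.

```python
import importlib.util
import sys

# Import names assumed for the packages listed above:
# PyTorch -> torch, Transformers -> transformers, Pillow -> PIL.
REQUIRED_MODULES = ["torch", "transformers", "PIL", "matplotlib", "numpy"]


def missing_modules(names):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def python_ok(minimum=(3, 7)):
    """Check the running interpreter against the minimum version."""
    return sys.version_info >= minimum


if __name__ == "__main__":
    if not python_ok():
        print("Python 3.7 or newer is required.")
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All listed prerequisites were found.")
```

The check only verifies that each package is installed. It does not verify versions, so a Transformers release too old to include Qwen2-VL would still pass.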

Installation

  1. Clone the repository:
    git clone https://github.com/alxfgh/Decipher-Doctor-VLM.git
    
  2. Install the required packages:
    pip install -r requirements.txt
    

Contributing

We welcome contributions to improve the project! If you're interested in helping, please:

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/AmazingFeature)
  3. Commit your changes (git commit -m 'Add some AmazingFeature')
  4. Push to the branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

License

Distributed under the MIT License. See LICENSE for more information.
