-
Notifications
You must be signed in to change notification settings - Fork 71
Add Omni Reader Project #187
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 15 commits
Commits
Show all changes
131 commits
Select commit
Hold shift + click to select a range
f9ed2aa
Initial project setup with dependencies and license
8001947
Add project README with documentation and usage instructions
d97da0c
Add core utility functions for integrations
74a12b0
Add data schemas for the OCR pipeline
db67c1e
Add OCR comparison pipeline framework
377263b
Add model evaluation step for OCR comparison
f09655d
Add main execution script for OCR pipeline
0de7871
Add image encoding, metrics calculation and prompt utilities
25347d3
Add OCR implementation steps for Gemma3 and Mistral models
2387351
Add Streamlit web interface for interactive OCR comparison
71b8b94
Add sample images for OCR testing
e885c25
update README
c6b5ce6
Add configuration settings for OCR pipeline
2b45149
Add standalone script for quick OCR comparison without ZenML
3ea4f93
Add poetry.lock file to lock dependencies
66d0de1
Add pip requirements file for non-Poetry installations
a747636
Add ImageDescription pydantic model for the OCR pipeline
1c4ee21
update readme
c70c6e9
update assets dir structure
b60e1a6
update prompt.py: add confidence for extracted text
08a8fc6
add confidence field to ImageDescription
b14b16a
organize assets dir
4882760
remove integrations utils: remove mlflow, and docker for now
991298b
add pipelines __init__ file
235993a
add other init files
e34e924
add run_ocr.py file: reusable ocr step for various models
9210dd2
delete run_gemma3_ocr and run_mistral_ocr in favor of unified run_ocr…
4892b3f
update ocr_pipeline to save ocr results and visualizations
168e3c5
update save ocr results step
43da0dc
add configs dir with ocr_config yaml file
ffa8601
add config util file
b7d0b93
add io utils file for loading images/files and for saving results
21711ca
add ocr model utils file for processing chat completions for various …
6a3fd96
add step for loading images/files
19193a4
add run.py file
5b9afed
pass each result dict to save_ocr_results step to resolve StepInterfa…
82f2dd7
update ocr_pipeline to pass model_names in addition to results when s…
d191735
refactor error_analysis and metrics under 1 file and remove confidenc…
4928919
add detailed html string containing ocr results, error analysis and o…
b4c5987
update config file with default image folder
34cb61a
update app.py
625156a
Add detailed metrics to streamlit UI app
efcdd1d
Update README.md: fix broken links, and add project organization
023ca45
remove pyproject.toml and poetry.lock in favor of single requirements…
b3f1137
update requirements.txt
5f01ba3
remove mlflow from readme
7efe6d7
delete unused assets
682ebd8
update prompt to clarify entitiy output
3f739b9
refactor app.py: simplify and remove entities/description
4c00594
update assets
70bbacf
add model_info util forgetting model metadata throughout the app
3e894e8
update save results to dynamically render model info
c5ff0ff
move ImageDescription schema into prompt
cb22d7d
update steps to integrate changes
70c6f45
update ocr pipeline
a33b38c
update run and run_compare_ocr entrypoint files
8f44467
refactor main.py -- remove ground-trith-texts flag
ee4bc8d
update README.md
1063522
update ocr_config.yaml: remove image_patterns, update keys
d8c6ded
update assets
3be4143
delete config.yaml
5e0897a
update ocr model utils to process images with ollama api rather than …
9e06ff7
remove running_from_ui param
27e610c
add run_ollama_ocr_from_ui function for streamlit app
bbfd4e6
add docker settings for ocr pipeline
d1eb77e
update run_compare_ocr
17e1a4c
refactor run_ocr
0de8ddc
add Dockerfile and .dockerignore
16f3b77
update requirements.txt
b65894b
refactor: revert to use litellm+instructor for ollama models
9b4e584
edit api_key access for OpenAI client
87c05df
add logos for different ocr models
73c3aba
update utils for multi-model ocr
58ac407
update steps for multi-model ocr changes
6a0295f
add model_configs util for centralized model configuration and client…
4d96ae3
rename ocr_model_utils to ocr_processing
991e89f
update pipeline for multi-model ocr and update docker settings
0c05d71
update entrypoint run files
9ad7c8a
refactor streamlit app for readability and allow any number of models…
e316cfc
update README
ff58276
update ocr_config.yaml to integrate new changes
0e3638c
update assets
9095d01
update assets
7ec8df3
improve UI for comparing and displaying multiple models
28d698a
add extract json util
70a9c5e
add .env.example
e5f5924
update pipeline
7e7e108
update ocr processing, metrics, and model configs
233f95c
update utils and revamp evaluate_models
903923b
pass MODEL_CONFIGS in streamlit ocr processing
a3be487
set defaults for ground_truth_model for project
515aa0c
add images for README
e877918
update requirements.txt
f328c41
remove demo_models from utils
46bfb7f
update README: add images, links, docs
f1ec1b5
simplify assets dir
575a0d6
delete unused assets
6ecc9b5
update steps
d6cc858
add loaders.py
92e3fa0
update utils
8e70923
separate batch ocr from evaluation into 2 pipelines
7c2e498
add ground_truth_texts samples matching images provided
5b82b7e
remove run_compare_ocr and main.py in favor of run.py
987477d
update config.yaml to align with zenml config file definition
82a4bcc
update README
f501c4a
update .env.example
8447714
re-integrate docker settings and cleanup config.yaml
7062613
update image links in README
c31edfa
update visualization img
868e3b7
update assets
7496685
refactor utils
a792081
move visualization logic into util file
32a1bbc
refactor steps: remove local saving/loading and integrate artifacts
637655a
refactor pipelines: remove save/load from local dirs + split config f…
a7e4fa8
split config into dedicated config files for each pipeline
fc45c9d
add schemas dir
394e8ce
update requirements.txt
aa0190f
update run.py: simplify args, and integrate new config structure
5dcf313
delete main.py
94d8e62
update configs
09e847f
add docker settings in pipeline definitions
51c8eae
update README
df81509
add analyse and Labour to .typos.toml
bbbd82c
cleanup pipelines and add small html visualization for batch pipeline
bcaf3df
cleanup utils
2d4b4bf
update steps/evaluate models to use combined dataframe from updated r…
09c5704
update loader to work with Dataframe directly, and not a dict
cb91618
coerce potential lists being returned in model responses to strings
b443344
update README
894366f
cleanup configs and run_ocr
d8a654d
cleanup requirements.txt
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -1,2 +1,3 @@ | ||
| OPENAI_API_KEY=your_openai_api_key | ||
| MISTRAL_API_KEY=your_mistral_api_key | ||
| OLLAMA_HOST=base_url_for_ollama_host # defaults to "http://localhost:11434/api/generate" if not set |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.