Skip to content

Commit b706744

Browse files
authored
Merge branch 'master' into feat/deploy
2 parents a411848 + 4ae782f commit b706744

File tree

3 files changed

+6
-0
lines changed

3 files changed

+6
-0
lines changed

README.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -12,6 +12,7 @@
1212
**OmniParser** is a comprehensive method for parsing user interface screenshots into structured and easy-to-understand elements, which significantly enhances the ability of GPT-4V to generate actions that can be accurately grounded in the corresponding regions of the interface.
1313

1414
## News
15+
- [2024/10] OmniParser is the #1 trending model on huggingface model hub (starting 10/29/2024).
1516
- [2024/10] Feel free to checkout our demo on [huggingface space](https://huggingface.co/spaces/microsoft/OmniParser)! (stay tuned for OmniParser + Claude Computer Use)
1617
- [2024/10] Both Interactive Region Detection Model and Icon functional description model are released! [Hugginface models](https://huggingface.co/microsoft/OmniParser)
1718
- [2024/09] OmniParser achieves the best performance on [Windows Agent Arena](https://microsoft.github.io/WindowsAgentArena/)!
@@ -64,6 +65,9 @@ To deploy OmniParser to EC2 on AWS via Github Actions:
6465
1. Fork this repository and clone your fork to your local machine.
6566
2. Follow the instructions at the top of [`deploy.py`](https://github.com/microsoft/OmniParser/blob/main/deploy.py).
6667

68+
## Model Weights License
69+
For the model checkpoints on huggingface model hub, please note that icon_detect model is under AGPL license since it is a license inherited from the original yolo model. And icon_caption_blip2 & icon_caption_florence is under MIT license. Please refer to the LICENSE file in the folder of each model: https://huggingface.co/microsoft/OmniParser.
70+
6771
## 📚 Citation
6872
Our technical report can be found [here](https://arxiv.org/abs/2408.00203).
6973
If you find our work useful, please consider citing our work:

gradio_demo.py

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -14,6 +14,8 @@
1414

1515
yolo_model = get_yolo_model(model_path='weights/icon_detect/best.pt')
1616
caption_model_processor = get_caption_model_processor(model_name="florence2", model_name_or_path="weights/icon_caption_florence")
17+
# caption_model_processor = get_caption_model_processor(model_name="blip2", model_name_or_path="weights/icon_caption_blip2")
18+
1719
platform = 'pc'
1820
if platform == 'pc':
1921
draw_bbox_config = {

imgs/saved_image_demo.png

122 KB
Loading

0 commit comments

Comments
 (0)