Skip to content

Commit 83c72bc

Browse files
authored
Add Documents (#131)
* Update document * Update how_to_evaluate_internvl_chat_1_5.md * Update evaluation * Update how_to_evaluate_internvl_chat_1_5_using_vlmevalkit.md
1 parent c225130 commit 83c72bc

19 files changed

+1127
-3360
lines changed

README.md

Lines changed: 8 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,9 @@
11
# <img width="60" alt="image" src="https://github.com/OpenGVLab/InternVL/assets/8529570/5aa4cda8-b453-40a0-9336-17012b430ae8"> InternVL Family: Closing the Gap to Commercial Multimodal Models with Open-Source Suites —— A Pioneering Open-Source Alternative to GPT-4V
22

3-
\[[Update Blog](./BLOG.md)\] \[[Paper](https://arxiv.org/abs/2312.14238)\] \[[InternVL 1.5 Technical Report](https://arxiv.org/abs/2404.16821)\] \[[Chat Demo](https://internvl.opengvlab.com/)\] [[HuggingFace Demo]](https://huggingface.co/spaces/OpenGVLab/InternVL) \[[Quick Start](#quick-start-with-huggingface)\] \[[中文解读](https://zhuanlan.zhihu.com/p/675877376)\]
3+
\[[Update Blog](./BLOG.md)\] \[[Paper](https://arxiv.org/abs/2312.14238)\] \[[InternVL 1.5 Technical Report](https://arxiv.org/abs/2404.16821)\] \[[Chat Demo](https://internvl.opengvlab.com/)\] [\[HuggingFace Demo\]](https://huggingface.co/spaces/OpenGVLab/InternVL) \[[Quick Start](#quick-start-with-huggingface)\] \[[中文解读](https://zhuanlan.zhihu.com/p/675877376)\]
44

55
## News🚀🚀🚀
6+
67
- `2024/04/28`: We release the INT8 version of InternVL-Chat-V1-5, see [here](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5-Int8).
78
- `2024/04/28`: We achieve the SOTA performance (75.74) on the Infographics VQA benchmark, see [here](https://rrc.cvc.uab.es/?ch=17&com=evaluation&task=3).
89
- `2024/04/18`: InternVL-Chat-V1.5 has been released at [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5), approaching the performance of GPT-4V and Gemini Pro on various benchmarks like MMMU, DocVQA, ChartQA, MathVista, etc.
@@ -15,8 +16,12 @@
1516
- `2024/01/24`: InternVL-Chat-V1.1 is released, it supports Chinese and has stronger OCR capability, see [here](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-1) or try our [demo](https://internvl.opengvlab.com/).
1617
- `2024/01/16`: We release our [customized mmcv/mmsegmentation/mmdetection code](https://github.com/OpenGVLab/InternVL-MMDetSeg), integrated with DeepSpeed, which can be used for training large-scale object detection and semantic segmentation models.
1718

18-
## Compared with SOTA VLLMs
19+
## Documents
1920

21+
- How to Evaluate InternVL-Chat-V1-5? [\[link\]](./document/how_to_evaluate_internvl_chat_1_5.md)
22+
- How to Evaluate InternVL-Chat-V1-5 using VLMEvalKit? (Recommend) [\[link\]](./document/how_to_evaluate_internvl_chat_1_5_using_vlmevalkit.md)
23+
24+
## Compared with SOTA VLLMs
2025

2126
<p align="center"><img width="500" alt="image" src="https://github.com/OpenGVLab/InternVL/assets/23737120/38e8a632-229c-4b20-b7e1-77299dfc6cee"></p>
2227

@@ -34,7 +39,7 @@ InternVL scales up the ViT to _**6B parameters**_ and aligns it with LLM.
3439

3540
| Model | Date | Download | Note |
3641
| ----------------------- | ---------- | ------------------------------------------------------------------------------------ | ------------------------------------------------------------------------------------------------------------------------------------------------------------------ |
37-
| InternVL−Chat−V1.5-Int8 | 2024.04.28 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5-Int8) | The INT8 version of InternVL-Chat-V1-5 |
42+
| InternVL−Chat−V1.5-Int8 | 2024.04.28 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5-Int8) | The INT8 version of InternVL-Chat-V1-5 |
3843
| InternVL−Chat−V1.5 | 2024.04.18 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-5) | support 4K image; super strong OCR; Approaching the performance of GPT-4V and Gemini Pro on various benchmarks like MMMU, DocVQA, ChartQA, MathVista, etc. (🔥new) |
3944
| InternVL−Chat−V1.2−Plus | 2024.02.21 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-2-Plus) | more SFT data and stronger |
4045
| InternVL−Chat−V1.2 | 2024.02.11 | 🤗 [HF link](https://huggingface.co/OpenGVLab/InternVL-Chat-V1-2) | scaling up LLM to 34B |
@@ -618,7 +623,6 @@ for question, response in zip(questions, responses):
618623
print(response)
619624
```
620625

621-
622626
</details>
623627

624628
## Chat Web Demo

0 commit comments

Comments
 (0)