Skip to content

Commit 77cc07a

Browse files
SilviaseCopilotspeed1313
authored
CC_OCR 日本語サブセットの追加 (#168)
* add CCOCR Task * add cvqa japanese subset Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * refactoring and add results --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: speed1313 <speedtry13@icloud.com>
1 parent d3bf734 commit 77cc07a

File tree

12 files changed

+1068
-368
lines changed

12 files changed

+1068
-368
lines changed

eval_all.sh

Lines changed: 6 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -30,6 +30,7 @@ declare -A MODEL_GROUP_MAP=(
3030
["sbintuitions/sarashina2-vision-8b"]="sarashina"
3131
["sbintuitions/sarashina2-vision-14b"]="sarashina"
3232
["microsoft/Phi-4-multimodal-instruct"]="phi"
33+
["turing-motors/Heron-NVILA-Lite-15B"]="heron_nvila"
3334
)
3435

3536
# Task list
@@ -44,6 +45,8 @@ declare -a task_list=(
4445
"llava-bench-in-the-wild"
4546
"jic-vqa"
4647
"mecha-ja"
48+
"cc-ocr"
49+
"cvqa"
4750
)
4851

4952
# Define metrics per task
@@ -58,6 +61,8 @@ declare -A METRIC_MAP=(
5861
["llava-bench-in-the-wild"]="llm-as-a-judge,rougel"
5962
["jic-vqa"]="jic-vqa"
6063
["mecha-ja"]="mecha-ja"
64+
["cc-ocr"]="cc-ocr"
65+
["cvqa"]="substring-match"
6166
)
6267

6368
# Result directories
@@ -77,9 +82,7 @@ for RESULT_DIR in "${result_dir_list[@]}"; do
7782
--task_id "$task" \
7883
--metrics "$METRIC" \
7984
--judge_model "gpt-4o-2024-11-20" \
80-
--result_dir "$RESULT_DIR" \
81-
--inference_only
82-
85+
--result_dir "$RESULT_DIR"
8386
done
8487
done
8588
done

examples/heron_nvila.py

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -37,7 +37,9 @@ def __init__(self, model_id="turing-motors/Heron-NVILA-Lite-15B"):
3737
model_id, trust_remote_code=True, device_map="auto"
3838
)
3939

40-
def generate(self, image, text: str, gen_kwargs: GenerationConfig = GenerationConfig()):
40+
def generate(
41+
self, image, text: str, gen_kwargs: GenerationConfig = GenerationConfig()
42+
):
4143
gen_cfg = HFGenerationConfig(**gen_kwargs.__dict__)
4244
prompt = create_prompt(text, image)
4345
with torch.no_grad():

github_pages/public/dataset_url.json

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -28,5 +28,11 @@
2828
},
2929
"MECHA": {
3030
"url": "https://huggingface.co/datasets/llm-jp/MECHA-ja"
31+
},
32+
"CC-OCR": {
33+
"url": "https://huggingface.co/datasets/wulipc/CC-OCR"
34+
},
35+
"CVQA": {
36+
"url": "https://huggingface.co/datasets/afaji/cvqa"
3137
}
3238
}

0 commit comments

Comments
 (0)