
Commit 99fcd06

yueshen2016 and mxinO authored and committed
[BUG FIX 5616904]: Make VILA codebase importable and import configuration before loading model config (#511)
## What does this PR do?

**Type of change:** Bug fix

**Overview:** Make the VILA codebase importable and import its configuration before loading the model config.

Fixes https://nvbugspro.nvidia.com/bug/5616904

## Usage

```python
# Add a code snippet demonstrating how to use this
```

## Testing

```
CUDA_VISIBLE_DEVICES=0 bash -e scripts/huggingface_example.sh --model /models/vila1.5-3b --quant int4_awq --tp 1 --pp 1 --trust_remote_code --kv_cache_free_gpu_memory_fraction 0.5
```

## Before your PR is "*Ready for review*"

- **Make sure you read and follow [Contributor guidelines](https://github.com/NVIDIA/TensorRT-Model-Optimizer/blob/main/CONTRIBUTING.md)** and your commits are signed.
- **Is this change backward compatible?**: Yes/No
- **Did you write any new necessary tests?**: Yes/No
- **Did you add or update any necessary documentation?**: Yes/No
- **Did you update [Changelog](https://github.com/NVIDIA/TensorRT-Model-Optimizer/blob/main/CHANGELOG.rst)?**: Yes/No

## Additional Information

Signed-off-by: Yue <[email protected]>
Signed-off-by: mxin <[email protected]>
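Concretely, the pattern this fix relies on looks roughly like the sketch below: put the VILA sources on `sys.path` and import `llava.model` before the checkpoint config is read. The checkpoint path mirrors the test command above and the directory layout mirrors the diff below; the registration side effect of the import is an assumption about VILA's packaging, not something stated in this commit.

```python
import os
import sys

from transformers import AutoConfig

# Hypothetical checkpoint location, taken from the test command in this PR.
ckpt_path = "/models/vila1.5-3b"

# The VILA sources are assumed to live in a "VILA" directory next to the
# checkpoint, as in the diff below.
vila_path = os.path.join(ckpt_path, "..", "VILA")
if vila_path not in sys.path:
    sys.path.append(vila_path)

# Importing llava.model is assumed to register the custom LlavaLlama classes
# with transformers as a side effect, which is why it must happen *before*
# the config is loaded.
from llava.model import LlavaLlamaConfig, LlavaLlamaModel  # noqa: F401

config = AutoConfig.from_pretrained(ckpt_path, trust_remote_code=True)
```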
1 parent 805d2ea commit 99fcd06

File tree

1 file changed: +7 -2 lines changed


examples/llm_ptq/example_utils.py

Lines changed: 7 additions & 2 deletions
@@ -270,6 +270,13 @@ def get_model(
     if device == "cpu":
         device_map = "cpu"
 
+    # Add VILA to sys.path before loading config if needed
+    if "vila" in ckpt_path.lower():
+        vila_path = os.path.join(ckpt_path, "..", "VILA")
+        if vila_path not in sys.path:
+            sys.path.append(vila_path)
+        from llava.model import LlavaLlamaConfig, LlavaLlamaModel  # noqa: F401
+
     # Prepare config kwargs for loading
     config_kwargs = {"trust_remote_code": trust_remote_code} if trust_remote_code else {}
 
@@ -295,8 +302,6 @@ def get_model(
     model_kwargs.setdefault("torch_dtype", "auto")
 
     if "vila" in ckpt_path.lower():
-        sys.path.append(os.path.join(ckpt_path, "..", "VILA"))
-        from llava.model import LlavaLlamaConfig, LlavaLlamaModel  # noqa: F401
         from transformers import AutoModel
 
         hf_vila = AutoModel.from_pretrained(
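Why the import has to precede the config load: `llava.model` is presumed to register VILA's custom classes with the `transformers` Auto* factories at import time, roughly like the hedged reconstruction below. The class bodies and the `llava_llama` model type are illustrative assumptions, not VILA's actual source.

```python
from transformers import AutoConfig, AutoModel, PretrainedConfig, PreTrainedModel


class LlavaLlamaConfig(PretrainedConfig):
    # Assumed to match the "model_type" field in a VILA checkpoint's config.json.
    model_type = "llava_llama"


class LlavaLlamaModel(PreTrainedModel):
    config_class = LlavaLlamaConfig


# Registration at import time is what later lets AutoConfig.from_pretrained and
# AutoModel.from_pretrained resolve the checkpoint's custom model type.
AutoConfig.register("llava_llama", LlavaLlamaConfig)
AutoModel.register(LlavaLlamaConfig, LlavaLlamaModel)
```

If the import happens only after the config is requested, as in the removed lines above, `AutoConfig.from_pretrained` has no registration to fall back on and the unknown model type cannot be resolved; moving the import (and the `sys.path` update guarding against duplicates) ahead of the config load avoids that.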
