OpenAI API: Changes to enable multi-modal for 3.2 11B #1211

byjlw · 2024-09-25T22:23:30Z

Start the server either by specifying the model

python3 torchchat.py server llama3.2-11B

or by specifying paths for the checkpoint

python3 torchchat.py server --checkpoint-path ../llama_3.2_11b/consolidated.pth --tokenizer-path ../llama_3.2_11b/tokenizer.model --params-path torchchat/model_params/Llama-3.2-11B-Vision.json

python3 torchchat.py server
Use this curl command to test the endpoint:

curl http://127.0.0.1:5000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama3.2",
    "messages": [
      {
        "role": "user",
        "content": [
          {
            "type": "text",
            "text": "What'\''s in this image?"
          },
          {
            "type": "image_url",
            "image_url": "data:image/jpeg;base64,iVBORw0KGgoAAAANSUhEUgAAAG0AAABmCAYAAADBPx+VAAAACXBIWXMAAAsTAAALEwEAmpwYAAAAAXNSR0IArs4c6QAAAARnQU1BAACxjwv8YQUAAA3VSURBVHgB7Z27r0zdG8fX743i1bi1ikMoFMQloXRpKFFIqI7LH4BEQ+NWIkjQuSWCRIEoULk0gsK1kCBI0IhrQVT7tz/7zZo888yz1r7MnDl7z5xvsjkzs2fP3uu71nNfa7lkAsm7d++Sffv2JbNmzUqcc8m0adOSzZs3Z+/XES4ZckAWJEGWPiCxjsQNLWmQsWjRIpMseaxcuTKpG/7HP27I8P79e7dq1ars/yL4/v27S0ejqwv+cUOGEGGpKHR37tzJCEpHV9tnT58+dXXCJDdECBE2Ojrqjh071hpNECjx4cMHVycM1Uhbv359B2F79+51586daxN/+pyRkRFXKyRDAqxEp4yMlDDzXG1NPnnyJKkThoK0VFd1ELZu3TrzXKxKfW7dMBQ6bcuWLW2v0VlHjx41z717927ba22U9APcw7Nnz1oGEPeL3m3p2mTAYYnFmMOMXybPPXv2bNIPpFZr1NHn4HMw0KRBjg9NuRw95s8PEcz/6DZELQd/09C9QGq5RsmSRybqkwHGjh07OsJSsYYm3ijPpyHzoiacg35MLdDSIS/O1yM778jOTwYUkKNHWUzUWaOsylE00MyI0fcnOwIdjvtNdW/HZwNLGg+sR1kMepSNJXmIwxBZiG8tDTpEZzKg0GItNsosY8USkxDhD0Rinuiko2gfL/RbiD2LZAjU9zKQJj8RDR0vJBR1/Phx9+PHj9Z7REF4nTZkxzX4LCXHrV271qXkBAPGfP/atWvu/PnzHe4C97F48eIsRLZ9+3a3f/9+87dwP1JxaF7/3r17ba+5l4EcaVo0lj3SBq5kGTJSQmLWMjgYNei2GPT1MuMqGTDEFHzeQSP2wi/jGnkmPJ/nhccs44jvDAxpVcxnq0F6eT8h4ni/iIWpR5lPyA6ETkNXoSukvpJAD3AsXLiwpZs49+fPn5ke4j10TqYvegSfn0OnafC+Tv9ooA/JPkgQysqQNBzagXY55nO/oa1F7qvIPWkRL12WRpMWUvpVDYmxAPehxWSe8ZEXL20sadYIozfmNch4QJPAfeJgW3rNsnzphBKNJM2KKODo1rVOMRYik5ETy3ix4qWNI81qAAirizgMIc+yhTytx0JWZuNI03qsrgWlGtwjoS9XwgUhWGyhUaRZZQNNIEwCiXD16tXcAHUs79co0vSD8rrJCIW98pzvxpAWyyo3HYwqS0+H0BjStClcZJT5coMm6D2LOF8TolGJtK9fvyZpyiC5ePFi9nc/oJU4eiEP0jVoAnHa9wyJycITMP78+eMeP37sXrx44d6+fdt6f82aNdkx1pg9e3Zb5W+RSRE+n+VjksQWifvVaTKFhn5O8my63K8Qabdv33b379/PiAP//vuvW7BggZszZ072/+TJk91YgkafPn166zXB1rQHFvouAWHq9z3SEevSUerqCn2/dDCeta2jxYbr69evk4MHDyY7d+7MjhMnTiTPnz9Pfv/+nfQT2ggpO2dMF8cghuoM7Ygj5iWCqRlGFml0QC/ftGmTmzt3rmsaKDsgBSPh0/8yPeLLBihLkOKJc0jp8H8vUzcxIA1k6QJ/c78tWEyj5P3o4u9+jywNPdJi5rAH9x0KHcl4Hg570eQp3+vHXGyrmEeigzQsQsjavXt38ujRo44LQuDDhw+TW7duRS1HGgMxhNXHgflaNTOsHyKvHK5Ijo2jbFjJBQK9YwFd6RVMzfgRBmEfP37suBBm/p49e1qjEP2mwTViNRo0VJWH1deMXcNK08uUjVUu7s/zRaL+oLNxz1bpANco4npUgX4G2eFbpDFyQoQxojBCpEGSytmOH8qrH5Q9vuzD6ofQylkCUmh
8DBAr+q8JCyVNtWQIidKQE9wNtLSQnS4jDSsxNHogzFuQBw4cyM61UKVsjfr3ooBkPSqqQHesUPWVtzi9/vQi1T+rJj7WiTz4Pt/l3LxUkr5P2VYZaZ4URpsE+st/dujQoaBBYokbrz/8TJNQYLSonrPS9kUaSkPeZyj1AWSj+d+VBoy1pIWVNed8P0Ll/ee5HdGRhrHhR5GGN0r4LGZBaj8oFDJitBTJzIZgFcmU0Y8ytWMZMzJOaXUSrUs5RxKnrxmbb5YXO9VGUhtpXldhEUogFr3IzIsvlpmdosVcGVGXFWp2oU9kLFL3dEkSz6NHEY1sjSRdIuDFWEhd8KxFqsRi1uM/nz9/zpxnwlESONdg6dKlbsaMGS4EHFHtjFIDHwKOo46l4TxSuxgDzi+rE2jg+BaFruOX4HXa0Nnf1lwAPufZeF8/r6zD97WK2qFnGjBxTw5qNGPxT+5T/r7/7RawFC3j4vTp09koCxkeHjqbHJqArmH5UrFKKksnxrK7FuRIs8STfBZv+luugXZ2pR/pP9Ois4z+TiMzUUkUjD0iEi1fzX8GmXyuxUBRcaUfykV0YZnlJGKQpOiGB76x5GeWkWWJc3mOrK6S7xdND+W5N6XyaRgtWJFe13GkaZnKOsYqGdOVVVbGupsyA/l7emTLHi7vwTdirNEt0qxnzAvBFcnQF16xh/TMpUuXHDowhlA9vQVraQhkudRdzOnK+04ZSP3DUhVSP61YsaLtd/ks7ZgtPcXqPqEafHkdqa84X6aCeL7YWlv6edGFHb+ZFICPlljHhg0bKuk0CSvVznWsotRu433alNdFrqG45ejoaPCaUkWERpLXjzFL2Rpllp7PJU2a/v7Ab8N05/9t27Z16KUqoFGsxnI9EosS2niSYg9SpU6B4JgTrvVW1flt1sT+0ADIJU2maXzcUTraGCRaL1Wp9rUMk16PMom8QhruxzvZIegJjFU7LLCePfS8uaQdPny4jTTL0dbee5mYokQsXTIWNY46kuMbnt8Kmec+LGWtOVIl9cT1rCB0V8WqkjAsRwta93TbwNYoGKsUSChN44lgBNCoHLHzquYKrU6qZ8lolCIN0Rh6cP0Q3U6I6IXILYOQI513hJaSKAorFpuHXJNfVlpRtmYBk1Su1obZr5dnKAO+L10Hrj3WZW+E3qh6IszE37F6EB+68mGpvKm4eb9bFrlzrok7fvr0Kfv727dvWRmdVTJHw0qiiCUSZ6wCK+7XL/AcsgNyL74DQQ730sv78Su7+t/A36MdY0sW5o40ahslXr58aZ5HtZB8GH64m9EmMZ7FpYw4T6QnrZfgenrhFxaSiSGXtPnz57e9TkNZLvTjeqhr734CNtrK41L40sUQckmj1lGKQ0rC37x544r8eNXRpnVE3ZZY7zXo8NomiO0ZUCj2uHz58rbXoZ6gc0uA+F6ZeKS/jhRDUq8MKrTho9fEkihMmhxtBI1DxKFY9XLpVcSkfoi8JGnToZO5sU5aiDQIW716ddt7ZLYtMQlhECdBGXZZMWldY5BHm5xgAroWj4C0hbYkSc/jBmggIrXJWlZM6pSETsEPGqZOndr2uuuR5rF169a2HoHPdurUKZM4CO1WTPqaDaAd+GFGKdIQkxAn9RuEWcTRyN2KSUgiSgF5aWzPTeA/lN5rZubMmR2bE4SIC4nJoltgAV/dVefZm72AtctUCJU2CMJ327hxY9t7EHbkyJFseq+EJSY16RPo3Dkq1kkr7+q0bNmyDuLQcZBEPYmHVdOBiJyIlrRDq41YPWfXOxUysi5fvtyaj+2BpcnsUV/oSoEMOk2CQGlr4ckhBwaetBhjCwH0ZHtJROPJkyc7UjcYLDjmrH7ADTEBXFfOYmB0k9oYBOjJ8b4aOYSe7QkKcYhFlq3QYLQhSidNmtS2RATwy8YOM3EQJsUjKiaWZ+vZToUQgzhkHXudb/PW5YMHD9yZM2faPsMwoc7RciYJXbGuBqJ1UIGKKLv915jsvgt
JxCZDubdXr165mzdvtr1Hz5LONA8jrUwKPqsmVesKa49S3Q4WxmRPUEYdTjgiUcfUwLx589ySJUva3oMkP6IYddq6HMS4o55xBJBUeRjzfa4Zdeg56QZ43LhxoyPo7Lf1kNt7oO8wWAbNwaYjIv5lhyS7kRf96dvm5Jah8vfvX3flyhX35cuX6HfzFHOToS1H4BenCaHvO8pr8iDuwoUL7tevX+b5ZdbBair0xkFIlFDlW4ZknEClsp/TzXyAKVOmmHWFVSbDNw1l1+4f90U6IY/q4V27dpnE9bJ+v87QEydjqx/UamVVPRG+mwkNTYN+9tjkwzEx+atCm/X9WvWtDtAb68Wy9LXa1UmvCDDIpPkyOQ5ZwSzJ4jMrvFcr0rSjOUh+GcT4LSg5ugkW1Io0/SCDQBojh0hPlaJdah+tkVYrnTZowP8iq1F1TgMBBauufyB33x1v+NWFYmT5KmppgHC+NkAgbmRkpD3yn9QIseXymoTQFGQmIOKTxiZIWpvAatenVqRVXf2nTrAWMsPnKrMZHz6bJq5jvce6QK8J1cQNgKxlJapMPdZSR64/UivS9NztpkVEdKcrs5alhhWP9NeqlfWopzhZScI6QxseegZRGeg5a8C3Re1Mfl1ScP36ddcUaMuv24iOJtz7sbUjTS4qBvKmstYJoUauiuD3k5qhyr7QdUHMeCgLa1Ear9NquemdXgmum4fvJ6w1lqsuDhNrg1qSpleJK7K3TF0Q2jSd94uSZ60kK1e3qyVpQK6PVWXp2/FC3mp6jBhKKOiY2h3gtUV64TWM6wDETRPLDfSakXmH3w8g9Jlug8ZtTt4kVF0kLUYYmCCtD/DrQ5YhMGbA9L3ucdjh0y8kOHW5gU/VEEmJTcL4Pz/f7mgoAbYkAAAAAElFTkSuQmCC"
          }
        ]
      }
    ],
    "max_tokens": 300
  }'

This is the image
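For scripting the same request, a minimal Python sketch of the payload the curl command sends might look like the following. The helper names `build_image_chat_request` and `post_chat_request` are hypothetical and not part of torchchat; the field layout simply mirrors the curl body above. Note that the base64 data in the example begins with `iVBORw0KGgo`, which is the PNG signature, although the data URL declares `image/jpeg`. The sketch defaults to `image/png` on that basis; whether the server inspects the declared type is not stated in this PR.

```python
import base64
import json
import urllib.request


def build_image_chat_request(image_bytes, prompt, mime_type="image/png",
                             model="llama3.2", max_tokens=300):
    """Build a chat-completions body with one text part and one inline image.

    Hypothetical helper: the structure copies the curl example in this PR.
    """
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "model": model,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {
                        "type": "image_url",
                        # The image travels inline as a data URL.
                        "image_url": f"data:{mime_type};base64,{encoded}",
                    },
                ],
            }
        ],
        "max_tokens": max_tokens,
    }


def post_chat_request(payload,
                      url="http://127.0.0.1:5000/v1/chat/completions"):
    """POST the payload to a locally running server and return parsed JSON.

    Assumes the server from this PR is listening on the address used in the
    curl example and replies with a JSON body.
    """
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the server running, something like `post_chat_request(build_image_chat_request(open("image.png", "rb").read(), "What's in this image?"))` would send the request; `image.png` is a placeholder path.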

pytorch-bot · 2024-09-25T22:23:34Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1211

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 7 New Failures, 6 Unrelated Failures

As of commit 8c14c55 with merge base c454026:

NEW FAILURES - The following jobs have failed:

pull / test-gpu-aoti-bfloat16 (cuda, stories15M) / linux-job (gh)
RuntimeError: Command docker exec -t 70df98bbd91705a30d8399f20bb8feae692b5304feaf236c3df469dc84343a7d /exec failed with exit code 1
pull / test-gpu-aoti-float16 (cuda, stories15M) / linux-job (gh)
RuntimeError: Command docker exec -t 7042ec25d2ee1af58308c2b01cf00997df53cf7955a444504f515fdf9aebaf5f /exec failed with exit code 1
pull / test-gpu-aoti-float32 (cuda, stories15M) / linux-job (gh)
RuntimeError: Command docker exec -t c87d4e5df87558f6d974fdea43031f3956e2b141d6e6ff991bda89fd7b68ec11 /exec failed with exit code 1
pull / test-gpu-compile (cuda, stories15M) / linux-job (gh)
RuntimeError: Command docker exec -t 6d5a4b05d6362c63c823bfc24dacebf1ee5f08afc996e53c37e8d75afe583a23 /exec failed with exit code 1
pull / test-gpu-eval-sanity-check (cuda, stories15M) / linux-job (gh)
RuntimeError: Command docker exec -t 7b1f9e98a8a742060b323dd88d04e2184e5a1b4ee72080244dc88f70cee3243b /exec failed with exit code 1
Run parallel prefill / test-cuda / linux-job (gh)
RuntimeError: Command docker exec -t 491a1c7f69ead5ed38e2f864ce634eb54ddb72624b752400fb228c2a66fef609 /exec failed with exit code 1
Run the aoti runner with CUDA using stories / test-runner-aot-cuda / linux-job (gh)
RuntimeError: Command docker exec -t 712b50700be8914e99d51446571da208f208397c4e7a0d624a497deced10cb4a /exec failed with exit code 1

BROKEN TRUNK - The following jobs failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

pull / test-cpu-eval-sanity-check (aarch64, stories15M) (gh) (trunk failure)
AttributeError: module 'evaluate' has no attribute 'utils'
pull / test-cpu-eval-sanity-check (x86_64, stories15M) (gh) (trunk failure)
AttributeError: module 'evaluate' has no attribute 'utils'
pull / test-cpu-eval-sanity-check-float16 (aarch64, stories15M) (gh) (trunk failure)
AttributeError: module 'evaluate' has no attribute 'utils'
pull / test-cpu-eval-sanity-check-float16 (x86_64, stories15M) (gh) (trunk failure)
AttributeError: module 'evaluate' has no attribute 'utils'
pull / test-cpu-eval-sanity-check-float32 (aarch64, stories15M) (gh) (trunk failure)
AttributeError: module 'evaluate' has no attribute 'utils'
pull / test-cpu-eval-sanity-check-float32 (x86_64, stories15M) (gh) (trunk failure)
AttributeError: module 'evaluate' has no attribute 'utils'

This comment was automatically generated by Dr. CI and updates every 15 minutes.

it was defaulting to too small a number. This will fix some things for now

Gasoonjia · 2024-09-26T03:13:10Z

torchchat/usages/openai_api.py

            )
        except:
-            # can not find max_seq_length in model config, use default value
-            self.max_seq_length = 128


Haha, now I can see where 128 comes from.

It happens to be the same as head_dim, which made it very tricky to trace, LOL.

Thanks to @iseeyuan for spotting it.
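To illustrate why a silent default like this is hard to trace, here is a minimal sketch of a lookup that fails loudly, or at least logs, when `max_seq_length` is missing. The function `resolve_max_seq_length` is hypothetical and is not the code in `openai_api.py`; only the attribute names `text_transformer_args` and `max_seq_length` come from the diff in this PR.

```python
import logging
from types import SimpleNamespace

logger = logging.getLogger(__name__)


def resolve_max_seq_length(model, default=None):
    """Return model.text_transformer_args.max_seq_length if it exists.

    Hypothetical helper. If the attribute is missing, either raise (no
    default given) or log a warning before falling back, so the fallback
    value never appears without explanation.
    """
    args = getattr(model, "text_transformer_args", None)
    value = getattr(args, "max_seq_length", None)
    if value is not None:
        return value
    if default is None:
        raise ValueError("max_seq_length not found in model config")
    logger.warning("max_seq_length not found; falling back to %d", default)
    return default


# Stand-in object for illustration only; a real model would be used here.
example_model = SimpleNamespace(
    text_transformer_args=SimpleNamespace(max_seq_length=2048)
)
example_length = resolve_max_seq_length(example_model)
```

Catching only the missing-attribute case, rather than using a bare `except:`, also avoids masking unrelated errors behind the fallback.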

iseeyuan

LGTM. Thanks!

Gasoonjia · 2024-09-26T03:24:01Z

torchchat/usages/openai_api.py

                + self.speculative_builder_args.speculate_k
                + 1
                if self.draft_model is not None
                else self.model.text_transformer_args.max_seq_length


One of the things on our list is having a unified configuration system for both tune-backend and chat-backend models, so we can get rid of the try .. except here.
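For readers following the diff, the conditional parses as `(... + speculate_k + 1) if draft_model is not None else max_seq_length`, because Python's conditional expression binds more loosely than `+`. A small sketch of that arithmetic follows. The first operand of the sum is not visible in the excerpt, so `base_length` is an assumed stand-in, and `sequence_budget` is a hypothetical name rather than torchchat code.

```python
from typing import Optional


def sequence_budget(base_length: int, speculate_k: Optional[int] = None) -> int:
    """Return the sequence length to allocate.

    Assumption: with a draft model (speculate_k given), the budget grows by
    speculate_k + 1 positions; without one, the base length is used as-is.
    """
    if speculate_k is None:
        return base_length
    return base_length + speculate_k + 1


# Illustrative numbers only; they are not taken from the PR.
without_draft = sequence_budget(2048)
with_draft = sequence_budget(2048, speculate_k=5)
```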

Jack-Khuu

Thanks for hopping in and fixing this

We'll have a separate PR for updating the MM README

Jack-Khuu · 2024-09-26T19:52:39Z

This is unrelated to the failures, so I'm forcing it through.

initial changes

eee8abc

facebook-github-bot added the CLA Signed label Sep 25, 2024

Jack-Khuu changed the title ~~changes to enable multi-modal for 3.2 11B~~ OpenAI API: Changes to enable multi-modal for 3.2 11B Sep 25, 2024

there is an issue in getting max sequence length

8c14c55

it was defaulting to too small a number. This will fix some things for now

byjlw marked this pull request as ready for review September 26, 2024 03:10

byjlw requested review from Gasoonjia, Jack-Khuu, iseeyuan and vmpuri and removed request for Jack-Khuu and vmpuri September 26, 2024 03:11

Gasoonjia reviewed Sep 26, 2024


iseeyuan approved these changes Sep 26, 2024


Gasoonjia reviewed Sep 26, 2024


Gasoonjia approved these changes Sep 26, 2024


Jack-Khuu approved these changes Sep 26, 2024


Jack-Khuu merged commit ae3555b into main Sep 26, 2024
38 of 51 checks passed

6 participants
