This repository was archived by the owner on Sep 10, 2025. It is now read-only.

Conversation

@Gasoonjia (Contributor):

This PR adds support for Llava model construction.
End-to-end (E2E) model integration will follow in subsequent PRs.
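For context, a minimal sketch of the construction pattern this targets, assuming the usual Llava-style split into a vision encoder, a projector, and a text decoder; the class, method, and keyword names below are illustrative, not the ones added in this PR.

```python
import torch
from torch import nn


class LlavaLikeModel(nn.Module):
    """Illustration only: Llava-style construction fuses a vision
    encoder, a projector, and a text decoder into a single module."""

    def __init__(self, encoder: nn.Module, projector: nn.Module, decoder: nn.Module):
        super().__init__()
        self.encoder = encoder      # e.g. a CLIP-style vision tower
        self.projector = projector  # maps image features into the decoder's embedding dim
        self.decoder = decoder      # autoregressive text decoder

    def forward(self, tokens: torch.Tensor, images: torch.Tensor) -> torch.Tensor:
        image_embeds = self.projector(self.encoder(images))  # (B, num_patches, D)
        # Assumes the decoder exposes its token embedding table and
        # accepts precomputed input embeddings (hypothetical API).
        token_embeds = self.decoder.tok_embeddings(tokens)    # (B, seq_len, D)
        fused = torch.cat([image_embeds, token_embeds], dim=1)
        return self.decoder(input_embeds=fused)
```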

pytorch-bot (bot) commented Sep 17, 2024:

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/torchchat/1155

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEV

There is 1 currently active SEV. If your PR is affected, please view it below:

✅ You can merge normally! (4 Unrelated Failures)

As of commit 672915a with merge base f730056:

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label (managed by the Meta Open Source bot) on Sep 17, 2024
@Gasoonjia changed the title from "Support Llava Model Construction" to "[llava 2/n] Support Llava Model Construction" on Sep 17, 2024
Comment on lines 41 to 44
from torchtune.models.flamingo import flamingo_decoder, flamingo_vision_encoder
from torchtune.models.llama3_1._component_builders import llama3_1 as llama3_1_builder
from torchtune.modules.model_fusion import DeepFusionModel
from torchtune.models.clip import clip_vision_encoder
Contributor:

Rebase is fun and makes clones

from enum import Enum
from pathlib import Path
from PIL import Image
import requests
Contributor:

Make sure this is included in the install requirements (it probably already is).

Contributor Author:

They can be removed right now; they should be something in the 3/n PR, haha.



@dataclass
class ProjectorArgs:
Contributor:

Let's add a docstring for each of these dataclasses, since they are not the usual Llama classes.
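For illustration, a docstring on such a dataclass might look roughly like this; the field names and defaults are assumptions standing in for whatever the projector actually needs, not the fields in this PR.

```python
from dataclasses import dataclass


@dataclass
class ProjectorArgs:
    """Configuration for the projector that maps vision-encoder features
    into the text decoder's embedding space.

    Attributes:
        in_channels: feature dimension produced by the vision encoder.
        out_channels: embedding dimension expected by the text decoder.
        activation: name of the activation used between projection layers.
    """

    in_channels: int = 1024
    out_channels: int = 4096
    activation: str = "gelu"
```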

Contributor Author:

I think I can remove it right now. We can bring it back later when we design arg classes for the different modules.

Comment on lines 126 to 128
encoder_output = self.encoder(
encoder_input,
)
Contributor:

Suggested change
encoder_output = self.encoder(
encoder_input,
)
encoder_output = self.encoder(encoder_input)

def setup_caches(self, batch_size, max_seq_len):
self.decoder.setup_caches(batch_size, max_seq_len)

def _encoder_feature_select(self, encoder_output):
Contributor:

Return type?
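As a sketch of what the requested annotation could look like (the CLS-token slicing is the common Llava-style feature selection and is only an assumption about this method's body):

```python
from torch import Tensor


def _encoder_feature_select(self, encoder_output: Tensor) -> Tensor:
    # Keep the patch features and drop the leading CLS token; the
    # slicing is illustrative, the explicit return type is the point.
    return encoder_output[:, 1:]
```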

*,
encoder_output: Optional[Tensor],
post_tokens: Optional[Tensor],
):
Contributor:

Return type?
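Similarly, the keyword-only signature could carry explicit annotations end to end; the `tokens` parameter, the defaults, and the `Tensor` return shown here are assumptions about the surrounding definition:

```python
from typing import Optional

from torch import Tensor


def forward(
    self,
    tokens: Tensor,
    *,
    encoder_output: Optional[Tensor] = None,
    post_tokens: Optional[Tensor] = None,
) -> Tensor:
    ...
```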

Comment on lines +239 to +243
modules={
'encoder': clip_vision_encoder,
'decoder': Transformer
},
fusion_class=ConcateFusion,
Contributor:

It's really cool to see them working together!
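For context, a rough sketch of how such a recipe entry could be consumed, mirroring the `return recipe.fusion_class(**modules)` line quoted further down; the `ModelRecipe` stand-in, the builder signatures, and the per-module configs here are assumptions, not torchchat's actual implementation.

```python
from dataclasses import dataclass
from typing import Callable, Dict

from torch import nn


@dataclass
class ModelRecipe:  # toy stand-in for the real recipe class
    modules: Dict[str, Callable[..., nn.Module]]  # e.g. {'encoder': clip_vision_encoder, 'decoder': Transformer}
    fusion_class: type                            # e.g. ConcateFusion


def build_model(recipe: ModelRecipe, configs: Dict[str, dict]) -> nn.Module:
    # Build each named submodule from its config, then let the fusion
    # class stitch them together: recipe.fusion_class(**modules).
    modules = {name: builder(**configs[name]) for name, builder in recipe.modules.items()}
    return recipe.fusion_class(**modules)
```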

elif model_type == ModelType.Llama3_1:
return cls._llama3_1()
elif model_type == ModelType.Llava:
return cls._llava()
Contributor:

match model_type:
    case ModelType.TextOnly:
        return cls._text_only()
    case ModelType.Flamingo:
        return cls._flamingo()
...

Contributor Author (@Gasoonjia), Sep 17, 2024:

Oh YEAH, we are on Python 3.10; it is a good time to switch to the match-case statement!
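Fleshed out against the elif chain above, the switch might read something like this (the Flamingo branch name and the fallback are assumptions):

```python
match model_type:
    case ModelType.TextOnly:
        return cls._text_only()
    case ModelType.Flamingo:
        return cls._flamingo()
    case ModelType.Llama3_1:
        return cls._llama3_1()
    case ModelType.Llava:
        return cls._llava()
    case _:
        raise ValueError(f"Unsupported model type: {model_type}")
```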

return recipe.fusion_class(**modules)


def _replace_know_params(self, params):
Contributor:

Suggested change
def _replace_know_params(self, params):
def _replace_known_params(self, params):

Contributor Author:

stupid grammar issue

@Gasoonjia Gasoonjia merged commit 3b162e2 into main Sep 18, 2024
47 of 51 checks passed
from enum import Enum
from pathlib import Path

import torchvision
Contributor:

This torchvision import seems unused.
