Conversation
…hub.com:chrisalexiuk-nvidia/llama_index into add_feature_nvidia_api_playground_connector_llm
| "playground_mixtral_8x7b": 32_000, | ||
| "playground_llama2_code_34b": 100_000, | ||
| "playground_steerlm_llama_70b": 3072, | ||
| } |
models to add -
- mistralai/mistral-7b-instruct-v0.2
- mistralai/mixtral-8x7b-instruct-v0.1
- google/gemma-7b
- meta/codellama-70b
- meta/llama2-70b
- cohere/aya-101
- cohere/command-r
@chrisalexiuk-nvidia the cohere ones aren't actually part of api catalog, let's remove them
```python
from openai.types.chat import ChatCompletionMessageParam
from openai.types.chat.chat_completion_message import ChatCompletionMessage

AI_PLAYGROUND_MODELS: Dict[str, int] = {
```
```python
from llama_index.llms.nvidia_ai_playground.base import NvidiaAIPlayground

__all__ = ["NvidiaAIPlayground"]
```
let's tentatively go with NVIDIA instead of NvidiaAIPlayground
```python
DEFAULT_PLAYGROUND_MAX_TOKENS = 512


class NvidiaAIPlayground(LLM):
```
chat completion models also support params -
- frequency_penalty: float -2..2
- presence_penalty: float -2..2
- seed
- stop: str | list[str]
include them as properties or require passing as kwargs
in either case, make sure there's test coverage
Passing as kwargs tested as working, added example to "test" notebook.
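The kwargs approach discussed above could be sketched roughly as follows. This is an illustrative stand-in, not the connector's actual code: the helper name `build_completion_kwargs` is hypothetical, and only the parameter names and ranges (`frequency_penalty`/`presence_penalty` in -2..2, `seed`, `stop` as `str | list[str]`) come from the review comment.

```python
from typing import Any, Dict, List, Optional, Union


def build_completion_kwargs(
    frequency_penalty: Optional[float] = None,
    presence_penalty: Optional[float] = None,
    seed: Optional[int] = None,
    stop: Optional[Union[str, List[str]]] = None,
) -> Dict[str, Any]:
    """Collect optional sampling params, range-checking the penalties.

    The resulting dict would be merged into the chat.completions.create
    call as **kwargs (hypothetical helper, for illustration only).
    """
    kwargs: Dict[str, Any] = {}
    for name, value in (
        ("frequency_penalty", frequency_penalty),
        ("presence_penalty", presence_penalty),
    ):
        if value is not None:
            if not -2.0 <= value <= 2.0:
                raise ValueError(f"{name} must be in [-2, 2], got {value}")
            kwargs[name] = value
    if seed is not None:
        kwargs["seed"] = seed
    if stop is not None:
        # normalize a single stop string into a list
        kwargs["stop"] = [stop] if isinstance(stop, str) else stop
    return kwargs
```

Range-checking up front gives users an immediate, local error instead of a round-trip failure from the API.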
```python
response = self._client.chat.completions.create(
    messages=message_dicts, stream=True, **all_kwargs
)

def gen() -> ChatResponseGen:
```
why do you create gen() and return gen() instead of yielding from the loop?
This is just the pattern found in LlamaIndex - they use this ChatResponseGen later.
(Sorry for double ping, was on my personal acct.)
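The pattern in question can be sketched in miniature. The names below (`stream_chat_sketch`, the plain-string stream) are illustrative, not LlamaIndex's internals: the point is that the streaming call happens eagerly in the outer function, while an inner `gen()` is returned so the caller receives a typed generator object.

```python
from typing import Generator, Iterable


def stream_chat_sketch(chunks: Iterable[str]) -> Generator[str, None, None]:
    # In the real connector, `chunks` would be the streaming response
    # from client.chat.completions.create(..., stream=True), and the
    # return type would be ChatResponseGen rather than a str generator.
    def gen() -> Generator[str, None, None]:
        content = ""
        for delta in chunks:
            content += delta
            yield content  # each item carries the accumulated text so far
    return gen()
```

One practical effect of this shape: any setup before `def gen()` runs immediately when `stream_chat_sketch` is called, whereas a function that yields directly would defer everything until first iteration.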
mattf left a comment
how should this be extended to work with a downloaded NIM?
Review threads:
- llama-index-integrations/llms/llama-index-llms-nvidia/llama_index/llms/nvidia/base.py (outdated, resolved)
- llama-index-integrations/llms/llama-index-llms-nvidia/llama_index/llms/nvidia/base.py (outdated, resolved)
- llama-index-integrations/llms/llama-index-llms-nvidia/llama_index/llms/nvidia/utils.py (resolved)
@mattf Added pytest tests covering basic functionality.
I'm still not sure `.mode()` works well with LlamaIndex - but if that's the best way to do it, in your view, I'll try to whip it up today.
```python
def test_validates_api_key_is_present() -> None:
    with CachedNVIDIApiKeys(set_fake_key=True):
```
NVIDIA() should work with no api_key argument and no NVIDIA_API_KEY env var, to support users doing NVIDIA().mode("nim", base_url=...) without a key. Please add a test for that case.
mattf left a comment
need to add support and tests for mode switching w/ NVIDIA().mode("nim", base_url=...)
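The requested mode-switching behavior might be modeled as below. This is a hedged sketch only: `NVIDIASketch` and its attributes are hypothetical stand-ins, grounded just in what the review asks for - construction must succeed without a key, `"nim"` mode requires a `base_url`, and catalog use without any key should fail.

```python
import os
from typing import Optional


class NVIDIASketch:
    """Toy model of NVIDIA().mode("nim", base_url=...) key handling."""

    def __init__(self, api_key: Optional[str] = None) -> None:
        # Deliberately no error here even without a key, so keyless
        # NIM usage (the case the review asks to test) can construct.
        self.api_key = api_key or os.environ.get("NVIDIA_API_KEY")
        self.base_url: Optional[str] = None
        self.mode_name = "catalog"

    def mode(self, name: str, base_url: Optional[str] = None) -> "NVIDIASketch":
        if name == "nim":
            if base_url is None:
                raise ValueError("nim mode requires base_url")
            self.base_url = base_url
        elif name == "catalog" and self.api_key is None:
            # only catalog mode actually needs credentials
            raise ValueError("catalog mode requires an API key")
        self.mode_name = name
        return self
```

Deferring the key check from `__init__` to the point where catalog mode is actually selected is what makes the keyless `NVIDIA().mode("nim", base_url=...)` flow testable.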
Added NVIDIA Catalog Connector