Merged
55 commits
dc041f7
Add generation type to ModelConfig
nabinchha Nov 25, 2025
0d6b830
pass tests
nabinchha Nov 25, 2025
254fd8a
added generate_text_embeddings
nabinchha Nov 25, 2025
1126ea1
tests
nabinchha Nov 25, 2025
744bc8f
remove sensitive=True old artifact no longer needed
nabinchha Nov 25, 2025
b913f8d
Slight refactor
nabinchha Nov 26, 2025
052db7a
slight refactor
nabinchha Nov 26, 2025
5504c8d
Added embedding generator
nabinchha Nov 26, 2025
4b6f877
chunk_separator -> chunk_pattern
nabinchha Nov 26, 2025
04fc0f3
update tests
nabinchha Nov 26, 2025
26d6da1
rename for consistency
nabinchha Nov 26, 2025
6facbd2
Restructure InferenceParameters -> CompletionInferenceParameters, Bas…
nabinchha Nov 26, 2025
2c1b267
Remove purpose from consolidated kwargs
nabinchha Nov 26, 2025
4b1492b
WithModelConfiguration.inference_parameters should be typed wi…
nabinchha Dec 2, 2025
c445caf
Type as WithModelGeneration
nabinchha Dec 2, 2025
4b8aa2b
Add image generation modality
nabinchha Dec 2, 2025
2c5933f
update return type for generate_kwargs
nabinchha Dec 3, 2025
c6c29d4
make generation_type a field of ModelConfig as opposed to a prop reso…
nabinchha Dec 3, 2025
06a724b
remove regex based chunking from embedding generator
nabinchha Dec 3, 2025
bbb6a83
Merge branch 'main' into nmulepati/feat/support-embedding-generation
nabinchha Dec 8, 2025
b9455d4
Remove image generation for now
nabinchha Dec 8, 2025
e5c0b7a
more tests and updates
nabinchha Dec 9, 2025
6460c6b
column_type_is_llm_generated -> column_type_is_model_generated
nabinchha Dec 9, 2025
e294b40
change set to list: fix flaky tests
nabinchha Dec 9, 2025
4e697ec
CompletionInferenceParameters -> ChatCompletionInferenceParameters fo…
nabinchha Dec 9, 2025
d650398
Update docs
nabinchha Dec 9, 2025
4601e3f
fix deprecation warning originating from cli model settings
nabinchha Dec 9, 2025
65ba5bf
update display of inference parameters in cli list
nabinchha Dec 9, 2025
0917d6e
save prog on inference parameter
nabinchha Dec 10, 2025
1aa74dd
updates for the config builder
nabinchha Dec 10, 2025
d72b204
update cli readme
nabinchha Dec 10, 2025
4c53f1f
update cli for inference parameters
nabinchha Dec 10, 2025
bd63f91
Merge branch 'main' into nmulepati/feat/support-embedding-generation
nabinchha Dec 10, 2025
3413799
update inference parameter names
nabinchha Dec 10, 2025
5425df5
flip order of vars
nabinchha Dec 10, 2025
7723764
WithCompletion -> WithChatCompletion
nabinchha Dec 10, 2025
9ae1bb8
specify InferenceParamsT
nabinchha Dec 10, 2025
3aa3326
Update columns.md with EmbeddingColumnConfig info
nabinchha Dec 10, 2025
c73b183
make generation_type a discriminator field in inference params. add c…
nabinchha Dec 10, 2025
6899805
DRY out some stuff in field.py
nabinchha Dec 10, 2025
299479a
Merge branch 'main' into nmulepati/feat/support-embedding-generation
nabinchha Dec 11, 2025
a1f239d
docs for custom model settings
nabinchha Dec 11, 2025
0d61587
Merge branch 'main' into nmulepati/feat/support-embedding-generation
nabinchha Dec 11, 2025
8e91e95
Merge branch 'main' into nmulepati/feat/support-embedding-generation
nabinchha Dec 11, 2025
51dcffa
Update nomenclature. prompt tokens -> input tokens, completion tokens…
nabinchha Dec 11, 2025
7253898
Add nvidia-embedding and openai-embedding to default model configs
nabinchha Dec 12, 2025
0f21576
Merge branch 'main' into nmulepati/feat/support-embedding-generation
nabinchha Dec 12, 2025
9acf600
Fix typo in docs
nabinchha Dec 13, 2025
954d4d0
Merge branch 'main' into nmulepati/feat/support-embedding-generation
nabinchha Dec 13, 2025
c7176b9
Make generate collab notebooks
nabinchha Dec 13, 2025
5b0104a
Address PR comments
nabinchha Dec 15, 2025
a616ed6
Merge branch 'nmulepati/feat/support-embedding-generation' into nmule…
nabinchha Dec 15, 2025
b47495c
Merge branch 'main' into nmulepati/feat/support-embedding-generation
nabinchha Dec 15, 2025
21f4383
Merge branch 'nmulepati/feat/support-embedding-generation' into nmule…
nabinchha Dec 15, 2025
e4ff60b
Merge branch 'main' into nmulepati/docs/123-docs-for-custom-model-set…
nabinchha Dec 15, 2025
62 changes: 31 additions & 31 deletions docs/colab_notebooks/1-the-basics.ipynb
@@ -2,7 +2,7 @@
"cells": [
{
"cell_type": "markdown",
-"id": "3599c474",
+"id": "709c75cf",
"metadata": {},
"source": [
"# 🎨 Data Designer Tutorial: The Basics\n",
@@ -14,7 +14,7 @@
},
{
"cell_type": "markdown",
-"id": "ee8bed13",
+"id": "3cb2774e",
"metadata": {},
"source": [
"### ⚡ Colab Setup\n",
@@ -25,7 +25,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "f43069d1",
+"id": "b886272b",
"metadata": {},
"outputs": [],
"source": [
@@ -36,7 +36,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "c136bf4f",
+"id": "f5cf20f9",
"metadata": {},
"outputs": [],
"source": [
@@ -53,7 +53,7 @@
},
{
"cell_type": "markdown",
-"id": "48739393",
+"id": "e11a4288",
"metadata": {},
"source": [
"### 📦 Import the essentials\n",
@@ -64,7 +64,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "e459cd98",
+"id": "e8faecea",
"metadata": {},
"outputs": [],
"source": [
@@ -85,7 +85,7 @@
},
{
"cell_type": "markdown",
-"id": "b705d204",
+"id": "314d17c1",
"metadata": {},
"source": [
"### ⚙️ Initialize the Data Designer interface\n",
@@ -98,7 +98,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "aee62c85",
+"id": "be3b5c6f",
"metadata": {},
"outputs": [],
"source": [
@@ -107,7 +107,7 @@
},
{
"cell_type": "markdown",
-"id": "ae65c557",
+"id": "1c2852e1",
"metadata": {},
"source": [
"### 🎛️ Define model configurations\n",
@@ -124,7 +124,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "1079200d",
+"id": "5ad52a10",
"metadata": {},
"outputs": [],
"source": [
@@ -156,7 +156,7 @@
},
{
"cell_type": "markdown",
-"id": "9f15426a",
+"id": "25cce9f7",
"metadata": {},
"source": [
"### 🏗️ Initialize the Data Designer Config Builder\n",
@@ -171,7 +171,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "79b8212c",
+"id": "8ff7190c",
"metadata": {},
"outputs": [],
"source": [
@@ -180,7 +180,7 @@
},
{
"cell_type": "markdown",
-"id": "cd1d9e09",
+"id": "6bc3b23e",
"metadata": {},
"source": [
"## 🎲 Getting started with sampler columns\n",
@@ -197,7 +197,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "b3f469d6",
+"id": "4cff01cb",
"metadata": {},
"outputs": [],
"source": [
@@ -206,7 +206,7 @@
},
{
"cell_type": "markdown",
-"id": "e44adc6c",
+"id": "f981ec58",
"metadata": {},
"source": [
"Let's start designing our product review dataset by adding product category and subcategory columns.\n"
@@ -215,7 +215,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "82b32804",
+"id": "70ba24a6",
"metadata": {},
"outputs": [],
"source": [
@@ -296,7 +296,7 @@
},
{
"cell_type": "markdown",
-"id": "bd65456c",
+"id": "6f1a6c59",
"metadata": {},
"source": [
"Next, let's add samplers to generate data related to the customer and their review.\n"
@@ -305,7 +305,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "6d6d4eef",
+"id": "d45b925f",
"metadata": {},
"outputs": [],
"source": [
@@ -342,7 +342,7 @@
},
{
"cell_type": "markdown",
-"id": "eb7b415c",
+"id": "bf49c2b1",
"metadata": {},
"source": [
"## 🦜 LLM-generated columns\n",
@@ -357,7 +357,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "ed811560",
+"id": "669fe324",
"metadata": {},
"outputs": [],
"source": [
@@ -394,7 +394,7 @@
},
{
"cell_type": "markdown",
-"id": "fdc0a2c8",
+"id": "4d93ad9a",
"metadata": {},
"source": [
"### 🔁 Iteration is key – preview the dataset!\n",
@@ -411,7 +411,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "59987c81",
+"id": "7b2466d1",
"metadata": {},
"outputs": [],
"source": [
@@ -421,7 +421,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "0823ca7f",
+"id": "508a2866",
"metadata": {},
"outputs": [],
"source": [
@@ -432,7 +432,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "eca4f0bc",
+"id": "6fbdaf64",
"metadata": {},
"outputs": [],
"source": [
@@ -442,7 +442,7 @@
},
{
"cell_type": "markdown",
-"id": "edd57f85",
+"id": "154e8e71",
"metadata": {},
"source": [
"### 📊 Analyze the generated data\n",
@@ -455,7 +455,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "5c681eee",
+"id": "7e031c7b",
"metadata": {},
"outputs": [],
"source": [
@@ -465,7 +465,7 @@
},
{
"cell_type": "markdown",
-"id": "14bf06f2",
+"id": "a60a1fab",
"metadata": {},
"source": [
"### 🆙 Scale up!\n",
@@ -478,7 +478,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "b7ffead1",
+"id": "e07c6718",
"metadata": {},
"outputs": [],
"source": [
@@ -488,7 +488,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "aa966388",
+"id": "7a5406da",
"metadata": {},
"outputs": [],
"source": [
@@ -501,7 +501,7 @@
{
"cell_type": "code",
"execution_count": null,
-"id": "98e1085c",
+"id": "f0360b0e",
"metadata": {},
"outputs": [],
"source": [
@@ -513,7 +513,7 @@
},
{
"cell_type": "markdown",
-"id": "e0b9c65a",
+"id": "d365dda0",
"metadata": {},
"source": [
"## ⏭️ Next Steps\n",