Skip to content

Commit 6e6efc0

Browse files
authored
docs: some updates for nano3 (#149)
* some fixes * generate colab notebooks
1 parent 8540529 commit 6e6efc0

File tree

7 files changed

+157
-173
lines changed

7 files changed

+157
-173
lines changed

docs/colab_notebooks/1-the-basics.ipynb

Lines changed: 37 additions & 40 deletions
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22
"cells": [
33
{
44
"cell_type": "markdown",
5-
"id": "9f804f90",
5+
"id": "56daa304",
66
"metadata": {},
77
"source": [
88
"# 🎨 Data Designer Tutorial: The Basics\n",
@@ -14,7 +14,7 @@
1414
},
1515
{
1616
"cell_type": "markdown",
17-
"id": "9cb786eb",
17+
"id": "8734a74a",
1818
"metadata": {},
1919
"source": [
2020
"### ⚡ Colab Setup\n",
@@ -25,7 +25,7 @@
2525
{
2626
"cell_type": "code",
2727
"execution_count": null,
28-
"id": "7f45ea56",
28+
"id": "45510d11",
2929
"metadata": {},
3030
"outputs": [],
3131
"source": [
@@ -36,7 +36,7 @@
3636
{
3737
"cell_type": "code",
3838
"execution_count": null,
39-
"id": "ea86e81e",
39+
"id": "4bad4940",
4040
"metadata": {},
4141
"outputs": [],
4242
"source": [
@@ -53,7 +53,7 @@
5353
},
5454
{
5555
"cell_type": "markdown",
56-
"id": "16611c7b",
56+
"id": "0543d90e",
5757
"metadata": {},
5858
"source": [
5959
"### 📦 Import the essentials\n",
@@ -64,7 +64,7 @@
6464
{
6565
"cell_type": "code",
6666
"execution_count": null,
67-
"id": "875342bb",
67+
"id": "90185344",
6868
"metadata": {},
6969
"outputs": [],
7070
"source": [
@@ -85,7 +85,7 @@
8585
},
8686
{
8787
"cell_type": "markdown",
88-
"id": "b58ac676",
88+
"id": "e6fcf82b",
8989
"metadata": {},
9090
"source": [
9191
"### ⚙️ Initialize the Data Designer interface\n",
@@ -98,7 +98,7 @@
9898
{
9999
"cell_type": "code",
100100
"execution_count": null,
101-
"id": "3ce805ad",
101+
"id": "8760c1ef",
102102
"metadata": {},
103103
"outputs": [],
104104
"source": [
@@ -107,7 +107,7 @@
107107
},
108108
{
109109
"cell_type": "markdown",
110-
"id": "50e961ed",
110+
"id": "da9d9f06",
111111
"metadata": {},
112112
"source": [
113113
"### 🎛️ Define model configurations\n",
@@ -124,7 +124,7 @@
124124
{
125125
"cell_type": "code",
126126
"execution_count": null,
127-
"id": "1b07a6a5",
127+
"id": "03760d56",
128128
"metadata": {},
129129
"outputs": [],
130130
"source": [
@@ -135,28 +135,26 @@
135135
"MODEL_ID = \"nvidia/nemotron-3-nano-30b-a3b\"\n",
136136
"\n",
137137
"# We choose this alias to be descriptive for our use case.\n",
138-
"MODEL_ALIAS = \"nemotron-nano-v2\"\n",
139-
"\n",
140-
"# This sets reasoning to False for the nemotron-nano-v2 model.\n",
141-
"SYSTEM_PROMPT = \"/no_think\"\n",
138+
"MODEL_ALIAS = \"nemotron-nano-v3\"\n",
142139
"\n",
143140
"model_configs = [\n",
144141
" ModelConfig(\n",
145142
" alias=MODEL_ALIAS,\n",
146143
" model=MODEL_ID,\n",
147144
" provider=MODEL_PROVIDER,\n",
148145
" inference_parameters=ChatCompletionInferenceParams(\n",
149-
" temperature=0.5,\n",
146+
" temperature=1.0,\n",
150147
" top_p=1.0,\n",
151-
" max_tokens=1024,\n",
148+
" max_tokens=2048,\n",
149+
" extra_body={\"chat_template_kwargs\": {\"enable_thinking\": False}},\n",
152150
" ),\n",
153151
" )\n",
154152
"]"
155153
]
156154
},
157155
{
158156
"cell_type": "markdown",
159-
"id": "6d873251",
157+
"id": "a968637c",
160158
"metadata": {},
161159
"source": [
162160
"### 🏗️ Initialize the Data Designer Config Builder\n",
@@ -171,7 +169,7 @@
171169
{
172170
"cell_type": "code",
173171
"execution_count": null,
174-
"id": "d45fac13",
172+
"id": "e5768870",
175173
"metadata": {},
176174
"outputs": [],
177175
"source": [
@@ -180,7 +178,7 @@
180178
},
181179
{
182180
"cell_type": "markdown",
183-
"id": "c35b0274",
181+
"id": "d12c1559",
184182
"metadata": {},
185183
"source": [
186184
"## 🎲 Getting started with sampler columns\n",
@@ -197,7 +195,7 @@
197195
{
198196
"cell_type": "code",
199197
"execution_count": null,
200-
"id": "14cb9967",
198+
"id": "3c47fbe6",
201199
"metadata": {},
202200
"outputs": [],
203201
"source": [
@@ -206,7 +204,7 @@
206204
},
207205
{
208206
"cell_type": "markdown",
209-
"id": "40945aea",
207+
"id": "b47862c5",
210208
"metadata": {},
211209
"source": [
212210
"Let's start designing our product review dataset by adding product category and subcategory columns.\n"
@@ -215,7 +213,7 @@
215213
{
216214
"cell_type": "code",
217215
"execution_count": null,
218-
"id": "a7d87e00",
216+
"id": "6ff2257f",
219217
"metadata": {},
220218
"outputs": [],
221219
"source": [
@@ -296,7 +294,7 @@
296294
},
297295
{
298296
"cell_type": "markdown",
299-
"id": "48699878",
297+
"id": "a26f889e",
300298
"metadata": {},
301299
"source": [
302300
"Next, let's add samplers to generate data related to the customer and their review.\n"
@@ -305,7 +303,7 @@
305303
{
306304
"cell_type": "code",
307305
"execution_count": null,
308-
"id": "df84faf3",
306+
"id": "e603d4cc",
309307
"metadata": {},
310308
"outputs": [],
311309
"source": [
@@ -342,7 +340,7 @@
342340
},
343341
{
344342
"cell_type": "markdown",
345-
"id": "8288352d",
343+
"id": "cf5070af",
346344
"metadata": {},
347345
"source": [
348346
"## 🦜 LLM-generated columns\n",
@@ -357,7 +355,7 @@
357355
{
358356
"cell_type": "code",
359357
"execution_count": null,
360-
"id": "157919b4",
358+
"id": "775c6fa8",
361359
"metadata": {},
362360
"outputs": [],
363361
"source": [
@@ -370,7 +368,6 @@
370368
" \"on products related to '{{ product_subcategory }}'. The target age range of the ideal customer is \"\n",
371369
" \"{{ target_age_range }} years old. Respond with only the product name, no other text.\"\n",
372370
" ),\n",
373-
" system_prompt=SYSTEM_PROMPT,\n",
374371
" model_alias=MODEL_ALIAS,\n",
375372
" )\n",
376373
")\n",
@@ -382,9 +379,9 @@
382379
" \"You are a customer named {{ customer.first_name }} from {{ customer.city }}, {{ customer.state }}. \"\n",
383380
" \"You are {{ customer.age }} years old and recently purchased a product called {{ product_name }}. \"\n",
384381
" \"Write a review of this product, which you gave a rating of {{ number_of_stars }} stars. \"\n",
385-
" \"The style of the review should be '{{ review_style }}'.\"\n",
382+
" \"The style of the review should be '{{ review_style }}'. \"\n",
383+
" \"Respond with only the review, no other text.\"\n",
386384
" ),\n",
387-
" system_prompt=SYSTEM_PROMPT,\n",
388385
" model_alias=MODEL_ALIAS,\n",
389386
" )\n",
390387
")\n",
@@ -394,7 +391,7 @@
394391
},
395392
{
396393
"cell_type": "markdown",
397-
"id": "009646e4",
394+
"id": "25796666",
398395
"metadata": {},
399396
"source": [
400397
"### 🔁 Iteration is key – preview the dataset!\n",
@@ -411,7 +408,7 @@
411408
{
412409
"cell_type": "code",
413410
"execution_count": null,
414-
"id": "a9c90236",
411+
"id": "ba90ee16",
415412
"metadata": {},
416413
"outputs": [],
417414
"source": [
@@ -421,7 +418,7 @@
421418
{
422419
"cell_type": "code",
423420
"execution_count": null,
424-
"id": "3cfe180e",
421+
"id": "db9d6f8a",
425422
"metadata": {},
426423
"outputs": [],
427424
"source": [
@@ -432,7 +429,7 @@
432429
{
433430
"cell_type": "code",
434431
"execution_count": null,
435-
"id": "65b2f595",
432+
"id": "cb555bd5",
436433
"metadata": {},
437434
"outputs": [],
438435
"source": [
@@ -442,7 +439,7 @@
442439
},
443440
{
444441
"cell_type": "markdown",
445-
"id": "2134fa0f",
442+
"id": "b35ee52b",
446443
"metadata": {},
447444
"source": [
448445
"### 📊 Analyze the generated data\n",
@@ -455,7 +452,7 @@
455452
{
456453
"cell_type": "code",
457454
"execution_count": null,
458-
"id": "8a37dd61",
455+
"id": "0d15fb8d",
459456
"metadata": {},
460457
"outputs": [],
461458
"source": [
@@ -465,7 +462,7 @@
465462
},
466463
{
467464
"cell_type": "markdown",
468-
"id": "b715bc3a",
465+
"id": "4fefec9f",
469466
"metadata": {},
470467
"source": [
471468
"### 🆙 Scale up!\n",
@@ -478,7 +475,7 @@
478475
{
479476
"cell_type": "code",
480477
"execution_count": null,
481-
"id": "565f03a1",
478+
"id": "395faa2c",
482479
"metadata": {},
483480
"outputs": [],
484481
"source": [
@@ -488,7 +485,7 @@
488485
{
489486
"cell_type": "code",
490487
"execution_count": null,
491-
"id": "9d4c91ad",
488+
"id": "65dcd625",
492489
"metadata": {},
493490
"outputs": [],
494491
"source": [
@@ -501,7 +498,7 @@
501498
{
502499
"cell_type": "code",
503500
"execution_count": null,
504-
"id": "93c5a082",
501+
"id": "1aef103b",
505502
"metadata": {},
506503
"outputs": [],
507504
"source": [
@@ -513,7 +510,7 @@
513510
},
514511
{
515512
"cell_type": "markdown",
516-
"id": "13f7c942",
513+
"id": "09ec21ba",
517514
"metadata": {},
518515
"source": [
519516
"## ⏭️ Next Steps\n",

0 commit comments

Comments
 (0)