Skip to content

SANA Ultra HD Fast Text to Image Model from NVIDIA Step by Step Tutorial on Windows Cloud and Kaggle

FurkanGozukara edited this page Oct 17, 2025 · 1 revision

SANA: Ultra HD Fast Text to Image Model from NVIDIA Step by Step Tutorial on Windows, Cloud & Kaggle

SANA: Ultra HD Fast Text to Image Model from NVIDIA Step by Step Tutorial on Windows, Cloud & Kaggle

image Hits Patreon BuyMeACoffee Furkan Gözükara Medium Codio Furkan Gözükara Medium

YouTube Channel Furkan Gözükara LinkedIn Udemy Twitter Follow Furkan Gözükara

Step by step tutorial and a 1-Click to installer having very advanced Gradio APP to use newest Text-to-Image SANA Model on your Windows PC locally and also on cloud services such as Massed Compute, RunPod and free Kaggle. SANA's most powerful feature is being able to generate 4 Megapixel resolution (2048x2048) very fast natively.

🔗 Full Instructions, Configs, Installers, Information and Links Shared Post (the one used in the tutorial) ⤵️

▶️ https://www.patreon.com/posts/click-to-open-post-used-in-tutorial-116474081

🔗 SECourses Official Discord 9500+ Members ⤵️

▶️ https://discord.com/servers/software-engineering-courses-secourses-772774097734074388

🔗 Stable Diffusion, FLUX, Generative AI Tutorials and Resources GitHub ⤵️

▶️ https://github.com/FurkanGozukara/Stable-Diffusion

🔗 SECourses Official Reddit - Stay Subscribed To Learn All The News and More ⤵️

▶️ https://www.reddit.com/r/SECourses/

🔗 Official Repository of NVIDIA Labs SANA Model ⤵️

▶️ https://github.com/NVlabs/Sana

00:00:00 Introduction to the published by NVIDIA SANA model step by step tutorial

00:02:48 How to install SANA model on Windows and start using

00:05:35 How to verify installation and save installation logs in case of an error to report back to us

00:06:03 How to start the APP after installation on Windows and how to use the SANA model properly

00:09:38 Where the generated images are saved in which folder

00:12:11 How to edit the styles that the APP has - prompting styles

00:12:59 How to install and use SANA APP and any of SECourses published AI apps on Massed Compute

00:14:17 How to select accurate category and the template image on Massed Compute cloud service

00:14:25 How to apply our SECourses coupon to get 50% price discount on Massed Compute - permanently working

00:14:46 How to install and setup ThinLinc client to transfer files and use Massed Compute cloud desktop PC

00:15:51 How to connect Massed Compute after initialized and install any AI scripts that we publish e.g. SANA model

00:19:05 How to start the application after it has been installed and use it on your PC (but it will run in Massed Compute server)

00:20:31 How to download individually and as a folder the generated images on Massed Compute to your computer

00:21:30 How to terminate Massed Compute to not spend any credits / money

00:22:03 How to install and use SANA APP and any of SECourses published AI apps on RunPod cloud service

00:24:43 How to start the SANA APP after installation has been completed on RunPod

00:26:34 The speed of RTX 4090 on RunPod for SANA 2K model 4 MegaPixel image generation

00:26:44 How to download individually and as a folder the generated images on RunPod to your computer

00:27:09 How to stop the pod and terminate to not waste any credits / money on RunPod

00:27:24 How to start the APP again that was previously installed on RunPod (not terminated only stopped pod)

00:27:34 How to use SANA APP on a free Kaggle account and any of my developed Kaggle notebooks

00:28:38 Selecting accurate session options on Kaggle like GPUs, accelerator and Internet On

00:29:06 How to run cells and install SANA APP or any APP on Kaggle

00:29:44 How to get Ngrok token and set it up and use it to connect SANA APP from Kaggle

00:30:57 How to download all generates images as a zip file on Kaggle

00:31:46 How to restart the SANA app on Kaggle or any AI APPs same logic

00:32:11 How to see how much GPU time you have left for free on Kaggle - 30 hours every week

Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

We introduce Sana, a text-to-image framework that can efficiently generate images up to 4096 × 4096 resolution. Sana can synthesize high-resolution, high-quality images with strong text-image alignment at a remarkably fast speed, deployable on laptop GPU. Core designs include:

(1) DC-AE: unlike traditional AEs, which compress images only 8×, we trained an AE that can compress images 32×, effectively reducing the number of latent tokens.

(2) Linear DiT: we replace all vanilla attention in DiT with linear attention, which is more efficient at high resolutions without sacrificing quality.

(3) Decoder-only text encoder: we replaced T5 with modern decoder-only small LLM as the text encoder and designed complex human instruction with in-context learning to enhance the image-text alignment.

(4) Efficient training and sampling: we propose Flow-DPM-Solver to reduce sampling steps, with efficient caption labeling and selection to accelerate convergence.

As a result, Sana-0.6B is very competitive with modern giant diffusion model (e.g. Flux-12B), being 20 times smaller and 100+ times faster in measured throughput. Moreover, Sana-0.6B can be deployed on a 16GB laptop GPU, taking less than 1 second to generate a 1024 × 1024 resolution image. Sana enables content creation at low cost.

Video Transcription

  • 00:00:00 Greetings everyone. Today I am going to introduce  you text to image SANA model developed by the  

  • 00:00:07 NVIDIA. The advantage of this model is that it  is able to generate very high resolution images,  

  • 00:00:14 4 megapixel at the moment in a very short time  with a low VRAM memory. So, it is an amazing model  

  • 00:00:23 to generate fast images at high resolution.  This model is not supported widely yet,  

  • 00:00:30 not even the ComfyUI is supporting its 4 megapixel  resolution. However, I have developed a very  

  • 00:00:37 advanced Gradio application with one click to  install on Windows and also use on cloud services  

  • 00:00:45 such as Massed Compute, RunPod, and even a free  Kaggle account notebook. If you are a Linux user,  

  • 00:00:52 you can use Massed Compute or RunPod installer as  well. So, what are the features and the advantages  

  • 00:00:59 of my Gradio application, it is supporting the  latest 4 megapixel SANA 2K model with its native  

  • 00:01:08 pipeline. Moreover, I have improved the pipeline  so that it is doing offloading and working even on  

  • 00:01:18 lower VRAM GPUs very fast and very efficiently.  Moreover, my Gradio application is supporting  

  • 00:01:24 multi line prompting, so you can write a prompt  to the each line and it will generate each line  

  • 00:01:33 separately as a new prompt. Furthermore, my  Gradio application has more advanced features  

  • 00:01:39 such as you can set the aspect ratio and it  will set the accurate resolution immediately  

  • 00:01:46 accordingly. Moreover, it is supporting negative  prompting, image styles, randomized seed,  

  • 00:01:53 and batch size and number of generations. So, you  can even generate three images at once or you can  

  • 00:02:00 set it to generate hundreds of images in a loop.  All of these features are combined with multi  

  • 00:02:07 prompting and all of the images are saved in an  properly set up outputs folder automatically. So,  

  • 00:02:15 in the first part of this tutorial, I will  show you how to install and use on Windows  

  • 00:02:21 and I will teach how to use this application.  Then I will show how to use on Massed Compute,  

  • 00:02:28 on a cloud service, if you don't have a powerful  GPU, you can use Massed Compute, then I will  

  • 00:02:33 show on RunPod and finally I will show on a free  Kaggle account, so if you don't want to pay any  

  • 00:02:40 money to any cloud services and if you don't have  a powerful GPU, you can use this amazing model on  

  • 00:02:46 a free Kaggle account for free. So, as usual, I  have prepared an amazing post where you will find  

  • 00:02:52 all of the instructions, links and the zip file.  Please read this post from top to the bottom.  

  • 00:03:00 You can download the latest zip file from  the top or also from the very bottom where  

  • 00:03:05 the attachments are. So, what we need to be  able to use the SANA on our Windows Computer,  

  • 00:03:12 we need to have Python 3.10 installed, Cuda 12.4,  cuDNN, C++ tools and Git. How to install them  

  • 00:03:22 has been explained in this amazing tutorial video  step by step, so please follow that. This is only  

  • 00:03:29 one time mandatory, after that you will be able to  use all of the AI applications perfectly fine. So,  

  • 00:03:36 let's go to the bottom and download the zip file.  As I said, please read the announcements, changes,  

  • 00:03:42 updates, everything on this post before you begin  installation and usage. How to install on RunPod,  

  • 00:03:49 on Massed Compute and Kaggle will be shown  after Windows tutorial part has been completed,  

  • 00:03:55 but it is mandatory to watch the Windows tutorial  part to learn how to use this amazing application.  

  • 00:04:02 If you are a Linux user, you can use RunPod  or Massed Compute installers to install on  

  • 00:04:07 your system because RunPod and Massed Compute are  both Ubuntu Linux. Move the zip file into the disc  

  • 00:04:14 drive where you are going to install, I'm going  to install into my Seagate. Extract the files,  

  • 00:04:19 extract. Then make sure that your folder does not  have any special characters or space character and  

  • 00:04:26 use a short path like this. All you need to do  is just double click windows install.bat file.  

  • 00:04:34 It will start the installation automatically for  you, it will install everything into a virtual  

  • 00:04:39 environment with using Python 3.10 and it will  automatically download the necessary SANA models  

  • 00:04:46 to the accurate folders for you. During the  installation, you will also see this error,  

  • 00:04:51 it is not important because we are fixing that  in the next part. Unfortunately, there is not  

  • 00:04:57 an officially supported Triton for Windows but  we are using a pre-compiled wheel. Moreover, you  

  • 00:05:04 will see that my downloader has been optimized for  speed, it will use your entire speed, currently,  

  • 00:05:11 you see this is downloading with 100 megabytes  per second on my Computer even more than that.  

  • 00:05:17 However, if you get an error, however, if you  get an error during the download for any reason,  

  • 00:05:23 we have Windows fix model download.bat file,  you can run this afterwards if you get an error  

  • 00:05:30 during this stage. So, the installation has been  completed, quickly verify whether there are any  

  • 00:05:36 errors or not. Moreover, you should save the logs  if there is an error. How to, you see in the top  

  • 00:05:43 here, right click, export text, save it anywhere  and send me installation logs. On Windows 10,  

  • 00:05:50 you need to select everything like this, ctrl-C,  copy, paste anywhere and send me logs. Then click  

  • 00:05:57 anywhere to close. Now we are ready to start using  the SANA model from the NVIDIA. How to start,  

  • 00:06:04 you see there is Windows start.bat file,  double click it, more info run anyway and  

  • 00:06:09 it is started. Do not run any of the bat files as  an administrator unless it is explicitly stated.  

  • 00:06:17 All of my scripts runs as normal user. So, this is  the interface. How to use this interface, first of  

  • 00:06:23 all, decide which model you are going to use. SANA  2K model is 4 megapixel model and may not work on  

  • 00:06:31 low VRAM GPUs even though I have made an amazing  optimization to the pipeline of the SANA model.  

  • 00:06:39 With my optimization, I am offloading VAE and the  model at the final stage of the image generation,  

  • 00:06:46 otherwise it was very slow and it was using a lot  of VRAM. So first decide which model you are going  

  • 00:06:51 to use. Let's use the SANA 2K model, 4 megapixel,  when you change model, wait until model reloading  

  • 00:06:59 has been completed, you see currently it is  reloading the model. Model has been reloaded.  

  • 00:07:04 Type your prompt like super fast, amazing car  moving on the ice on a sunny day, then here the  

  • 00:07:12 resolution. SANA works slightly different than  other models, it is based on the aspect ratio,  

  • 00:07:19 it is always going to generate same resolution.  So, when you change resolution to 1536 to 1536,  

  • 00:07:27 it is still going to generate in 2048 to 2048  because this is SANA 2K model. Therefore,  

  • 00:07:36 I recommend you to set the aspect ratio from this  dropdown box that I have added for you, let's try  

  • 00:07:43 4:3, so you see it is going to automatically  set accurate resolution for you like this. Then  

  • 00:07:48 there are sampling steps, I didn't notice much  improvement with more steps but you can do any  

  • 00:07:54 number of steps you want. CFG guidance scale is  going to impact how much it is going to follow  

  • 00:08:01 your prompt, this is default value. PAG guidance  scale is again related to the SANA model itself,  

  • 00:08:08 you can try different values and see their impact.  You can also use negative prompt if you wish. But  

  • 00:08:15 it is probably not necessary, moreover, we have  image styles that you can immediately apply.  

  • 00:08:22 And by default, it is going to randomize seed,  so with different seed, you are going to get a  

  • 00:08:28 different image every time you generate an image  even though the prompt is same. Finally, we are  

  • 00:08:34 supporting only one sampler at the moment. You can  also increase batch size, this is going to speed  

  • 00:08:40 up your generation, it is working and you can  generate any number of images as you wish. Okay,  

  • 00:08:47 let's click run and let's begin generation. You  can always follow the status on the CMD window  

  • 00:08:54 which I recommend because always there will be  more information on the CMD window. You see my  

  • 00:09:02 extreme optimization is working amazing, otherwise  it would be using a lot of VRAM. And even though  

  • 00:09:08 we are generating huge resolution, the speed  is great as you are seeing right now. So, it  

  • 00:09:12 is moving models back and forth and the image has  been generated. Let's change aspect ratio to 16:9  

  • 00:09:20 and generate. The first generation can be slower  than the consequent ones. So, it is 1.38 IT per  

  • 00:09:27 second on RTX 3090 TI right now on the 4 megapixel  resolution model. This is a huge resolution. And  

  • 00:09:35 we got the image. So, where are these images  are saved, click open outputs folder. It will  

  • 00:09:41 open the folder, they will be saved like this,  you see online demo image and there will be the  

  • 00:09:48 date of the folder. Let's open one of the images,  so you see this is an original native 4 megapixel  

  • 00:09:55 resolution image. I made this application very,  very advanced, it supports multi line prompt. So,  

  • 00:10:01 I can type here another thing. Let's try car,  truck, apple. And let's try 3D model style and  

  • 00:10:09 let's generate. So, it is going to generate each  one of the prompts one by one, this also supports  

  • 00:10:15 batch size and number of generations. With this  way, you can type your prompts each line and it  

  • 00:10:22 will generate that line with an order. You can  follow the status on here. So, you see after the  

  • 00:10:28 first image, it says generated one over three  images, this is showing everything for you. So,  

  • 00:10:34 this is the first image, we just prompt car, this  is an extremely simple prompt. You can see example  

  • 00:10:41 prompts at the bottom and try them and understand  how they are working. These prompts are from the  

  • 00:10:47 official repository of the SANA model from the  NVIDIA. And as these prompts are generated,  

  • 00:10:54 you will see this interface is getting updated,  this is also a very cool feature that I have  

  • 00:11:00 developed. You see like this to like this. Yeah, I  cannot say this is a great image but our prompt is  

  • 00:11:05 also very simple, that is why. But we can clearly  see that it is following the 3D prompt. You can  

  • 00:11:12 also click this download icon to download it, also  click here to see it as a full screen like this.  

  • 00:11:18 My application is using the latest version of the  Gradio, so it is fully up to date. And the third  

  • 00:11:24 image is arriving. Yes, this is just an apple in  3D. We can also try anime, let's try. By the way,  

  • 00:11:32 I am doing 40 steps, this is not mandatory,  you can also try 18 steps which is default.  

  • 00:11:38 Sometimes it may not display image in here after  it has been generated, in that case, go to the  

  • 00:11:43 outputs folder and the image will be saved there,  so you can see. So, this was the anime car image,  

  • 00:11:50 it is looking pretty, pretty good, it is just  car prompt with the style of anime. And this  

  • 00:11:57 is native 4 megapixel, I can see the resolution  2,688 to 1,536. And the truck. Yes, the anime  

  • 00:12:08 truck is even better than that as you are seeing  right now. If you want to edit these styles, it  

  • 00:12:14 is also possible. Go to the SANA model where you  have installed, go to the app folder. And when you  

  • 00:12:20 edit this SECourses app file, you will see these  styles. And all images have been generated. So,  

  • 00:12:25 this is the anime version of the Apple. It is just  looking amazing. This model is an improvement and  

  • 00:12:31 they are publishing new models, so this model is  promising, hopefully I will also make tutorials  

  • 00:12:37 for training SANA model. Now I will show you how  to use this amazing model on the Massed Compute,  

  • 00:12:44 so if you don't have a powerful GPU, you can  also use it on the Massed Compute, then RunPod,  

  • 00:12:50 then on a free Kaggle account. Unfortunately, on  the free Kaggle account, the SANA 2K model is not  

  • 00:12:55 working but SANA 1K model is working perfectly  fine. Now I will show you how to use the SANA on  

  • 00:13:01 Massed Compute on a cloud service, if you don't  have a powerful GPU, this is the way of using it  

  • 00:13:08 fast or if you want to do more scaling, generating  faster, again, cloud is your choice. So, when you  

  • 00:13:15 scroll down, you will see our cloud section and  in here, I recommend using Massed Compute, after  

  • 00:13:22 Massed Compute, I will show RunPod and Kaggle as  well. If you haven't downloaded the zip file yet,  

  • 00:13:28 go to the very bottom and download zip file from  the attachments. Then please use this registration  

  • 00:13:34 link if you haven't registered yet to the Massed  Compute, I appreciate that. After registering,  

  • 00:13:40 go to billing, set up your credits and load  some balance then. Go to the deploy and in here,  

  • 00:13:46 you will see the available GPUs, so which GPU I  recommend. I recommend L40 if it is available,  

  • 00:13:54 however, since our coupon is now working on L40,  you see it is not available at the moment. So,  

  • 00:13:59 I will go with the second best GPU RTX A6000.  You don't need multiple GPUs but if you want  

  • 00:14:05 to generate with multiple GPUs, you can also  select them. To generate on multiple GPUs,  

  • 00:14:09 you need to start multiple instances of the  application on each GPU with export Cuda visible  

  • 00:14:15 devices. It is so easy. So, from the category  select creator and from image select SECourses.  

  • 00:14:22 Then you see the price is 62 cents per hour. Then  we are going to apply our coupon SECourses verify.  

  • 00:14:29 And now it is 31 cents per hour, deploy. Wait  a while, it will move you to the GPU's page,  

  • 00:14:35 yes. In here, we need to wait until  initialization has been completed, moreover,  

  • 00:14:40 if this is your first time using Massed Compute,  you should install ThinLinc client. The link of  

  • 00:14:46 ThinLinc client is here. Click here then download  according to your platform, it works on Windows,  

  • 00:14:52 Mac, Linux, I'm on Windows, so click here. Click  the downloaded exe file. Click yes, click next. I  

  • 00:15:00 accept, next, install. So, everything is default,  finish. Then on this screen, click options, go to  

  • 00:15:07 the local devices, uncheck all and check drives.  We are going to add a synchronization folder,  

  • 00:15:12 this is for transferring small files. Like  generated images or the downloaded scripts,  

  • 00:15:19 so from here, remove everything. And click add,  first of all, generate a folder on your disc  

  • 00:15:24 where you want synchronization to be happen. Go  to the any disk you want and generate a folder,  

  • 00:15:30 then copy it's path like this. You see here, I  click here and copy with ctrl-C and paste it here.  

  • 00:15:36 You see now it is added. Select the permission  as read and write, so it can work both ways,  

  • 00:15:42 click okay, click okay. Now our ThinLinc client  is ready to connect Massed Compute and use it. All  

  • 00:15:48 we need to do is just wait initialization. Okay,  so the Massed Compute has been initialized, you  

  • 00:15:54 see status is running. Now we need to connect it.  So, copy this, paste it. Copy this, paste it here  

  • 00:16:01 and make sure that username Ubuntu is also copy  pasted here. If you click end existing session,  

  • 00:16:08 it is going to close all of the running  applications on your Massed Compute on the cloud  

  • 00:16:14 service. Do not use this option unless you needed  it, unless you are not able to connect to the  

  • 00:16:20 remote machine or the synchronization folder is  not working, then click connect. Click continue.  

  • 00:16:26 Wait until this screen appears, then click start.  And now this is the interface of the cloud Massed  

  • 00:16:33 Compute machine. So, this is running on the cloud,  not on my Computer, I can do whatever I want here.  

  • 00:16:40 First of all, we should move the downloaded  zip file into our synchronization folder. So,  

  • 00:16:47 the synchronization folder depends on wherever you  have made it, it is inside here in my Computer.  

  • 00:16:53 Then click home here at the left top, go to the  thindrives here and enter inside the folder. This  

  • 00:17:01 folder will be synchronized from your Computer,  these will not work for big files. Remember,  

  • 00:17:07 for big files, use like Google Drive, OneDrive  or Hugging Face, however, for small files, it  

  • 00:17:12 will work. Do not install anything here. First of  all, you should copy the files into the downloads  

  • 00:17:18 or the other folders, so let's refresh this  page to see the zip file. If you don't see it,  

  • 00:17:24 refresh it and here SANA zip file. So, I will  drag and drop it into the downloads folder,  

  • 00:17:30 wait until it is copied here. Yes, it's copied  already. Right click, extract here. Then enter  

  • 00:17:37 inside that folder and then open Massed Compute  instructions txt file. Copy this installation  

  • 00:17:45 command with ctrl-C. Go back to the files here,  click this three dots icon, open in terminal.  

  • 00:17:51 You see this terminal has been started inside  that folder, you need to be inside this folder.  

  • 00:17:58 Then right click and paste. Then hit enter, it  will install everything automatically for us on  

  • 00:18:05 Massed Compute. The Massed Compute installations  are extremely fast. The initialization may take  

  • 00:18:11 a while but after that, once the machine has been  initialized, the installations are like 10 times,  

  • 00:18:19 five times faster than RunPod. That is why I  recommend the Massed Compute. You will see the  

  • 00:18:24 download speeds are just amazing, installation  speeds are just amazing, it will be installed  

  • 00:18:29 under two minutes at max this model because this  is not a very big model. So, after initialized,  

  • 00:18:36 it will be ready in like two minutes to use. So  you see it is downloading with 1 GB per second,  

  • 00:18:43 sometimes, sometimes 500 megabytes per second, the  average was around 600 megabytes per second. This  

  • 00:18:51 is the speed of the Massed Compute, there is no  such speed anywhere, wow, this was downloaded with  

  • 00:18:57 600 megabytes per second. So, the installation  has been completed already, you can quickly verify  

  • 00:19:03 whether there are any errors or not. But it is not  mandatory, then return back to the Massed Compute  

  • 00:19:09 instructions txt file. Copy this part, you see,  ctrl-C, return back to the folder where you have  

  • 00:19:16 extracted and installed. Click this three dots  icon, open in terminal, right click and paste,  

  • 00:19:22 then it will start the application with a Gradio  live link. You can also use it locally inside  

  • 00:19:27 ThinLinc client, however, I don't recommend it  because it is slower. The Gradio will be faster,  

  • 00:19:33 okay, Gradio started, copy this link. You can  access this link from your Computer, from your  

  • 00:19:40 phone, from tablet, wherever you want, even from  your TV, then select the model which you want to  

  • 00:19:46 generate images. Let's select SANA 2K model, wait  for processing to be completed. This will take  

  • 00:19:53 a while when do you first time load the model.  Then we will be able to generate images. Okay,  

  • 00:19:59 so the model has been loaded. Now we are ready  to start generating images, it is exactly same  

  • 00:20:04 as on the Windows tutorial part. Let's generate  a car image as an anime, hit run. If you want to  

  • 00:20:10 see the status of the CMD window, go back to the  ThinLinc client and you will see the generation,  

  • 00:20:16 it is pretty, pretty fast. 1.78 IT per second,  faster than my RTX 3090 and this is only 31  

  • 00:20:25 cents per hour on Massed Compute. And the image has been generated. So when we click this here. So you will see the image here. You can  

  • 00:20:32 click this icon to download it onto your Computer  or what you can do, you can go back to TinLinc  

  • 00:20:39 client, click open outputs folder, inside here,  it should open the folder, if it doesn't open the  

  • 00:20:45 folder I will fix this later. Maybe since this is  running on. Yes, this is running on Gradio live,  

  • 00:20:50 so it won't open it. So go to the SANA  folder, go to the output, online demo images,  

  • 00:20:56 and this is where the generated images  are saved. So, you can right-click, copy,  

  • 00:21:02 and go to the home, go to the thindrives, enter  inside your synchronization folder, and paste it  

  • 00:21:08 there. If you have too many images, it may take a  while for synchronization, but it should be fast  

  • 00:21:13 if you don't have many. Then, when I go back to  my synchronization folder on my computer, I will  

  • 00:21:19 see that it will appear here. Yes, you see. When I  enter inside the folder, I will see the image. So,  

  • 00:21:25 there is no stop and continue feature on the  Massed Compute. If you want to stop using your  

  • 00:21:32 credits, you have to terminate the instance. But  once you terminate the instance, everything will  

  • 00:21:38 be gone forever. So, if you stop the instance,  it will not stop using your credits. You see,  

  • 00:21:44 it is also saying that this does not stop billing.  There is no permanent storage on Massed Compute  

  • 00:21:49 yet. Hopefully, it will arrive soon. So, if you  need permanent storage, you need to use RunPod,  

  • 00:21:54 which I am going to show after this part of the  tutorial. So, let's terminate this instance,  

  • 00:21:59 then it will not be using our credits anymore.  Now, I will show how to use NVIDIA Labs SANA  

  • 00:22:06 model on RunPod. If you don't have a powerful GPU,  or if you want to scale up for any reason, you can  

  • 00:22:12 use RunPod to generate images. My recommendation  is Massed Compute, but if you want RunPod, then  

  • 00:22:20 here it is. First of all, please register with  this link. I appreciate that. Then, let's log in  

  • 00:22:26 after registration. Then, go to the billing, set  up your billing, set some balance, then go to the  

  • 00:22:32 pods. Then, click deploy. Before deploying, make  sure that you have downloaded the ZIP file from  

  • 00:22:39 the attachments and extracted it anywhere. Why?  Because you need to read the RunPod instructions  

  • 00:22:47 TXT file. When you open that file, it will show  you which RunPod template to use. So, this is the  

  • 00:22:54 template that we need. This is important. Pick  any GPU you want. I am going to use RTX 4090. It  

  • 00:23:00 is a very fast GPU, and I am going to use secure  cloud because I need faster initialization right  

  • 00:23:06 now. You can pick the server from here. I find  that this is working a little bit faster. And  

  • 00:23:11 then RTX 4090. Then, click change template, type  torch, and find the RunPod PyTorch 2.2.0. This is  

  • 00:23:21 the recommended version right now that you need.  Click edit template, set the disk like 50 GB,  

  • 00:23:28 and set override. Then, click deploy on demand.  Then, go to my pods. My pods are also here,  

  • 00:23:35 and wait for initialization to be completed. Since  this is an official, very lightweight template,  

  • 00:23:42 it will be very, very fast. If it is not very  fast, then the pod is likely to be broken. Okay,  

  • 00:23:48 it was really fast. Then, click connect and  connect the Jupyter lab. If this is orange or  

  • 00:23:53 not enabled, refresh the page and try again. If  this page doesn't load, refresh and try again,  

  • 00:23:59 and you can also sometimes restart your browser.  Then, click this arrow, go to your downloads,  

  • 00:24:04 and upload the downloaded ZIP file like this.  Then, right-click and extract archive. Then,  

  • 00:24:11 click this refresh icon. Wait for extraction.  Then, double-click, open RunPod instructions.  

  • 00:24:16 The installation command is here. Just  copy it with Ctrl+C, open a new terminal,  

  • 00:24:22 and you can do Ctrl+V, or right-click and paste.  Both of them are working, and it will install  

  • 00:24:29 everything automatically for you. Just wait  for the installation to be completed. So, the  

  • 00:24:33 installation on RunPod has been completed. Quickly  scroll up and see if there are any errors or not.  

  • 00:24:39 The models have been downloaded, everything is  ready. Return back to the RunPod instructions  

  • 00:24:44 TXT file, copy all of these, this is for starting,  open a new terminal, then paste it, and hit Enter.  

  • 00:24:52 It is going to start the SANA app on RunPod with  a Gradio live share link. You can also use RunPod  

  • 00:25:00 proxy to connect. However, we didn't add the  proxy port during the initialization, therefore  

  • 00:25:06 we cannot use it right now. And I also recommend  using the Gradio live share because it is working  

  • 00:25:12 way better than the RunPod proxy system itself. So,  it is getting started. So, the Gradio live share  

  • 00:25:18 arrived, click it. This will be running on RunPod,  not on your computer, and it will be really,  

  • 00:25:23 really fast. You can also access this from  your tablet, from your phone, from your TV,  

  • 00:25:28 wherever you want. Then, use the model that you  want exactly same as in the Windows tutorial part.  

  • 00:25:35 Let's generate an image with the 2K model and  see the speed. The initial loading of the RunPod  

  • 00:25:41 will be way slower than the Massed Compute or  Windows. RunPod hard drives are usually very slow.  

  • 00:25:48 The download speeds are really good right now  because I have optimized these applications for  

  • 00:25:52 faster downloads. You see, it is using the entire  download speed. If I didn't make these changes,  

  • 00:25:59 it would be limited to like 40 MB per second. But  now it is even able to get 300 MB per second. So,  

  • 00:26:06 the downloads of the shards have been completed.  Now, we need to wait for models to be loaded. This  

  • 00:26:12 is the slowest part on RunPod usually. You need  to wait until this processing is gone. The model  

  • 00:26:19 has been switched. So, the model has been loaded.  Let's type an example prompt like car, and let's  

  • 00:26:26 generate with anime. As I said, please watch the  Windows tutorial part to learn how to use this  

  • 00:26:31 application. The speed is just mind-blowingly  fast. We can see. It is 2.5 IT per second,  

  • 00:26:37 2.6 IT per second. It will take a few seconds.  Yes, the image has been generated. You can click  

  • 00:26:44 here to download, and it should download, yes. Or,  you can go to the SANA folder here, output folder,  

  • 00:26:51 online demo image, and this is where they are  saved. Right-click and download as an archive. It  

  • 00:26:57 will download all of the generated images for you.  The usage, the rest is same as on the Windows,  

  • 00:27:03 so watch that part. If you don't want your credits  to be spent, you need to stop the pod. However,  

  • 00:27:09 stopping the pod will still not stop using your  credits. You need to terminate the pod. Once you  

  • 00:27:15 terminate the pod, you won't be able to recover  any of the files again. You can also start this  

  • 00:27:21 pod again and start using it immediately again.  For reusing it, you just need to run this command  

  • 00:27:27 as usual. Okay, let's terminate this pod so  we won't be spending any money, and it is  

  • 00:27:32 terminated. Now, I will show how you can use the  SANA model on a free Kaggle account. This is the  

  • 00:27:38 least recommended way, but it is free. So, if you  don't want to pay any money to any cloud service,  

  • 00:27:45 if you don't have a powerful GPU, you can use  Kaggle free GPUs 30 hours every week. Yes,  

  • 00:27:53 every week. So, how to use Kaggle? First of all,  download the attached ZIP file. Inside there,  

  • 00:27:59 there will be a Kaggle notebook. Extract it.  Go to the kaggle.com, generate your account,  

  • 00:28:04 it is free. After generating your account, make  sure that you have verified your phone number from  

  • 00:28:10 settings, otherwise, it will not work. Then,  click create a new notebook. Then, click file  

  • 00:28:15 and import the notebook, browse files, go to the  folder where you have extracted the ZIP file, and  

  • 00:28:22 you see Kaggle free account notebook version 3.  Double-click it and select, then you see selected,  

  • 00:28:28 and click import. Then, wait until you see this,  then click okay. Now, what you need to do is,  

  • 00:28:34 first of all, you need to select the session  options. You see, accelerator is selected as GPU  

  • 00:28:41 T4x2. This is important. Then, make sure that the  internet is on. If you are not able to select the  

  • 00:28:46 GPU, that means that your account is not phone  verified. Once you have both of them selected,  

  • 00:28:53 click the start session, wait until this becomes  green. It should be pretty fast. Okay, you see  

  • 00:29:01 it is becoming green, and when you click here, you  should see the GPU and everything. Yes, now we are  

  • 00:29:06 ready. Then, click this cell. You can either click  this play icon here, or if it is not visible,  

  • 00:29:13 you can click the run current cell here. Click  it. The first cell is going to install everything  

  • 00:29:19 automatically for us. Just wait until this  cancel run disappears. Once it is disappeared,  

  • 00:29:26 it means that the installation has been completed.  You can also follow the what is happening in the  

  • 00:29:33 output. You will see the outputs like this.  Just wait until cancel run disappears, and the  

  • 00:29:39 cell execution has been completed. So, the Kaggle  installation has been completed. We need to get a  

  • 00:29:46 token from here. Click here, this link, register  an account on the Ngrok, it is free. Copy your  

  • 00:29:52 token, then paste it here, then execute this cell.  Then, you will get a link here. Open this link,  

  • 00:30:00 but do not click visit site yet. Then, click  this cell, and it will start the application.  

  • 00:30:07 We need to wait until the application is started,  then we will click this visit site. So, wait here  

  • 00:30:15 for the application to start. You will see it  is running on local URL. Once you see that, that  

  • 00:30:21 means the application has been started. Let's just  wait a little bit. Yes, now running on local URL,  

  • 00:30:28 that means the application has been started.  Click visit site. Now, we will be able to use  

  • 00:30:33 the SANA model on a free Kaggle account. So, put  your prompt, select your aspect ratio exactly as  

  • 00:30:40 in the Windows tutorial part, then run, and  it will start generating images. The same is  

  • 00:30:45 exactly as on the Windows tutorial part. SANA 2K  model is not working on Kaggle yet because of the  

  • 00:30:51 GPU limitations. And what if you want to download  generated images all at once? There is a cell here  

  • 00:30:59 to run this cell, you need to cancel run. It will  generate images in the Kaggle working directory.  

  • 00:31:04 I will show after the first image has been  generated. Moreover, you can also download from  

  • 00:31:10 the Gradio interface as well. The first generation  may be slower because it will download necessary  

  • 00:31:16 files, load them, but the consequent ones will be  faster. So, the image on the Kaggle notebook has  

  • 00:31:23 been generated. You see, it is here. I can click  here to download the image. It is downloaded. Or,  

  • 00:31:29 I can use this mass download. To download it,  let's cancel run and click this cell, and it  

  • 00:31:35 will generate images inside the Kaggle working  directory. Let's refresh it. Yes, so you see  

  • 00:31:41 images.zip file. Click these three dots icon and  download, and it will download the images. If you  

  • 00:31:47 want to restart the application after this, let's  right-click and clear output of all cells. Then,  

  • 00:31:54 you need to get a new link from here. So, execute  this cell, get a new link, open it, but don't  

  • 00:32:01 click visit site yet. Start the application, and  once we see the local URL, it is same as the first  

  • 00:32:07 time we will be able to start using it. Moreover,  how much time you left will be displayed here.  

  • 00:32:14 You see, currently, I have 30 hours this week to  use. I can generate images 30 hours on the Kaggle  

  • 00:32:22 for free, and it is ready now, and visit site.  Kaggle is extremely picky, so if you generate  

  • 00:32:29 anything not safe for work, they will block  your account. So, use the Kaggle with respect  

  • 00:32:36 to the its rules. And then, stop your session.  If you close your computer, Kaggle will not work,  

  • 00:32:42 but if you are using RunPod or Massed Compute, you  can close your computer, and they will continue  

  • 00:32:48 generating images at the server, and it is ready  again. Let's stop the session. Thank you so much  

  • 00:32:54 for watching. We have Patreon exclusive post index  here. We have Discord server here. So, if you join  

  • 00:33:01 our Discord server, you can chat with me. You  can chat with other 10,000 people. I also have  

  • 00:33:08 a Stable Diffusion Generative AI GitHub. Please  fork this, watch this, star it. If you sponsor,  

  • 00:33:14 I appreciate that. When you scroll down, you will  see we have tutorial videos list. It is very long.  

  • 00:33:20 You can watch all the videos here. And we have  a Reddit. So, go to this Reddit link and also  

  • 00:33:25 follow us on the Reddit as well. And you can  also follow me on my LinkedIn profile. It is  

  • 00:33:31 my real profile. Thank you so much for watching.  Hopefully, see you in future amazing tutorials.

Clone this wiki locally