You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/dev-box/concept-dev-box-serverless-gpu.md
+27-31Lines changed: 27 additions & 31 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,8 +1,8 @@
1
1
---
2
-
title: Serverless GPU compute in Microsoft Dev Box
2
+
title: Serverless GPU Compute in Microsoft Dev Box
3
3
description: Learn about serverless GPU compute in Microsoft Dev Box, how it works, benefits for developers and organizations, and key use cases.
4
4
ms.service: dev-box
5
-
ms.topic: concept-article
5
+
ms.topic: how-to
6
6
ms.date: 05/05/2025
7
7
author: RoseHJM
8
8
ms.author: rosemalcolm
@@ -11,9 +11,9 @@ ai-usage: ai-generated
11
11
#customer intent: As a business decision-maker, I want to evaluate serverless GPU compute in Dev Box so that I can determine its value for my team’s workflows.
12
12
---
13
13
14
-
# Serverless GPU compute in Microsoft Dev Box
14
+
# Use Serverless GPU compute in Microsoft Dev Box
15
15
16
-
Serverless GPU compute in Microsoft Dev Box lets you use GPU resources on demand for compute-intensive tasks like AI model training, inference, and data processing. Microsoft Dev Box serverless GPU compute enables developers to access powerful GPU resources on demand without requiring permanent infrastructure provisioning or complex setup. This article explains what serverless GPU compute is, how it works, and key scenarios for its use.
16
+
Serverless GPU compute in Microsoft Dev Box (preview) lets you spin up dev boxes with GPU acceleration — no additional setup required. Microsoft Dev Box serverless GPU compute enables developers to access powerful GPU resources on demand without requiring permanent infrastructure provisioning or complex setup. This article explains what serverless GPU compute is, how it works, and key scenarios for its use.
17
17
18
18
## What is serverless GPU compute?
19
19
@@ -44,16 +44,16 @@ Serverless GPU compute in Microsoft Dev Box offers distinct benefits for both de
44
44
45
45
-**No setup required**: Access GPU compute with a single click from your Dev Box environment
46
46
- Access GPU compute with one step from your Dev Box environment.
47
-
-**No permission barriers**: Use GPU resources without needing rights to create cloud infrastructure
48
-
-**Integrated development experience**: Seamlessly use GPU compute within familiar tools like Windows Terminal, Visual Studio, and VS Code
49
-
-**Zero configuration**: GPU sessions start automatically when needed and stop when not in use
47
+
-**No permission barriers**: Use GPU resources without needing rights to create cloud infrastructure.
48
+
-**Integrated development experience**: Seamlessly use GPU compute within familiar tools like Windows Terminal, Visual Studio, and VS Code.
49
+
-**Zero configuration**: GPU sessions start automatically when needed and stop when not in use.
50
50
51
51
### For organizations
52
52
53
53
-**Cost optimization**: Pay only for the GPU resources you use instead of provisioning dedicated hardware.
54
-
-**Centralized control**: Manage GPU access through project-level policies
55
-
-**Security and compliance**: Keep sensitive data in your secure corporate network while using GPU resources
56
-
-**Simplified resource management**: Control GPU usage limits at the project level
54
+
-**Centralized control**: Manage GPU access through project-level policies.
55
+
-**Security and compliance**: Keep sensitive data in your secure corporate network while using GPU resources.
56
+
-**Simplified resource management**: Control GPU usage limits at the project level.
57
57
58
58
## How serverless GPU compute works
59
59
@@ -69,47 +69,43 @@ Serverless GPU compute in Dev Box uses Azure Container Apps (ACA) to provide GPU
69
69
These GPU options are supported:
70
70
71
71
-**NVIDIA T4 GPUs**: Readily available with minimal quota concerns
72
-
-**NVIDIA A100 GPUs**: More powerful but available in limited capacity
72
+
73
73
74
74
### Regional availability
75
75
76
76
GPU resources are available in these Azure regions:
77
77
78
78
- West US 3
79
-
- Sweden North
80
-
- Australia East
81
79
82
80
Additional regions may be supported in the future based on demand.
83
81
84
-
### vNet injection
85
-
86
-
vNet injection allows customers to integrate their network and security protocols with the serverless GPU environment. While not required for the proof of concept (POC), this feature will be prioritized for public previews and general availability (GA). With vNet injection, customers can achieve tighter control over network and security configurations.
82
+
## Administration and management
87
83
88
-
vNet injection lets customers control network and security configurations more tightly.
84
+
Admins control serverless GPU access at the project level through Dev Center. Key management capabilities include:
89
85
90
-
### MOBO architecture model
86
+
-**Enable/disable GPU access**: Control whether projects can use serverless GPU resources
87
+
-**Set concurrent GPU limits**: Specify the maximum number of GPUs that can be used simultaneously across a project
91
88
92
-
Serverless GPU compute adopts the MOBO architecture model for ACA integration. In this model, ACA instances are created and managed within the customer’s subscription, providing a more controlled and streamlined management experience. This ensures that the Dev Box service can securely manage ACA sessions without introducing additional complexity.
89
+
Access to serverless GPU resources is managed through project-level properties. When the serverless GPU feature is enabled for a project, all Dev Boxes within that project automatically gain access to GPU compute. This simplifies the access model by removing the need for custom roles or pool-based configurations.
93
90
94
-
### Developer experience
91
+
##Configure Serverless GPU
95
92
96
-
Developers can access serverless GPU compute through:
93
+
To configure
97
94
98
-
-**Windows Terminal**: Launch a GPU-powered shell directly from Windows Terminal
99
-
-**Visual Studio**: Access GPU compute from within the Visual Studio environment
100
-
-**VS Code with AI Toolkit**: Use seamless GPU integration for AI development tasks
95
+
### Register for the subscription
101
96
102
-
The goal is to provide a seamless, native experience where GPU resources are accessible without requiring any setup from the developer.
97
+
1. Sign in to the [Azure portal](https://portal.azure.com).
98
+
1. Navigate to your subscription.
99
+
1. Select **Settings** > **Preview features**.
100
+
1. Select **Dev Box Serverless GPU Preview**, and then select **Register**.
103
101
104
-
##Administration and management
102
+
### Enable GPU for a project
105
103
106
-
Admins control serverless GPU access at the project level through Dev Center. Key management capabilities include:
104
+
1. Go to your project.
105
+
1. Select **Settings** > **Dev box settings**.
106
+
1. Under **AI workloads**, select **Enable**, and then select **Apply**.
107
107
108
-
-**Enable/disable GPU access**: Control whether projects can use serverless GPU resources
109
-
-**Set concurrent GPU limits**: Specify the maximum number of GPUs that can be used simultaneously across a project
110
-
-**Cost controls**: Manage GPU usage within subscription quotas
111
108
112
-
Access to serverless GPU resources is managed through project-level properties. When the serverless GPU feature is enabled for a project, all Dev Boxes within that project automatically gain access to GPU compute. This simplifies the access model by removing the need for custom roles or pool-based configurations.
0 commit comments