Skip to content

Commit 4bd9875

Browse files
committed
WIP
1 parent 7696870 commit 4bd9875

File tree

3 files changed

+27
-581
lines changed

3 files changed

+27
-581
lines changed

articles/dev-box/concept-dev-box-serverless-gpu.md

Lines changed: 27 additions & 31 deletions
Original file line numberDiff line numberDiff line change
@@ -1,8 +1,8 @@
11
---
2-
title: Serverless GPU compute in Microsoft Dev Box
2+
title: Serverless GPU Compute in Microsoft Dev Box
33
description: Learn about serverless GPU compute in Microsoft Dev Box, how it works, benefits for developers and organizations, and key use cases.
44
ms.service: dev-box
5-
ms.topic: concept-article
5+
ms.topic: how-to
66
ms.date: 05/05/2025
77
author: RoseHJM
88
ms.author: rosemalcolm
@@ -11,9 +11,9 @@ ai-usage: ai-generated
1111
#customer intent: As a business decision-maker, I want to evaluate serverless GPU compute in Dev Box so that I can determine its value for my team’s workflows.
1212
---
1313

14-
# Serverless GPU compute in Microsoft Dev Box
14+
# Use Serverless GPU compute in Microsoft Dev Box
1515

16-
Serverless GPU compute in Microsoft Dev Box lets you use GPU resources on demand for compute-intensive tasks like AI model training, inference, and data processing. Microsoft Dev Box serverless GPU compute enables developers to access powerful GPU resources on demand without requiring permanent infrastructure provisioning or complex setup. This article explains what serverless GPU compute is, how it works, and key scenarios for its use.
16+
Serverless GPU compute in Microsoft Dev Box (preview) lets you spin up dev boxes with GPU acceleration — no additional setup required. Microsoft Dev Box serverless GPU compute enables developers to access powerful GPU resources on demand without requiring permanent infrastructure provisioning or complex setup. This article explains what serverless GPU compute is, how it works, and key scenarios for its use.
1717

1818
## What is serverless GPU compute?
1919

@@ -44,16 +44,16 @@ Serverless GPU compute in Microsoft Dev Box offers distinct benefits for both de
4444

4545
- **No setup required**: Access GPU compute with a single click from your Dev Box environment
4646
- Access GPU compute with one step from your Dev Box environment.
47-
- **No permission barriers**: Use GPU resources without needing rights to create cloud infrastructure
48-
- **Integrated development experience**: Seamlessly use GPU compute within familiar tools like Windows Terminal, Visual Studio, and VS Code
49-
- **Zero configuration**: GPU sessions start automatically when needed and stop when not in use
47+
- **No permission barriers**: Use GPU resources without needing rights to create cloud infrastructure.
48+
- **Integrated development experience**: Seamlessly use GPU compute within familiar tools like Windows Terminal, Visual Studio, and VS Code.
49+
- **Zero configuration**: GPU sessions start automatically when needed and stop when not in use.
5050

5151
### For organizations
5252

5353
- **Cost optimization**: Pay only for the GPU resources you use instead of provisioning dedicated hardware.
54-
- **Centralized control**: Manage GPU access through project-level policies
55-
- **Security and compliance**: Keep sensitive data in your secure corporate network while using GPU resources
56-
- **Simplified resource management**: Control GPU usage limits at the project level
54+
- **Centralized control**: Manage GPU access through project-level policies.
55+
- **Security and compliance**: Keep sensitive data in your secure corporate network while using GPU resources.
56+
- **Simplified resource management**: Control GPU usage limits at the project level.
5757

5858
## How serverless GPU compute works
5959

@@ -69,47 +69,43 @@ Serverless GPU compute in Dev Box uses Azure Container Apps (ACA) to provide GPU
6969
These GPU options are supported:
7070

7171
- **NVIDIA T4 GPUs**: Readily available with minimal quota concerns
72-
- **NVIDIA A100 GPUs**: More powerful but available in limited capacity
72+
7373

7474
### Regional availability
7575

7676
GPU resources are available in these Azure regions:
7777

7878
- West US 3
79-
- Sweden North
80-
- Australia East
8179

8280
Additional regions may be supported in the future based on demand.
8381

84-
### vNet injection
85-
86-
vNet injection allows customers to integrate their network and security protocols with the serverless GPU environment. While not required for the proof of concept (POC), this feature will be prioritized for public previews and general availability (GA). With vNet injection, customers can achieve tighter control over network and security configurations.
82+
## Administration and management
8783

88-
vNet injection lets customers control network and security configurations more tightly.
84+
Admins control serverless GPU access at the project level through Dev Center. Key management capabilities include:
8985

90-
### MOBO architecture model
86+
- **Enable/disable GPU access**: Control whether projects can use serverless GPU resources
87+
- **Set concurrent GPU limits**: Specify the maximum number of GPUs that can be used simultaneously across a project
9188

92-
Serverless GPU compute adopts the MOBO architecture model for ACA integration. In this model, ACA instances are created and managed within the customer’s subscription, providing a more controlled and streamlined management experience. This ensures that the Dev Box service can securely manage ACA sessions without introducing additional complexity.
89+
Access to serverless GPU resources is managed through project-level properties. When the serverless GPU feature is enabled for a project, all Dev Boxes within that project automatically gain access to GPU compute. This simplifies the access model by removing the need for custom roles or pool-based configurations.
9390

94-
### Developer experience
91+
## Configure Serverless GPU
9592

96-
Developers can access serverless GPU compute through:
93+
To configure
9794

98-
- **Windows Terminal**: Launch a GPU-powered shell directly from Windows Terminal
99-
- **Visual Studio**: Access GPU compute from within the Visual Studio environment
100-
- **VS Code with AI Toolkit**: Use seamless GPU integration for AI development tasks
95+
### Register for the subscription
10196

102-
The goal is to provide a seamless, native experience where GPU resources are accessible without requiring any setup from the developer.
97+
1. Sign in to the [Azure portal](https://portal.azure.com).
98+
1. Navigate to your subscription.
99+
1. Select **Settings** > **Preview features**.
100+
1. Select **Dev Box Serverless GPU Preview**, and then select **Register**.
103101

104-
## Administration and management
102+
### Enable GPU for a project
105103

106-
Admins control serverless GPU access at the project level through Dev Center. Key management capabilities include:
104+
1. Go to your project.
105+
1. Select **Settings** > **Dev box settings**.
106+
1. Under **AI workloads**, select **Enable**, and then select **Apply**.
107107

108-
- **Enable/disable GPU access**: Control whether projects can use serverless GPU resources
109-
- **Set concurrent GPU limits**: Specify the maximum number of GPUs that can be used simultaneously across a project
110-
- **Cost controls**: Manage GPU usage within subscription quotas
111108

112-
Access to serverless GPU resources is managed through project-level properties. When the serverless GPU feature is enabled for a project, all Dev Boxes within that project automatically gain access to GPU compute. This simplifies the access model by removing the need for custom roles or pool-based configurations.
113109

114110
## Related content
115111

0 commit comments

Comments
 (0)