You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/core/whats-new/dotnet-9/overview.md
+44-2Lines changed: 44 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -53,9 +53,51 @@ The .NET 9 SDK introduces _workload sets_, where all of your workloads stay at a
53
53
54
54
For more information, see [What's new in the SDK for .NET 9](sdk.md).
55
55
56
-
## ML.NET
56
+
## AI building blocks
57
57
58
-
ML.NET is an open-source, cross-platform framework that enables integration of custom machine-learning models into .NET applications. The latest version, ML.NET 4.0, adds [additional tokenizer support](../../../machine-learning/whats-new/overview.md#additional-tokenizer-support) for tokenizers such as Tiktoken and models such as Llama and CodeGen. <!--Add info about `Tensor<T>` here and in what's new for ML.NET.-->
58
+
.NET 9 introduces a unified layer of C# abstractions through the [Microsoft.Extensions.AI](https://www.nuget.org/packages/Microsoft.Extensions.AI.Abstractions/) and [Microsoft.Extensions.VectorData](https://www.nuget.org/packages/Microsoft.Extensions.VectorData.Abstractions/) packages. These abstractions facilitate interaction with AI services, including small and large language models (SLMs and LLMs), embeddings, vector stores, and middleware.
59
+
60
+
.NET 9 also includes new tensor types that expand AI capabilities. <xref:System.Numerics.Tensors.TensorPrimitives> and the new <xref:System.Numerics.Tensors.Tensor%601> type expand AI capabilities by enabling efficient encoding, manipulation, and computation of multi-dimensional data. You can find these types in the latest release of the [System.Numerics.Tensors package](https://www.nuget.org/packages/System.Numerics.Tensors/).
61
+
62
+
### TensorPrimitives
63
+
64
+
- Expanded method scope: Increased from 40 to nearly 200 overloads, now including numerical operations similar to `Math`, `MathF`, and `INumber<T>` but for spans of values.
65
+
- Performance enhancements: Many operations are now SIMD-optimized for better performance.
66
+
- Generic overloads: Supports any type `T` that implements a certain interface, expanding beyond just spans of float values in .NET.
67
+
68
+
### Tensor\<T>
69
+
70
+
- Builds on top of `TensorPrimitives` for efficient math operations.
71
+
- Provides efficient interop with AI libraries (ML.NET, TorchSharp, ONNX Runtime) using zero copies where possible.
72
+
- Enables easy and efficient data manipulation with indexing and slicing operations.
73
+
74
+
### ML.NET
75
+
76
+
[ML.NET](https://www.nuget.org/packages/Microsoft.ML/) is an open-source, cross-platform framework that enables integration of custom machine-learning models into .NET applications.
77
+
78
+
ML.NET 4.0 brings the following improvements:
79
+
80
+
- New ways to programmatically configure `MLContext` options.
81
+
- Load ONNX models as `Stream`.
82
+
- DataFrame improvements.
83
+
- New capabilities for [tokenizers](#tokenizers).
84
+
- (Experimental) TorchSharp ports of Llama and Phi family of models.
85
+
- (Experimental) CausalLM pipeline APIs.
86
+
87
+
For more information, see [What's new in ML.NET](../../../machine-learning/whats-new/overview.md).
88
+
89
+
#### Tokenizers
90
+
91
+
The [Microsoft.ML.Tokenizers](https://www.nuget.org/packages/Microsoft.ML.Tokenizers) library provides .NET developers with capabilities for encoding and decoding text to tokens. For AI scenarios, this is important to manage context, calculate cost, and preprocess text when working with local models.
92
+
93
+
The latest release introduces significant new capabilities for tokenizers:
94
+
95
+
- Tiktoken for GPT (3, 3.5, 4, 4o, o1) and Llam3 models
96
+
- Llama (based on SentencePiece) for Llama and Mistral models
97
+
- CodeGen for code-generation models like codegen-350M-mono
98
+
- Phi2 (based on CodeGen) for Microsoft Phi2 model
99
+
- WordPiece
100
+
- Bert (based on WordPiece) for Bert-supported models like optimum--all-MiniLM-L6-v2
0 commit comments