@@ -19,15 +19,12 @@ limitations under the License.
 
 # Apache SystemDS
 
-**Overview:** SystemDS is an open source ML system for the end-to-end data science lifecycle from data integration, cleaning,
-and feature engineering, over efficient, local and distributed ML model training, to deployment and serving. To this
-end, we aim to provide a stack of declarative languages with R-like syntax for (1) the different tasks of the data-science
-lifecycle, and (2) users with different expertise. These high-level scripts are compiled into hybrid execution plans of
-local, in-memory CPU and GPU operations, as well as distributed operations on Apache Spark. In contrast to existing
-systems - that either provide homogeneous tensors or 2D Datasets - and in order to serve the entire data science lifecycle,
-the underlying data model are DataTensors, i.e., tensors (multi-dimensional arrays) whose first dimension may have a
-heterogeneous and nested schema.
-
+**Overview:** Apache SystemDS is an open-source machine learning (ML) system for the end-to-end
+data science lifecycle from data preparation and cleaning, over efficient ML model training,
+to debugging and serving. ML algorithms or pipelines are specified in a high-level language
+with R-like syntax or related Python and Java APIs (with many builtin primitives), and the
+system automatically generates hybrid runtime plans of local, in-memory operations and distributed
+operations on Apache Spark. Additional backends exist for GPUs and federated learning.
 
 Resource | Links
 ---------|------