You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/machine-learning/data-science-virtual-machine/dsvm-tools-explore-and-visualize.md
+21-21Lines changed: 21 additions & 21 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -22,44 +22,44 @@ ms.author: vijetaj
22
22
23
23
# Data exploration and visualization tools on the Data Science Virtual Machine
24
24
25
-
A key step in data scienceis to understand the data. Visualization and data exploration tools help accelerate data understanding. Here are some toolsprovided on the DSVM that make this key step easier.
25
+
In data science, the key is to understand the data. Visualization and data exploration tools help accelerate data understanding. The following tools, which are provided on the Data Science Virtual Machine (DSVM), make this key step easier.
26
26
27
27
## Apache Drill
28
28
|||
29
29
| ------------- | ------------- |
30
-
| What is it? | Opensource SQL query engine on Big data |
31
-
| Supported DSVM Versions| Windows, Linux |
32
-
| How is it configured / installed on the DSVM? | Installed in `/dsvm/tools/drill*` in embedded mode only |
33
-
| Typical Uses|In-situ Data exploration without requiring ETL. Query different data sources and formats including CSV, JSON, relational tables, Hadoop |
34
-
| How to use / run it?| Desktop Shortcut <br/> [Get started with Drill in 10 minutes](https://drill.apache.org/docs/drill-in-10-minutes/)|
35
-
| Related Tools on the DSVM | Rattle, Weka, SQL Server Management Studio |
30
+
| What is it? | Open-source SQL query engine on big data |
31
+
| Supported DSVM versions| Windows, Linux |
32
+
| How is it configured and installed on the DSVM? | Installed in `/dsvm/tools/drill*` in embedded mode only |
33
+
| Typical uses|For in-place data exploration without requiring extract, transform, load (ETL). Query different data sources and formats, including CSV, JSON, relational tables, and Hadoop.|
34
+
| How to use and run it | Desktop shortcut <br/> [Get started with Drill in 10 minutes](https://drill.apache.org/docs/drill-in-10-minutes/)|
35
+
| Related tools on the DSVM | Rattle, Weka, SQL Server Management Studio |
36
36
37
37
## Weka
38
38
|||
39
39
| ------------- | ------------- |
40
-
| What is it? |Weka is a collection of machinelearning algorithms for data mining tasks. The algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data pre-processing, classification, regression, clustering, association rules, and visualization. |
40
+
| What is it? |A collection of machine-learning algorithms for data mining tasks. These algorithms can either be applied directly to a dataset or called from your own Java code. Weka contains tools for data preprocessing, classification, regression, clustering, association rules, and visualization. |
41
41
| Supported DSVM Editions | Windows, Linux |
42
-
| Typical Uses| General ML Tool|
43
-
| How to use / run it?| On Windows, search for Weka in the Start Menu. On Linux, sign in with X2Go, then navigate to Applications -> Development -> Weka. |
42
+
| Typical uses| General machine-learning tool|
43
+
| How to use and run it | On Windows, search for Weka on the Start menu. On Linux, sign in with X2Go, and then go to Applications > Development > Weka. |
44
44
| Links to Samples |[Weka samples](https://www.cs.waikato.ac.nz/ml/weka/documentation.html)|
45
-
| Related Tools on the DSVM |LightGBM, Rattle, Xgboost |
45
+
| Related tools on the DSVM |LightGBM, Rattle, Xgboost |
46
46
47
47
## Rattle
48
48
|||
49
49
| ------------- | ------------- |
50
-
| What is it? | A Graphical User Interface for Data Mining using R |
50
+
| What is it? | A graphical user interface (GUI) for data mining using R |
51
51
| Supported DSVM Editions | Windows, Linux |
52
-
| Typical Uses| General UI Data Mining tool for R |
53
-
| How to use / run it?| UI tool. On Windows, start a Command Prompt, run R, then inside R run `rattle()`. On Linux, connect with X2Go, start a terminal, run R, then inside R run `rattle()`. |
54
-
| Links to Samples|[Rattle](https://togaware.com/onepager/)|
55
-
| Related Tools on the DSVM |LightGBM, Weka, Xgboost |
52
+
| Typical uses| General UI Data Mining tool for R |
53
+
| How to use and run it | UI tool. On Windows, open a command prompt, run R, and then inside R, run `rattle()`. On Linux, connect with X2Go, start a terminal, run R, and then inside R, run `rattle()`. |
54
+
| Links to samples|[Rattle](https://togaware.com/onepager/)|
55
+
| Related tools on the DSVM |LightGBM, Weka, Xgboost |
56
56
57
57
## Power BI Desktop
58
58
|||
59
59
| ------------- | ------------- |
60
-
| What is it? | Interactive Data Visualization and BI Tool|
61
-
| Supported DSVM Versions| Windows |
62
-
| Typical Uses| Data Visualization and building Dashboards|
63
-
| How to use / run it?| Desktop Shortcut (`C:\Program Files\Microsoft Power BI Desktop\bin\PBIDesktop.exe`) |
64
-
| Related Tools on the DSVM | Visual Studio 2019, Visual Studio Code, Juno |
60
+
| What is it? | Interactive data visualization and BI tool|
61
+
| Supported DSVM versions| Windows |
62
+
| Typical uses| Data visualization and building dashboards|
63
+
| How to use and run it | Desktop shortcut (`C:\Program Files\Microsoft Power BI Desktop\bin\PBIDesktop.exe`) |
64
+
| Related tools on the DSVM | Visual Studio 2019, Visual Studio Code, Juno |
0 commit comments