Commit 612dc33

[deploy] Merge pull request #161 from microsoft/dev
Dev: adding azure and s3 data loaders
2 parents 0d3e6c0 + e01e544 commit 612dc33

21 files changed: +727 -84 lines changed

README.md

Lines changed: 6 additions & 6 deletions
@@ -14,22 +14,24 @@
 
 Transform data and create rich visualizations iteratively with AI 🪄. Try Data Formulator now!
 
-[![Open in GitHub Codespaces](https://github.com/codespaces/badge.svg)](https://codespaces.new/microsoft/data-formulator?quickstart=1)
+Any questions? Ask on the Discord channel! [![Discord](https://img.shields.io/badge/discord-chat-green?logo=discord)](https://discord.gg/mYCZMQKYZb)
+
+<!-- [![Open in GitHub Codespaces](https://github.com/codespaces/badge.svg)](https://codespaces.new/microsoft/data-formulator?quickstart=1) -->
 
 <kbd>
 <a target="_blank" rel="noopener noreferrer" href="https://codespaces.new/microsoft/data-formulator?quickstart=1" title="open Data Formulator in GitHub Codespaces"><img src="public/data-formulator-screenshot.png"></a>
 </kbd>
 
 
+
 ## News 🔥🔥🔥
 
-- [05-13-2025] Data Formulator 0.2.1: External Data Loader
+- [05-13-2025] Data Formulator 0.2.3: External Data Loader
 - We introduced external data loader class to make import data easier. [Readme](https://github.com/microsoft/data-formulator/tree/main/py-src/data_formulator/data_loader) and [Demo](https://github.com/microsoft/data-formulator/pull/155)
-- Example data loaders from MySQL and Azure Data Explorer (Kusto) are provided.
+- Current data loaders: MySQL, Azure Data Explorer (Kusto), Azure Blob and Amazon S3 (json, parquet, csv).
 - Call for action [link](https://github.com/microsoft/data-formulator/issues/156):
 - Users: let us know which data source you'd like to load data from.
 - Developers: let's build more data loaders.
-- Discord channel for discussions: join us! [![Discord](https://img.shields.io/badge/discord-chat-green?logo=discord)](https://discord.gg/mYCZMQKYZb)
 
 - [04-23-2025] Data Formulator 0.2: working with *large* data 📦📦📦
 - Explore large data by:
@@ -68,8 +70,6 @@ Transform data and create rich visualizations iteratively with AI 🪄. Try Data
 
 - [10-01-2024] Initial release of Data Formulator, check out our [[blog]](https://www.microsoft.com/en-us/research/blog/data-formulator-exploring-how-ai-can-help-analysts-create-rich-data-visualizations/) and [[video]](https://youtu.be/3ndlwt0Wi3c)!
 
-
-
 ## Overview
 
 **Data Formulator** is an application from Microsoft Research that uses large language models to transform data, expediting the practice of data visualization.

package.json

Lines changed: 1 addition & 1 deletion
@@ -82,6 +82,6 @@
 "globals": "^15.12.0",
 "sass": "^1.77.6",
 "typescript-eslint": "^8.16.0",
-"vite": "^5.4.15"
+"vite": "^5.4.19"
 }
 }

py-src/data_formulator/agents/agent_code_explanation.py

Lines changed: 1 addition & 2 deletions
@@ -1,8 +1,7 @@
 # Copyright (c) Microsoft Corporation.
 # Licensed under the MIT License.
 
-import pandas as pd
-from data_formulator.agents.agent_utils import generate_data_summary, extract_code_from_gpt_response
+from data_formulator.agents.agent_utils import generate_data_summary
 
 import logging
 

py-src/data_formulator/agents/agent_py_concept_derive.py

Lines changed: 0 additions & 2 deletions
@@ -1,7 +1,6 @@
 # Copyright (c) Microsoft Corporation.
 # Licensed under the MIT License.
 
-import json
 import time
 
 from data_formulator.agents.agent_utils import generate_data_summary, extract_code_from_gpt_response
@@ -10,7 +9,6 @@
 import traceback
 
 import logging
-import datetime
 
 logger = logging.getLogger(__name__)
 

py-src/data_formulator/agents/agent_py_data_transform.py

Lines changed: 0 additions & 1 deletion
@@ -2,7 +2,6 @@
 # Licensed under the MIT License.
 
 import json
-import sys
 
 from data_formulator.agents.agent_utils import extract_json_objects, generate_data_summary, extract_code_from_gpt_response
 import data_formulator.py_sandbox as py_sandbox

py-src/data_formulator/agents/agent_query_completion.py

Lines changed: 1 addition & 2 deletions
@@ -1,10 +1,9 @@
 # Copyright (c) Microsoft Corporation.
 # Licensed under the MIT License.
 
-import pandas as pd
 import json
 
-from data_formulator.agents.agent_utils import extract_code_from_gpt_response, extract_json_objects
+from data_formulator.agents.agent_utils import extract_json_objects
 import re
 import logging
 

py-src/data_formulator/agents/agent_utils.py

Lines changed: 0 additions & 4 deletions
@@ -6,10 +6,6 @@
 import pandas as pd
 import numpy as np
 
-import base64
-
-from pprint import pprint
-
 import re
 
 def string_to_py_varname(var_str):

py-src/data_formulator/agents/client_utils.py

Lines changed: 0 additions & 1 deletion
@@ -1,4 +1,3 @@
-import os
 import litellm
 import openai
 from azure.identity import DefaultAzureCredential, get_bearer_token_provider
Lines changed: 6 additions & 2 deletions
@@ -1,10 +1,14 @@
 from data_formulator.data_loader.external_data_loader import ExternalDataLoader
 from data_formulator.data_loader.mysql_data_loader import MySQLDataLoader
 from data_formulator.data_loader.kusto_data_loader import KustoDataLoader
+from data_formulator.data_loader.s3_data_loader import S3DataLoader
+from data_formulator.data_loader.azure_blob_data_loader import AzureBlobDataLoader
 
 DATA_LOADERS = {
 "mysql": MySQLDataLoader,
-"kusto": KustoDataLoader
+"kusto": KustoDataLoader,
+"s3": S3DataLoader,
+"azure_blob": AzureBlobDataLoader,
 }
 
-__all__ = ["ExternalDataLoader", "MySQLDataLoader", "KustoDataLoader", "DATA_LOADERS"]
+__all__ = ["ExternalDataLoader", "MySQLDataLoader", "KustoDataLoader", "S3DataLoader", "AzureBlobDataLoader", "DATA_LOADERS"]
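
The `DATA_LOADERS` mapping in this diff acts as a registry keyed by loader type. Below is a minimal sketch of how a caller might resolve a key to a loader class. It rests on assumptions: the empty classes are stand-ins for the real loaders, whose constructors and methods are not shown in this commit, and `get_loader_class` is a hypothetical helper that is not part of the package.

```python
from typing import Dict, Type


# Stand-in classes (assumption): the real implementations live in
# data_formulator.data_loader and are not reproduced here.
class ExternalDataLoader:
    pass


class MySQLDataLoader(ExternalDataLoader):
    pass


class KustoDataLoader(ExternalDataLoader):
    pass


class S3DataLoader(ExternalDataLoader):
    pass


class AzureBlobDataLoader(ExternalDataLoader):
    pass


# Same keys as the registry after this commit.
DATA_LOADERS: Dict[str, Type[ExternalDataLoader]] = {
    "mysql": MySQLDataLoader,
    "kusto": KustoDataLoader,
    "s3": S3DataLoader,
    "azure_blob": AzureBlobDataLoader,
}


def get_loader_class(loader_type: str) -> Type[ExternalDataLoader]:
    """Hypothetical helper: map a registry key to its loader class."""
    try:
        return DATA_LOADERS[loader_type]
    except KeyError:
        known = ", ".join(sorted(DATA_LOADERS))
        raise ValueError(
            f"Unknown data loader {loader_type!r}; expected one of: {known}"
        ) from None
```

With a registry like this, supporting a new source only needs a new class and one more dictionary entry, which matches how this commit adds `s3` and `azure_blob`.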
