Skip to content

Commit 92fb903

Browse files
authored
Merge pull request #11541 from IQSS/11540-ai-guide
add AI Guide: Ask Dataverse, Ask the Data, AutoSage, DataChat, TurboCurator, MCP
2 parents 7877e90 + 4d260d6 commit 92fb903

File tree

4 files changed

+51
-0
lines changed

4 files changed

+51
-0
lines changed
Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,3 @@
1+
### AI Guide for Dataverse
2+
3+
Information about various Dataverse-related AI efforts have been documented in a new [AI Guide](https://dataverse-guide--11541.org.readthedocs.build/en/11541/ai/index.html). See #11474, #11540, and #11541.
Lines changed: 42 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,42 @@
1+
# AI Guide
2+
3+
Artificial Intelligence (AI) is a growing component of the Dataverse ecosystem.
4+
5+
```{contents} Contents:
6+
:local:
7+
:depth: 2
8+
```
9+
10+
## Tools
11+
12+
### Ask Dataverse
13+
14+
Ask Dataverse ([ask.dataverse.org](https://ask.dataverse.org)) is a place to ask questions about the Dataverse Project and the Dataverse software. It was created by Slava Tykhonov who [announced](https://groups.google.com/g/dataverse-community/c/tqwCoygO4oE/m/MNSfrw_QAwAJ) it in December 2024 and presented it February 2025 ([video](https://harvard.zoom.us/rec/share/bOizatNdMdxINRCnqpt87fPITPvsDWTv3ysvA8kIaEE4wnmZPSeSUkdmpKYP1ooA.rKoNMqED_L8KtHOi), [slides](https://docs.google.com/presentation/d/1HFN-wAe4eUGwJAhYCLbNcNHAsi-Hy8jQ/edit?usp=sharing&ouid=117275479921759507378&rtpof=true&sd=true), [notes](https://docs.google.com/document/d/1Dz07WKceGrBGdq5wWf0NJS08CO0FEmi4TgQBcsDcpRE/edit?usp=sharing)).
15+
16+
### Ask the Data
17+
18+
Ask the Data is an {ref}`external tool <inventory-of-external-tools>` that allows you ask natural language questions about the data contained in Dataverse tables (tabular data). See the README.md file at <https://github.com/IQSS/askdataverse/tree/main/askthedata> for the instructions on adding Ask the Data to your Dataverse installation.
19+
20+
### AutoSage
21+
22+
AutoSage provides metadata suggestions for datasets. Learn more at <https://github.com/siacus/autosage>.
23+
24+
### DataChat
25+
26+
DataChat is a multilingual open source natural language interface for Dataverse and other data platforms with an experimental Graph AI implementation for Croissant support. DataChat can literally talk back to you and explain what is inside of every single dataset, you can ask any question and it responds on the level of metadata described by Croissant standard. Learn more at <https://github.com/gdcc/datachat>.
27+
28+
### TurboCurator
29+
30+
TurboCurator is an {ref}`external tool <inventory-of-external-tools>` that generates metadata improvements for title, description, and keywords. It relies on OpenAI's ChatGPT & ICPSR best practices. See the [TurboCurator Dataverse Administrator](https://turbocurator.icpsr.umich.edu/tc/adminabout/) page for more details on how it works and adding TurboCurator to your Dataverse installation.
31+
32+
33+
## Protocols
34+
35+
(mcp)=
36+
### Model Context Protocol (MCP)
37+
38+
[Model Context Protocol (MCP)](https://modelcontextprotocol.io/introduction) is a standard for AI Agents to communicate with tools and services, [announced](https://www.anthropic.com/news/model-context-protocol) in November 2024.
39+
40+
An MCP server for Dataverse has been deployed to [mcp.dataverse.org][], powered by the code at <https://github.com/gdcc/mcp-dataverse>. See the code's README for information on configuring MCP clients (e.g. Cursor, Visual Studio Code, Windsurf, Zed, etc.) to use [mcp.dataverse.org][] or your own local installation (setup instructions are also provided).
41+
42+
[mcp.dataverse.org]: https://mcp.dataverse.org

doc/sphinx-guides/source/api/apps.rst

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -108,6 +108,11 @@ A Python CLI tool for bulk extraction of dataverses, datasets, and data file met
108108

109109
https://github.com/scholarsportal/dataverse-metadata-crawler
110110

111+
mcp-dataverse
112+
~~~~~~~~~~~~~
113+
114+
The code at https://github.com/gdcc/mcp-dataverse powers a :ref:`mcp` server for Dataverse.
115+
111116
Java
112117
----
113118

doc/sphinx-guides/source/index.rst

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -15,6 +15,7 @@ These documentation guides are for the |version| version of Dataverse. To find g
1515

1616
user/index
1717
admin/index
18+
ai/index
1819
api/index
1920
installation/index
2021
contributor/index

0 commit comments

Comments
 (0)