datacommonsorg
diff --git a/assets/images/mcp.png b/assets/images/mcp.png
43 KB
diff --git a/llm/index.md b/llm/index.md
diff --git a/mcp/develop_agent.md b/mcp/develop_agent.md
Lines changed: 30 additions & 0 deletions
diff --git a/mcp/index.md b/mcp/index.md
Lines changed: 42 additions & 4 deletions
diff --git a/mcp/run_tools.md b/mcp/run_tools.md
Lines changed: 211 additions & 0 deletions
@@ -0,0 +1,30 @@
+---
+layout: default
+title: Develop an ADK agent
+nav_order: 3
+parent: MCP - Query data interactively with an AI agent
+---
+
+# Develop your own ADK agent
+
+We provide two sample Google Agent Development Kit-based agents you can use as inspiration for building your own agent:
+
+- [Try Data Commons MCP Tools with a Custom Agent](https://github.com/datacommonsorg/agent-toolkit/blob/main/notebooks/datacommons_mcp_tools_with_custom_agent.ipynb) is a Google Colab tutorial that shows how to build an ADK Python agent step by step. 
+- The sample [basic agent](https://github.com/datacommonsorg/agent-toolkit/tree/main/packages/datacommons-mcp/examples/sample_agents/basic_agent) is a simple Python [Google ADK](https://google.github.io/adk-docs/) agent you can use to develop locally. 
+
+## Customize the sample agent
+
+You can make changes directly to the Python files in <https://github.com/datacommonsorg/agent-toolkit/blob/main/packages/datacommons-mcp/examples/sample_agents/basic_agent/>. You'll need to [restart the agent](/mcp/run_tools.html#use-the-sample-agent) any time you make changes.
+
+> Tip: You do not need to install the Google ADK; when you use the [command we provide](run_tools.md#use-the-sample-agent) to start the agent, it downloads the ADK dependencies at run time.
+
+### Customize the model
+
+To change to a different LLM, edit the `AGENT_MODEL` constant in [packages/datacommons-mcp/examples/sample_agents/basic_agent/agent.py](https://github.com/datacommonsorg/agent-toolkit/blob/main/packages/datacommons-mcp/examples/sample_agents/basic_agent/agent.py#L23){: target="_blank"}.
+
+### Customize agent behavior
+
+The agent's behavior is determined by the prompts defined in `AGENT_INSTRUCTIONS` in [packages/datacommons-mcp/examples/sample_agents/basic_agent/instructions.py](https://github.com/datacommonsorg/agent-toolkit/blob/main/packages/datacommons-mcp/examples/sample_agents/basic_agent/instructions.py){: target="_blank"}.
+
+You can add your own prompts to modify how the agent handles tool results. For example, you might want to give a prompt to "build a report for every response" or "always save tabular results to a CSV file". See the Google ADK page on [LLM agent instructions](https://google.github.io/adk-docs/agents/llm-agents/#guiding-the-agent-instructions-instruction){: target="_blank"} for tips on how to write good prompts.
+
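One way to layer extra prompts onto the existing instructions is sketched below. It assumes `AGENT_INSTRUCTIONS` is a plain string; `BASE_INSTRUCTIONS` is a stand-in value rather than the real text, and `with_extra_prompts` is a hypothetical helper that is not part of the sample agent.

```python
# Stand-in for the AGENT_INSTRUCTIONS text in instructions.py (assumed to be a plain string).
BASE_INSTRUCTIONS = "You are a data analyst. Use the Data Commons tools to answer questions."


def with_extra_prompts(base: str, extra_prompts: list[str]) -> str:
    """Append each non-empty extra prompt as a bullet under its own heading."""
    cleaned = [p.strip() for p in extra_prompts if p.strip()]
    if not cleaned:
        return base
    bullets = "\n".join(f"- {p}" for p in cleaned)
    return f"{base.rstrip()}\n\nAdditional rules:\n{bullets}\n"


AGENT_INSTRUCTIONS = with_extra_prompts(
    BASE_INSTRUCTIONS,
    ["Build a report for every response.", "Always save tabular results to a CSV file."],
)
```

Keeping the additions in a separate list makes it easier to see which rules are yours when the upstream instructions change.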
@@ -5,10 +5,48 @@ nav_order: 20
 has_children: true
 ---
 
-# MCP overview
+{:.no_toc}
+# Query data interactively with an AI agent
 
-Data Commons has recently launched a [Model Context Protocol](https://github.com/datacommonsorg/agent-toolkit){: target="_blank"} server. This allows you to use any MCP-enabled agent, powered by a Large Language Model (LLM) like Google Gemini, to interactively query Data Commons data. See the following pages for details:
+* TOC
+{:toc}
 
-- [Quickstart: Use the Data Commons MCP Server with Gemini CLI](https://github.com/datacommonsorg/agent-toolkit/blob/main/docs/quickstart.md){: target="_blank"}
-- [User Guide](https://github.com/datacommonsorg/agent-toolkit/blob/main/docs/user_guide.md){: target="_blank"}
+## Overview
 
+The Data Commons [Model Context Protocol (MCP)](https://modelcontextprotocol.io/docs/getting-started/intro) service gives AI agents access to the Data Commons knowledge graph and returns data related to statistical variables, topics, and observations. It allows end users to formulate complex natural-language queries interactively, get data in textual, structured or unstructured formats, and download the data as desired. For example, depending on the agent, a user can answer high-level questions such as "give me the economic indicators of the BRICS countries", view simple tables, and download a CSV file of the data in tabular format.
+
+The MCP server returns data from datacommons.org by default or can be configured for a Custom Data Commons instance. 
+
+The server is a Python binary based on the [FastMCP 2.0 framework](https://gofastmcp.com). A prebuilt package is available at <https://pypi.org/project/datacommons-mcp/>.
+
+At this time, there is no centrally deployed server; you run your own server and whichever client you want to connect to it.
+
+![alt text](/assets/images/mcp.png)
+
+## Tools
+
+The server currently supports the following tools:
+
+- `search_indicators`: Searches for available variables and/or topics (a hierarchy of sub-topics and member variables) for a given place or metric. 
+- `get_observations`: Fetches statistical data for a given variable and place.
+
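Under MCP, a client invokes a tool by sending a JSON-RPC 2.0 `tools/call` request; MCP-enabled agents normally build this for you. The sketch below only illustrates the shape of such a request. The argument keys (`variable_dcid`, `place_dcid`) are illustrative assumptions, not the documented schema of `get_observations`; a client would normally discover the real schema with `tools/list`.

```python
import json


def tool_call_request(request_id: int, tool_name: str, arguments: dict) -> str:
    """Serialize an MCP `tools/call` JSON-RPC 2.0 request."""
    return json.dumps({
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments},
    })


# The argument keys below are assumptions made for illustration only.
request = tool_call_request(
    1,
    "get_observations",
    {"variable_dcid": "Count_Person", "place_dcid": "country/ZWE"},
)
```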
+## Clients
+
+To connect to the Data Commons MCP Server, you can use any available AI application that supports MCP, or your own custom agent. 
+
+The server supports both standard MCP [transport protocols](https://modelcontextprotocol.io/docs/learn/architecture#transport-layer):
+- Stdio: For clients that connect directly using local processes
+- Streamable HTTP: For clients that connect remotely or otherwise require HTTP (e.g. TypeScript)
+
+See [Run and connect to the server](run_tools.md) for procedures for using [Gemini CLI](https://github.com/google-gemini/gemini-cli).
+
+## Unsupported features
+
+At the current time, the following are not supported:
+- Non-geographical ("custom") entities
+- Events
+- Exploring nodes and relationships in the graph
+
+## Disclaimer
+
+AI applications using the MCP server can make mistakes, so please double-check responses.
@@ -0,0 +1,211 @@
+---
+layout: default
+title: Run MCP tools
+nav_order: 2
+parent: MCP - Query data interactively with an AI agent
+---
+
+{:.no_toc}
+# Run and connect to the server
+
+This page shows you how to run a local agent and connect to a Data Commons MCP server running locally or remotely.
+
+* TOC
+{:toc}
+
+
+We provide specific instructions for the following agents:
+
+- [Gemini CLI](https://github.com/google-gemini/gemini-cli) 
+   - Can be used for datacommons.org or a Custom Data Commons instance
+   - Requires minimal setup 
+
+   See [Use Gemini CLI](#use-gemini-cli) for this option.
+- A sample basic agent based on the Google [Agent Development Kit](https://google.github.io/adk-docs/) and [Gemini 2.5 Flash](https://deepmind.google/models/gemini/flash/)
+   - Best for interacting with a Web UI
+   - Can be used for datacommons.org or a Custom Data Commons instance
+   - Can be customized to run other LLMs
+   - Requires some additional setup
+
+   See [Use the sample agent](#use-the-sample-agent) for this option.
+
+For an end-to-end tutorial using a server and agent over HTTP, see the sample Data Commons Colab notebook, [Try Data Commons MCP Tools with a Custom Agent](https://github.com/datacommonsorg/agent-toolkit/blob/main/notebooks/datacommons_mcp_tools_with_custom_agent.ipynb).
+
+For other clients/agents, see the relevant documentation; you should be able to reuse the commands and arguments detailed below.
+
+## Prerequisites
+
+- A (free) Data Commons API key. To obtain an API key, go to <https://apikeys.datacommons.org> and request a key for the `api.datacommons.org` domain.
+- Install `uv` for managing and installing Python packages; see the instructions at <https://docs.astral.sh/uv/getting-started/installation/>. 
+- For running the sample agent or the Colab notebook, a GCP project and a Google AI API key. For details on supported keys, see <https://google.github.io/adk-docs/get-started/quickstart/#set-up-the-model>.
+- For running the sample agent locally, install [Git](https://git-scm.com/).
+
+> **Important**: Additionally, for custom Data Commons instances:
+> If you have not rebuilt your Data Commons image since the stable release of 2025-09-08, you must [sync to the latest stable release](/custom_dc/build_image.html#sync-code-to-the-stable-branch), [rebuild your image](/custom_dc/build_image.html#build-package) and [redeploy](/custom_dc/deploy_cloud.html#manage-your-service).
+
+
+## Configure environment variables
+
+### Base Data Commons (datacommons.org)
+
+For basic usage against datacommons.org, set the required `DC_API_KEY` in your shell/startup script (e.g. `.bashrc`).
+<pre>
+export DC_API_KEY=<var>YOUR API KEY</var>
+</pre>
+
+### Custom Data Commons
+
+If you're running against a custom Data Commons instance, we recommend using a `.env` file, which the server locates automatically, to keep all the settings in one place. All supported options are documented in <https://github.com/datacommonsorg/agent-toolkit/blob/main/packages/datacommons-mcp/.env.sample>.
+
+To set variables using a `.env` file:
+
+1. From GitHub, download the file [`.env.sample`](https://github.com/datacommonsorg/agent-toolkit/blob/main/packages/datacommons-mcp/.env.sample) to the desired directory. Or, if you plan to run the sample agent, clone the repo <https://github.com/datacommonsorg/agent-toolkit/>.
+
+1. From the directory where you saved the sample file, copy it to a new file called `.env`. For example:
+   ```bash
+   cd ~/agent-toolkit/packages/datacommons-mcp
+   cp .env.sample .env
+   ```
+1. Set the following variables: 
+   - `DC_API_KEY`: Set to your Data Commons API key
+   - `DC_TYPE`: Set to `custom`.
+   - `CUSTOM_DC_URL`: Uncomment and set to the URL of your instance. 
+1. Optionally, set other variables.
+1. Save the file.
+
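Before starting the server, it can help to confirm that the `.env` file contains the three variables listed above. The following is a minimal sketch, not part of the toolkit: `parse_env` and `custom_dc_problems` are hypothetical helpers, and the parser handles only simple `KEY=VALUE` lines (no `export` prefix, inline comments, or multi-line values).

```python
REQUIRED_FOR_CUSTOM = ("DC_API_KEY", "DC_TYPE", "CUSTOM_DC_URL")


def parse_env(text: str) -> dict:
    """Parse simple KEY=VALUE lines, skipping blank lines and `#` comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def custom_dc_problems(values: dict) -> list:
    """Return human-readable problems; an empty list means the basics are set."""
    problems = [
        f"{name} is missing or empty"
        for name in REQUIRED_FOR_CUSTOM
        if not values.get(name)
    ]
    if values.get("DC_TYPE") and values["DC_TYPE"] != "custom":
        problems.append("DC_TYPE should be 'custom'")
    return problems
```

For example, `custom_dc_problems(parse_env(open(".env").read()))` reports a still-commented-out `CUSTOM_DC_URL` as missing.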
+## Use Gemini CLI
+
+1. Install Gemini CLI: see instructions at <https://github.com/google-gemini/gemini-cli#quick-install>. 
+2. To configure Gemini CLI to recognize the Data Commons server, edit your `~/.gemini/settings.json` file to add the following:
+
+<pre>
+{
+// ...
+    "mcpServers": {
+        "datacommons-mcp": {
+            "command": "uvx",
+            "args": [
+                "datacommons-mcp@latest",
+                "serve",
+                "stdio"
+            ],
+            "env": {
+                "DC_API_KEY": "<var>YOUR DATA COMMONS API KEY</var>"
+            },
+            "trust": true
+        }
+    }
+// ...
+}
+</pre>
+1. From any directory, run `gemini`. 
+1. To see the Data Commons tools, use `/mcp tools`.
+1. Start sending [natural-language queries](#sample-queries).
+
+> **Tip**: To ensure that Gemini CLI uses the Data Commons MCP tools, and not its own `GoogleSearch` tool, include a prompt to use Data Commons in your query. For example, use a query like "Use Data Commons tools to answer the following: ..."  You can also add such a prompt to a [`GEMINI.md` file](https://codelabs.developers.google.com/gemini-cli-hands-on#9) so that it's persisted across sessions.
+
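If you would rather script the `settings.json` edit than make it by hand, the sketch below merges the `datacommons-mcp` entry shown above into an existing settings dictionary without touching other keys. Both functions are hypothetical helpers, and the file-based one assumes the settings file is plain JSON with no comments.

```python
import json
from pathlib import Path


def add_datacommons_server(settings: dict, api_key: str) -> dict:
    """Return a copy of `settings` with the datacommons-mcp stdio entry added."""
    merged = dict(settings)
    servers = dict(merged.get("mcpServers", {}))
    servers["datacommons-mcp"] = {
        "command": "uvx",
        "args": ["datacommons-mcp@latest", "serve", "stdio"],
        "env": {"DC_API_KEY": api_key},
        "trust": True,
    }
    merged["mcpServers"] = servers
    return merged


def update_settings_file(path: Path, api_key: str) -> None:
    """Read, merge, and rewrite a settings file (assumes comment-free JSON)."""
    current = json.loads(path.read_text()) if path.exists() else {}
    updated = add_datacommons_server(current, api_key)
    path.write_text(json.dumps(updated, indent=2) + "\n")
```

To apply it, call `update_settings_file` with `Path.home() / ".gemini" / "settings.json"` and your Data Commons API key.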
+## Use the sample agent
+
+We provide a basic agent for interacting with the MCP Server in [packages/datacommons-mcp/examples/sample_agents/basic_agent](https://github.com/datacommonsorg/agent-toolkit/tree/main/packages/datacommons-mcp/examples/sample_agents/basic_agent). To run the agent locally:
+
+1. If not already installed, install `uv` for managing and installing Python packages; see the instructions at <https://docs.astral.sh/uv/getting-started/installation/>. 
+1. From the desired directory, clone the `agent-toolkit` repo:
+   ```bash
+   git clone https://github.com/datacommonsorg/agent-toolkit.git
+   ```
+1. Set the following environment variables in your shell or startup script:
+   <pre>
+   export DC_API_KEY=<var>YOUR DATA COMMONS API KEY</var>
+   export GEMINI_API_KEY=<var>YOUR GOOGLE AI API KEY</var>
+   </pre>
+1. Go to the root directory of the repo:
+   ```bash
+   cd agent-toolkit
+   ```
+1. Run the agent using one of the following methods.
+
+### Web UI (recommended)
+
+1. Run the following command:
+   ```bash
+   uvx --from google-adk adk web ./packages/datacommons-mcp/examples/sample_agents/
+   ```
+1. Point your browser to the address and port displayed on the screen (e.g. `http://127.0.0.1:8000/`). The Agent Development Kit Dev UI is displayed. 
+1. From the **Type a message** box, type your [query for Data Commons](#sample-queries) or select another action.
+
+### Command line interface
+
+1. Run the following command:
+   ```bash
+   uvx --from google-adk adk run ./packages/datacommons-mcp/examples/sample_agents/basic_agent
+   ```
+1. Enter your [queries](#sample-queries) at the `User` prompt in the terminal.
+
+## Sample queries
+
+The Data Commons MCP tools excel at natural-language queries that involve:
+- Comparisons between two or more entities, such as countries or metrics
+- Exploring data available for a given topic
+
+Here are some examples of such queries:
+
+- "What health data do you have for Africa?"
+- "What data do you have on water quality in Zimbabwe?"
+- "Compare the life expectancy, economic inequality, and GDP growth for BRICS nations."
+- "Generate a concise report on income vs diabetes in US counties."
+
+## Use a remote server/client
+
+### Run a standalone server
+
+1. Ensure you've set up the relevant server [environment variables](#configure-environment-variables). If you're using a `.env` file, go to the directory where the file is stored.
+1. Run:
+   <pre>
+   uvx datacommons-mcp serve http [--port <var>PORT</var>]
+   </pre>
+If you don't set the port explicitly, it defaults to 8080.
+
+The server is addressable with the endpoint `mcp`. For example, `http://my-mcp-server:8080/mcp`.
+
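The endpoint convention above can be captured in a small helper. This is a sketch only: `mcp_endpoint` and `is_listening` are hypothetical names, and the second function checks only that something accepts TCP connections on the port, not that it is actually a Data Commons MCP server.

```python
import socket

DEFAULT_PORT = 8080  # the default port stated above


def mcp_endpoint(host: str, port: int = DEFAULT_PORT, scheme: str = "http") -> str:
    """Build the URL a client should use for the Streamable HTTP endpoint."""
    if not 1 <= port <= 65535:
        raise ValueError(f"port out of range: {port}")
    return f"{scheme}://{host}:{port}/mcp"


def is_listening(host: str, port: int = DEFAULT_PORT, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False
```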
+### Connect to an already-running server from a remote client
+
+Below we provide instructions for Gemini CLI and a sample ADK agent. If you're using a different client, consult its documentation to determine how to specify an HTTP URL.
+
+#### Gemini CLI
+
+To configure Gemini CLI to connect to a remote Data Commons server over HTTP, replace the `mcpServers` section in `~/.gemini/settings.json` (or other `settings.json` file) with the following:
+
+<pre>
+{
+// ... (additional configuration)
+  "mcpServers": {
+    "datacommons-mcp": {
+      "httpUrl": "http://<var>HOST</var>:<var>PORT</var>/mcp"
+    }
+    // ... (other mcpServers entries)
+  }
+}
+</pre>
+
+#### Sample agent
+
+To configure the sample agent to connect to a remote Data Commons MCP server over HTTP, you need to modify the code in [`basic_agent/agent.py`](https://github.com/datacommonsorg/agent-toolkit/blob/main/packages/datacommons-mcp/examples/sample_agents/basic_agent/agent.py).  Set import modules and agent initialization parameters as follows:
+
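+```python
+from google.adk.agents import LlmAgent
+from google.adk.tools.mcp_tool.mcp_toolset import (
+    MCPToolset,
+    StreamableHTTPConnectionParams,
+)
+
+root_agent = LlmAgent(
+    # ...
+    tools=[
+        # Connect to the remote server over Streamable HTTP.
+        MCPToolset(
+            connection_params=StreamableHTTPConnectionParams(
+                # Replace <host> and <port> with your server's address.
+                url="http://<host>:<port>/mcp"
+            )
+        )
+    ],
+)
+```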
+Run the agent as described in [Use the sample agent](#use-the-sample-agent) above.
+
+