## Overview

The `LLMClient` class in `psyflow` offers a lightweight, unified interface for interacting with various Large Language Model (LLM) backends, including Google Gemini, OpenAI, Deepseek, and Moonshot. Instead of relying on heavy frameworks like LangChain, we built a minimal wrapper to keep things simple: no extra dependencies beyond the provider SDKs, a clean API (e.g., `generate()`, `translate()`, `count_tokens()`), and fast, low-overhead execution.

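A minimal sketch of those core calls (the `generate()` and `translate()` usage follows the Quick Reference below; the exact `count_tokens()` signature is an assumption):

```python
import os
from psyflow import LLMClient

client = LLMClient("openai", os.getenv("OPENAI_KEY"), "gpt-4o-mini")
print(client.generate("Say hello in one short sentence."))
print(client.translate("Welcome", target_language="Japanese"))
print(client.count_tokens("Welcome"))  # signature assumed; only the method name is documented
```
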
## Supported Providers

Our library supports flexible, cost-effective access across multiple providers:

- **Gemini** (Google GenAI): Free-tier access to powerful models, ideal for getting started at no cost.
- **OpenAI**: Official OpenAI SDK support for GPT-series models and fine-tuned endpoints.
- **Deepseek**: A cost-effective alternative via the OpenAI-compatible SDK for users without Gemini access.
- **Moonshot**: Another cost-effective, OpenAI-compatible alternative for users without Gemini access.

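Moonshot works through the wrapper just like the other providers; a hedged initialization sketch (the environment variable and model name are illustrative, the call shape follows the Quick Reference below):

```python
import os
from psyflow import LLMClient

# Moonshot via the OpenAI-compatible wrapper (model name illustrative)
moonshot = LLMClient("moonshot", os.getenv("MOONSHOT_KEY"), "moonshot-v1-8k")
print(moonshot.list_models())
```
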
## Key Features

| Feature                | Description                                                           |
| ---------------------- | --------------------------------------------------------------------- |
| Multi-provider support | Out-of-the-box: Gemini, OpenAI, Deepseek, Moonshot                     |
| Text generation        | `generate()` with sampling and deterministic options                  |
| Model discovery        | `list_models()` lists IDs from each provider                          |
| Task documentation     | `task2doc()` auto-creates a structured `README.md`                    |
| Translation            | `translate()` for strings, `translate_config()` for YAML              |
| Knowledge management   | `add_knowledge()` & `save_knowledge()` manage few-shot examples       |
| Error handling         | Raises `LLMAPIError` for failures, missing models, or token overflow  |

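The error-handling and knowledge-management rows can be combined as in this sketch; note that the `LLMAPIError` import path and the `add_knowledge()`/`save_knowledge()` arguments are assumptions, not confirmed signatures:

```python
import os
from psyflow import LLMClient, LLMAPIError  # LLMAPIError import path assumed

client = LLMClient("gemini", os.getenv("GEMINI_KEY"), "gemini-2.0-flash")

try:
    print(client.generate("Summarize the Stroop task in one sentence."))
except LLMAPIError as err:
    # Raised on API failures, missing models, or token overflow (per the table above)
    print("LLM call failed:", err)

# Manage few-shot examples (arguments assumed)
client.add_knowledge("Example README: Overview, Flow Table, Configuration, Methods.")
client.save_knowledge("./knowledge.json")
```
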
## Quick Reference

| Purpose               | Method                                                   | Example                                                                   |
| --------------------- | -------------------------------------------------------- | ------------------------------------------------------------------------- |
| Initialize client     | `LLMClient(provider, api_key, model)`                    | `client = LLMClient("openai", os.getenv("OPENAI_KEY"), "gpt-4o-mini")`    |
| Generate text         | `generate(prompt, deterministic=False, **kwargs)`        | `resp = client.generate("Hello world", temperature=0.5)`                  |
| List models           | `list_models()`                                          | `models = client.list_models()`                                           |
| Smoke-test connection | `test(ping, max_tokens)`                                 | `client.test("Hi", max_tokens=5)`                                         |
| Auto-generate README  | `task2doc(logic_paths, config_paths, output_path)`       | `client.task2doc(["src/run_trial.py"], ["config/config.yaml"], "./")`     |
| Translate string      | `translate(text, target_language)`                       | `client.translate("Welcome", "Japanese")`                                 |
| Translate config YAML | `translate_config(target_language, config, output_dir)`  | `client.translate_config("Spanish", "./config/config.yaml", "./config")`  |

## Detailed Usage Guide

### 1. Verify Native SDKs

#### 1.1 Google-GenAI (Gemini)

```python
from google import genai

# Initialize the Gemini client (the google-genai SDK takes the key directly;
# genai.configure() belongs to the older google.generativeai package)
client = genai.Client(api_key="…your Gemini API key…")

# List available models
models = client.models.list()
model_ids = [m.name.split('/')[-1] for m in models]
print("Available models:", model_ids)

# Quick echo test
resp = client.models.generate_content(
    model="gemini-1.5-flash",
    contents="Hello, how are you?"
)
print(resp.text)
# -> I am doing well... How are you today?
```

#### 1.2 OpenAI / Deepseek

```python
from openai import OpenAI

# Deepseek (and Moonshot) expose OpenAI-compatible endpoints,
# so the official OpenAI SDK works with a custom base_url
client = OpenAI(api_key="…your key…", base_url="https://api.deepseek.com")

# List models from Deepseek
resp = client.models.list()
ids = [m.id for m in resp.data]
print("Available models:", ids)

# Quick echo test
echo = client.chat.completions.create(
    model="deepseek-chat",
    messages=[{"role": "user", "content": "Hello"}],
    stream=False
)
print(echo.choices[0].message.content)
# -> Hello! How can I assist you today?
```

### 2. Use Psyflow `LLMClient` Wrapper

```python
from psyflow import LLMClient
import os

# Instantiate wrappers for each provider
gemini = LLMClient("gemini", os.getenv("GEMINI_KEY"), "gemini-2.0-flash")
deep = LLMClient("deepseek", os.getenv("DEEPSEEK_KEY"), "deepseek-chat")

# List models via wrapper
print("Gemini sees:", gemini.list_models())
print("Deepseek sees:", deep.list_models())

# Echo test via wrapper
gemini_echo = gemini.test(max_tokens=5)
print("Gemini echo:", gemini_echo)
deepseek_echo = deep.test(max_tokens=5)
print("Deepseek echo:", deepseek_echo)
```
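
Beyond the echo test, `generate()` accepts sampling keywords and a deterministic mode, as listed in the Quick Reference (the prompt here is illustrative):

```python
# Sampled output: temperature controls creativity
resp = gemini.generate("Explain the Stroop effect in one sentence.", temperature=0.7)
print(resp)

# Deterministic output for reproducible instruction text
resp = gemini.generate("Explain the Stroop effect in one sentence.", deterministic=True)
print(resp)
```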

### 3. LLMs-Powered Task Documentation

Use `task2doc()` to generate a complete `README.md` for your PsyFlow task:

```python
client = LLMClient("gemini", os.getenv("GEMINI_KEY"), "gemini-2.5-flash")
readme = client.task2doc(
    logic_paths=["main.py", "src/run_trial.py"],
    config_paths=["config/config.yaml"],
    output_path="./"
)
print("Generated README content:")
print(readme)
```

This reads your code and config, sends them to the LLM, and writes a structured markdown document with:

- **Meta Information**: version, author, requirements
- **Task Overview** and **Flow Tables**
- **Configuration Summaries**: stimuli, timing, triggers
- **Methods** section ready for manuscripts

### 4. LLMs-Powered Localization

#### 4.1 In-Memory Translation

```python
client = LLMClient("deepseek", os.getenv("DEEPSEEK_KEY"), "deepseek-chat")

# No file paths supplied: the translated config is returned in memory
# (see 4.2 below for writing it to disk)
translated = client.translate_config(
    target_language="Japanese"
)
print(translated)
```
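
For one-off strings such as welcome messages, `translate()` returns the translated text directly (method and example as in the Quick Reference):

```python
german = client.translate("Welcome to the experiment.", target_language="German")
print(german)
# -> Willkommen zum Experiment.
```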

#### 4.2 Translate and Save

```python
translated = client.translate_config(
    target_language="Spanish",
    config="./config/config.yaml",
    output_dir="./config",
    output_name="config.es.yaml"
)
print("Saved to ./config/config.es.yaml")
```

This updates your YAML fields (e.g., instruction labels and stimulus text) and writes the translated file to the output directory; if no `output_name` is given, a `.translated.yaml` suffix is used by default.

```{Note}
I am trying to implement a more robust doc2task pipeline for PsyFlow tasks.
Stay tuned for updates!
```