Be the first to know when I publish new AI builds + demos!
An autonomous AI researcher. It takes a research objective, breaks it into experiments, spins up separate agents with access to their own GPUs to run these experiments, and delivers a paper-style writeup with findings.
- Decomposes your prompt into experiments and assigns them to specialist researcher agents.
- Each agent can launch GPU-enabled sandboxes to train models/run inference/etc., evaluate, and collect evidence.
- Based on the results of these experiments, the orchestrator can decide to finalize, or run more experiments.
- The orchestrator goes over all of the results and turns them into a coherent "paper".
The fastest way to use it:
python run_app.py
This installs missing deps, starts the API + frontend, and opens the notebook. If Google/Modal keys aren’t set, the UI will prompt you and save them locally before the run starts.
- Google AI Studio key:
GOOGLE_API_KEY - Modal tokens:
MODAL_TOKEN_IDandMODAL_TOKEN_SECRET(for GPU sandboxes) - Add them to
.envin the repo root, or paste them into the web prompt when asked.
Prefer the terminal?
python -m venv venv && source venv/bin/activate
pip install -r requirements.txt
python main.py "Does label smoothing improve ViT-Base on CIFAR-10?" --mode single --gpu any
Orchestrator (multi-agent):
python main.py "Characterize scaling laws for sparse attention transformers" \
--mode orchestrator --num-agents 3 --max-rounds 3 --max-parallel 2 --gpu any
Dry run:
python main.py "Sanity check the pipeline" --mode orchestrator --test-mode
This is a super-early, experimental harness. There are a number of improvements to be worked out (i.e. dataset sharing between agents, key management, etc.), literature search, that would make this way more capable. If anyone wants to add these in, feel free!