pydantic
diff --git a/‎README.md‎
Lines changed: 38 additions & 36 deletions b/‎README.md‎
Lines changed: 38 additions & 36 deletions
diff --git a/‎crates/fuzz/fuzz_targets/string_input_panic.rs‎
Lines changed: 0 additions & 1 deletion b/‎crates/fuzz/fuzz_targets/string_input_panic.rs‎
Lines changed: 0 additions & 1 deletion
diff --git a/‎crates/fuzz/fuzz_targets/tokens_input_panic.rs‎
Lines changed: 1 addition & 1 deletion b/‎crates/fuzz/fuzz_targets/tokens_input_panic.rs‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎crates/monty-cli/src/main.rs‎
Lines changed: 34 additions & 32 deletions b/‎crates/monty-cli/src/main.rs‎
Lines changed: 34 additions & 32 deletions
diff --git a/‎crates/monty-js/README.md‎
Lines changed: 2 additions & 5 deletions b/‎crates/monty-js/README.md‎
Lines changed: 2 additions & 5 deletions
@@ -25,22 +25,24 @@ Monty avoids the cost, latency, complexity and general faff of using a full cont
 Instead, it lets you safely run Python code written by an LLM embedded in your agent, with startup times measured in single digit microseconds not hundreds of milliseconds.
 
 What Monty **can** do:
-* Run a reasonable subset of Python code - enough for your agent to express what it wants to do
-* Completely block access to the host environment: filesystem, env variables and network access are all implemented via external function calls the developer can control
-* Call functions on the host - only functions you give it access to
-* Run typechecking - monty supports full modern python type hints and comes with [ty](https://docs.astral.sh/ty/) included in a single binary to run typechecking
-* Be snapshotted to bytes at external function calls, meaning you can store the interpreter state in a file or database, and resume later
-* Startup extremely fast (<1μs to go from code to execution result), and has runtime performance that is similar to CPython (generally between 5x faster and 5x slower)
-* Be called from Rust, Python, or Javascript - because Monty has no dependencies on cpython, you can use it anywhere you can run Rust
-* Control resource usage - Monty can track memory usage, allocations, stack depth, and execution time and cancel execution if it exceeds preset limits
-* Collect stdout and stderr and return it to the caller
-* Run async or sync code on the host via async or sync code on the host
+
+- Run a reasonable subset of Python code - enough for your agent to express what it wants to do
+- Completely block access to the host environment: filesystem, env variables and network access are all implemented via external function calls the developer can control
+- Call functions on the host - only functions you give it access to
+- Run typechecking - monty supports full modern python type hints and comes with [ty](https://docs.astral.sh/ty/) included in a single binary to run typechecking
+- Be snapshotted to bytes at external function calls, meaning you can store the interpreter state in a file or database, and resume later
+- Startup extremely fast (<1μs to go from code to execution result), and has runtime performance that is similar to CPython (generally between 5x faster and 5x slower)
+- Be called from Rust, Python, or Javascript - because Monty has no dependencies on cpython, you can use it anywhere you can run Rust
+- Control resource usage - Monty can track memory usage, allocations, stack depth, and execution time and cancel execution if it exceeds preset limits
+- Collect stdout and stderr and return it to the caller
+- Run async or sync code on the host via async or sync code on the host
 
 What Monty **cannot** do:
-* Use the standard library (except a few select modules: `sys`, `typing`, `asyncio`, `dataclasses` (soon), `json` (soon))
-* Use third party libraries (like Pydantic), support for external python library is not a goal
-* define classes (support should come soon)
-* use match statements (again, support should come soon)
+
+- Use the standard library (except a few select modules: `sys`, `typing`, `asyncio`, `dataclasses` (soon), `json` (soon))
+- Use third party libraries (like Pydantic), support for external python library is not a goal
+- define classes (support should come soon)
+- use match statements (again, support should come soon)
 
 ---
 
@@ -49,10 +51,11 @@ In short, Monty is extremely limited and designed for **one** use case:
 **To run code written by agents.**
 
 For motivation on why you might want to do this, see:
-* [Codemode](https://blog.cloudflare.com/code-mode/) from Cloudflare
-* [Programmatic Tool Calling](https://platform.claude.com/docs/en/agents-and-tools/tool-use/programmatic-tool-calling) from Anthropic
-* [Code Execution with MCP](https://www.anthropic.com/engineering/code-execution-with-mcp) from Anthropic
-* [Smol Agents](https://github.com/huggingface/smolagents) from Hugging Face
+
+- [Codemode](https://blog.cloudflare.com/code-mode/) from Cloudflare
+- [Programmatic Tool Calling](https://platform.claude.com/docs/en/agents-and-tools/tool-use/programmatic-tool-calling) from Anthropic
+- [Code Execution with MCP](https://www.anthropic.com/engineering/code-execution-with-mcp) from Anthropic
+- [Smol Agents](https://github.com/huggingface/smolagents) from Hugging Face
 
 In very simple terms, the idea of all the above is that LLMs can work faster, cheaper and more reliably if they're asked to write Python (or Javascript) code, instead of relying on traditional tool calling. Monty makes that possible without the complexity of a sandbox or risk of running code directly on the host.
 
@@ -105,7 +108,6 @@ prompt: str = ''
 m = pydantic_monty.Monty(
     code,
     inputs=['prompt'],
-    external_functions=['call_llm'],
     script_name='agent.py',
     type_check=True,
     type_check_stubs=type_definitions,
@@ -151,13 +153,13 @@ data = fetch(url)
 len(data)
 """
 
-m = pydantic_monty.Monty(code, inputs=['url'], external_functions=['fetch'])
+m = pydantic_monty.Monty(code, inputs=['url'])
 
 # Start execution - pauses when fetch() is called
 result = m.start(inputs={'url': 'https://example.com'})
 
 print(type(result))
-#> <class 'pydantic_monty.MontySnapshot'>
+#> <class 'pydantic_monty.FunctionSnapshot'>
 print(result.function_name)  # fetch
 #> fetch
 print(result.args)
@@ -174,7 +176,7 @@ print(result.output)
 
 #### Serialization
 
-Both `Monty` and `MontySnapshot` can be serialized to bytes and restored later.
+Both `Monty` and snapshot types like `FunctionSnapshot` can be serialized to bytes and restored later.
 This allows caching parsed code or suspending execution across process boundaries:
 
 ```python
@@ -190,12 +192,12 @@ print(m2.run(inputs={'x': 41}))
 #> 42
 
 # Serialize execution state mid-flight
-m = pydantic_monty.Monty('fetch(url)', inputs=['url'], external_functions=['fetch'])
+m = pydantic_monty.Monty('fetch(url)', inputs=['url'])
 progress = m.start(inputs={'url': 'https://example.com'})
 state = progress.dump()
 
 # Later, restore and resume (e.g., in a different process)
-progress2 = pydantic_monty.MontySnapshot.load(state)
+progress2 = pydantic_monty.FunctionSnapshot.load(state)
 result = progress2.resume(return_value='response data')
 print(result.output)
 #> response data
@@ -215,7 +217,7 @@ def fib(n):
 fib(x)
 "#;
 
-let runner = MontyRun::new(code.to_owned(), "fib.py", vec!["x".to_owned()], vec![]).unwrap();
+let runner = MontyRun::new(code.to_owned(), "fib.py", vec!["x".to_owned()]).unwrap();
 let result = runner.run(vec![MontyObject::Int(10)], NoLimitTracker, &mut PrintWriter::Stdout).unwrap();
 assert_eq!(result, MontyObject::Int(55));
 ```
@@ -228,7 +230,7 @@ assert_eq!(result, MontyObject::Int(55));
 use monty::{MontyRun, MontyObject, NoLimitTracker, PrintWriter};
 
 // Serialize parsed code
-let runner = MontyRun::new("x + 1".to_owned(), "main.py", vec!["x".to_owned()], vec![]).unwrap();
+let runner = MontyRun::new("x + 1".to_owned(), "main.py", vec!["x".to_owned()]).unwrap();
 let bytes = runner.dump().unwrap();
 
 // Later, restore and run
@@ -337,15 +339,15 @@ I'll try to run through the most obvious alternatives, and why there aren't righ
 
 NOTE: all these technologies are impressive and have widespread uses, this commentary on their limitations for our use case should not be seen as a criticism. Most of these solutions were not conceived with the goal of providing an LLM sandbox, which is why they're not necessary great at it.
 
-| Tech               | Language completeness | Security     | Start latency  | FOSS       | Setup complexity | File mounting  | Snapshotting |
-|--------------------|-----------------------|--------------|----------------|------------|------------------|----------------|--------------|
-| Monty              | partial               | strict       | 0.06ms         | free / OSS | easy             | easy           | easy         |
-| Docker             | full                  | good         | 195ms          | free / OSS | intermediate     | easy           | intermediate |
-| Pyodide            | full                  | poor         | 2800ms         | free / OSS | intermediate     | easy           | hard         |
-| starlark-rust      | very limited          | good         | 1.7ms          | free / OSS | easy             | not available? | impossible?  |
-| WASI / Wasmer      | partial, almost full  | strict       | 66ms           | free *     | intermediate     | easy           | intermediate |
-| sandboxing service | full                  | strict       | 1033ms         | not free   | intermediate     | hard           | intermediate |
-| YOLO Python        | full                  | non-existent | 0.1ms / 30ms   | free / OSS | easy             | easy / scary   | hard         |
+| Tech               | Language completeness | Security     | Start latency | FOSS       | Setup complexity | File mounting  | Snapshotting |
+| ------------------ | --------------------- | ------------ | ------------- | ---------- | ---------------- | -------------- | ------------ |
+| Monty              | partial               | strict       | 0.06ms        | free / OSS | easy             | easy           | easy         |
+| Docker             | full                  | good         | 195ms         | free / OSS | intermediate     | easy           | intermediate |
+| Pyodide            | full                  | poor         | 2800ms        | free / OSS | intermediate     | easy           | hard         |
+| starlark-rust      | very limited          | good         | 1.7ms         | free / OSS | easy             | not available? | impossible?  |
+| WASI / Wasmer      | partial, almost full  | strict       | 66ms          | free \*    | intermediate     | easy           | intermediate |
+| sandboxing service | full                  | strict       | 1033ms        | not free   | intermediate     | hard           | intermediate |
+| YOLO Python        | full                  | non-existent | 0.1ms / 30ms  | free / OSS | easy             | easy / scary   | hard         |
 
 See [./scripts/startup_performance.py](scripts/startup_performance.py) for the script used to calculate the startup performance numbers.
 
@@ -397,7 +399,7 @@ Running Python in WebAssembly via [Wasmer](https://wasmer.io/).
 - **Security**: In principle WebAssembly should provide strong sandboxing guarantees.
 - **Start latency**: The [wasmer](https://pypi.org/project/wasmer/) python package hasn't been updated for 3 years and I couldn't find docs on calling Python in wasmer from Python, so I called it via subprocess. Start latency was 66ms.
 - **Setup complexity**: wasmer download is 100mb, the "python/python" package is 50mb.
-- **FOSS**: I marked this as "free *" since the cost is zero but not everything seems to be open source. As of 2026-02-10 the [`python/python` wasmer package](https://wasmer.io/python/python) package has no readme, no license, no source link and no indication of how it's built, the recently uploaded versions show size as "0B" although the download is ~50MB - the build process for the Python binary is not clear and transparent. _(If I'm wrong here, please create an issue to correct correct me)_
+- **FOSS**: I marked this as "free \*" since the cost is zero but not everything seems to be open source. As of 2026-02-10 the [`python/python` wasmer package](https://wasmer.io/python/python) package has no readme, no license, no source link and no indication of how it's built, the recently uploaded versions show size as "0B" although the download is ~50MB - the build process for the Python binary is not clear and transparent. _(If I'm wrong here, please create an issue to correct correct me)_
 - **File mounting**: Supported
 - **Snapshotting**: Supported via journaling
 
 
@@ -28,7 +28,6 @@ fuzz_target!(|code: String| {
         code.to_owned(),
         "fuzz.py",
         vec![], // no inputs
-        vec![], // no external functions
     ) else {
         return; // Parse errors are expected for random input
     };
 
@@ -543,7 +543,7 @@ fuzz_target!(|tokens: Tokens| {
     let code = tokens.to_code();
 
     // Try to parse the code
-    let Ok(runner) = MontyRun::new(code, "fuzz.py", vec![], vec![]) else {
+    let Ok(runner) = MontyRun::new(code, "fuzz.py", vec![]) else {
         return; // Parse errors are expected
     };
 
 
@@ -6,8 +6,8 @@ use std::{
 
 use clap::Parser;
 use monty::{
-    LimitedTracker, MontyObject, MontyRepl, MontyRun, NoLimitTracker, PrintWriter, ReplContinuationMode,
-    ResourceLimits, ResourceTracker, RunProgress, detect_repl_continuation_mode,
+    LimitedTracker, MontyObject, MontyRepl, MontyRun, NameLookupResult, NoLimitTracker, PrintWriter,
+    ReplContinuationMode, ResourceLimits, ResourceTracker, RunProgress, detect_repl_continuation_mode,
 };
 use rustyline::{DefaultEditor, error::ReadlineError};
 // disabled due to format failing on https://github.com/pydantic/monty/pull/75 where CI and local wanted imports ordered differently
@@ -201,9 +201,8 @@ fn run_script(file_path: &str, code: String, type_check_enabled: bool, tracker:
 
     let input_names = vec![];
     let inputs = vec![];
-    let ext_functions = vec!["add_ints".to_owned()];
 
-    let runner = match MontyRun::new(code, file_path, input_names, ext_functions) {
+    let runner = match MontyRun::new(code, file_path, input_names) {
         Ok(ex) => ex,
         Err(err) => {
             eprintln!("{BOLD_RED}error{RESET}:\n{err}");
@@ -278,23 +277,15 @@ fn run_script(file_path: &str, code: String, type_check_enabled: bool, tracker:
 fn run_repl(file_path: &str, code: String, tracker: impl ResourceTracker) -> ExitCode {
     let input_names = vec![];
     let inputs = vec![];
-    let ext_functions = vec!["add_ints".to_owned()];
-
-    let (mut repl, init_output) = match MontyRepl::new(
-        code,
-        file_path,
-        input_names,
-        ext_functions,
-        inputs,
-        tracker,
-        &mut PrintWriter::Stdout,
-    ) {
-        Ok(v) => v,
-        Err(err) => {
-            eprintln!("{BOLD_RED}error{RESET} initializing repl:\n{err}");
-            return ExitCode::FAILURE;
-        }
-    };
+
+    let (mut repl, init_output) =
+        match MontyRepl::new(code, file_path, input_names, inputs, tracker, &mut PrintWriter::Stdout) {
+            Ok(v) => v,
+            Err(err) => {
+                eprintln!("{BOLD_RED}error{RESET} initializing repl:\n{err}");
+                return ExitCode::FAILURE;
+            }
+        };
 
     if init_output != MontyObject::None {
         println!("{init_output}");
@@ -401,15 +392,10 @@ fn run_until_complete(mut progress: RunProgress<impl ResourceTracker>) -> Result
     loop {
         match progress {
             RunProgress::Complete(value) => return Ok(value),
-            RunProgress::FunctionCall {
-                function_name,
-                args,
-                state,
-                ..
-            } => {
-                let return_value = resolve_external_call(&function_name, &args)?;
-                progress = state
-                    .run(return_value, &mut PrintWriter::Stdout)
+            RunProgress::FunctionCall(call) => {
+                let return_value = resolve_external_call(&call.function_name, &call.args)?;
+                progress = call
+                    .resume(return_value, &mut PrintWriter::Stdout)
                     .map_err(|err| format!("{err}"))?;
             }
             RunProgress::ResolveFutures(state) => {
@@ -418,8 +404,24 @@ fn run_until_complete(mut progress: RunProgress<impl ResourceTracker>) -> Result
                     state.pending_call_ids()
                 ));
             }
-            RunProgress::OsCall { function, args, .. } => {
-                return Err(format!("OS calls not supported in CLI: {function:?}({args:?})"));
+            RunProgress::NameLookup(lookup) => {
+                let result = if lookup.name == "add_ints" {
+                    NameLookupResult::Value(MontyObject::Function {
+                        name: "add_ints".to_string(),
+                        docstring: None,
+                    })
+                } else {
+                    NameLookupResult::Undefined
+                };
+                progress = lookup
+                    .resume(result, &mut PrintWriter::Stdout)
+                    .map_err(|err| format!("{err}"))?;
+            }
+            RunProgress::OsCall(call) => {
+                return Err(format!(
+                    "OS calls not supported in CLI: {:?}({:?})",
+                    call.function, call.args
+                ));
             }
         }
     }
 
@@ -30,7 +30,7 @@ const result = m.run({ inputs: { x: 10, y: 20 } }) // returns 30
 For synchronous external functions, pass them directly to `run()`:
 
 ```ts
-const m = new Monty('add(2, 3)', { externalFunctions: ['add'] })
+const m = new Monty('add(2, 3)')
 
 const result = m.run({
   externalFunctions: {
@@ -46,7 +46,6 @@ import { Monty, runMontyAsync } from '@pydantic/monty'
 
 const m = new Monty('fetch_data(url)', {
   inputs: ['url'],
-  externalFunctions: ['fetch_data'],
 })
 
 const result = await runMontyAsync(m, {
@@ -65,7 +64,7 @@ const result = await runMontyAsync(m, {
 For fine-grained control over external function calls, use `start()` and `resume()`:
 
 ```ts
-const m = new Monty('a() + b()', { externalFunctions: ['a', 'b'] })
+const m = new Monty('a() + b()')
 
 let progress = m.start()
 while (progress instanceof MontySnapshot) {
@@ -161,13 +160,11 @@ if (snapshot instanceof MontySnapshot) {
 - `Monty.load(data)` - Deserialize from binary format
 - `scriptName` - The script name (default: `'main.py'`)
 - `inputs` - Declared input variable names
-- `externalFunctions` - Declared external function names
 
 ### `MontyOptions`
 
 - `scriptName?: string` - Name used in tracebacks (default: `'main.py'`)
 - `inputs?: string[]` - Input variable names
-- `externalFunctions?: string[]` - External function names
 - `typeCheck?: boolean` - Enable type checking on construction
 - `typeCheckPrefixCode?: string` - Code to prepend for type checking