Releases: jonathan-laurent/delphyne
v0.14.0
- VSCode Extension: when a test is stuck on a query, a new command allows seeing diffs with all unreachable queries of the same type, which is very useful for repairing demonstrations after a strategy change.
- Add `delphyne browse` command for generating browsable traces from raw traces.
- Add `Run` nodes that subclass `Branch` for extracting a single element from an opaque space without branching.
- Breaking: Overhaul example selectors for better flexibility (see `select_examples` argument of `few_shot`).
- Breaking: Overhaul `Stream` to make it an iterable directly and remove `Stream.gen`.
- Breaking: Overhaul the `Experiment` class to make it more ergonomic.
- Implement support for embeddings and embedding-based example retrieval (including the standard MMR algorithm).
- Remove `auto_reload` setting, which is fundamentally unsafe.
- Experimental: add support for feedback backpropagation.
- Add `init` key to `delphyne.yaml` to register custom initializers.
- Remove automatic reloading of Python modules, which is very error prone. Users should manually restart the server instead.
- Allow replaying failing experiment configurations in a debugger.
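
For readers unfamiliar with the MMR (maximal marginal relevance) algorithm mentioned in the notes above, here is a minimal, generic sketch of the selection rule. It is not Delphyne's implementation: the names `cosine` and `mmr_select`, the `lam` trade-off parameter, and the use of cosine similarity are all assumptions made for illustration.

```python
import math
from typing import List, Sequence


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity between two vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv) if nu and nv else 0.0


def mmr_select(
    query: Sequence[float],
    candidates: Sequence[Sequence[float]],
    k: int,
    lam: float = 0.5,
) -> List[int]:
    """Greedily pick up to k candidate indices.

    Each step picks the candidate maximizing
        lam * sim(candidate, query) - (1 - lam) * max sim(candidate, selected)
    so lam = 1.0 ranks purely by relevance and lower values favor diversity.
    (Hypothetical helper, not part of any library.)
    """
    relevance = [cosine(query, c) for c in candidates]
    selected: List[int] = []
    remaining = list(range(len(candidates)))

    def score(i: int) -> float:
        # Redundancy is the similarity to the closest already-selected item.
        redundancy = max(
            (cosine(candidates[i], candidates[j]) for j in selected),
            default=0.0,
        )
        return lam * relevance[i] - (1.0 - lam) * redundancy

    while remaining and len(selected) < k:
        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Toy embeddings: candidate 1 is a near-duplicate of candidate 0.
query = [1.0, 0.0]
examples = [[1.0, 0.0], [1.0, 0.01], [0.6, 0.8]]
by_relevance = mmr_select(query, examples, k=2, lam=1.0)
with_diversity = mmr_select(query, examples, k=2, lam=0.3)
```

With `lam=1.0` the near-duplicate is selected second. With a lower `lam` the redundancy penalty pushes the selection toward the more distinct example.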
v0.13.0
- Breaking: Change signature of `dp.compute` to allow passing additional keyword arguments. You must now write `dp.compute(fun)(*args, **kwargs)` instead of `dp.compute(fun, *args, **kwargs)`.
- Add `override_args` argument to `dp.compute` and `dp.elim_compute`. This allows policies to override timeout arguments for tool calls.
- Add `elim_values` and `binarize_values` tree transformers.
- Add new policy for handling `Abduction` nodes: `abduct_recursively`.
- Add a `Data` effect for loading evolving external data.
- Fix summary generation bug in experiment launcher.
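
The two-step calling convention introduced for `dp.compute` in this release can be illustrated with a toy wrapper. This is only a sketch of the general pattern, not Delphyne's code: `compute_sketch`, `run_tool`, and the dict-valued `override_args` are hypothetical stand-ins, and the real `override_args` may have a different type and semantics.

```python
from typing import Any, Callable, Dict, Optional


def compute_sketch(
    fun: Callable[..., Any],
    *,
    override_args: Optional[Dict[str, Any]] = None,
) -> Callable[..., Any]:
    """Toy stand-in for a curried wrapper (NOT the real dp.compute).

    Because the function's own arguments are passed in a second call,
    keyword options of the wrapper itself (here, override_args) cannot
    clash with keyword arguments meant for `fun`.
    """

    def call(*args: Any, **kwargs: Any) -> Any:
        # Assumption for this sketch: overrides win over caller-supplied values.
        merged = {**kwargs, **(override_args or {})}
        return fun(*args, **merged)

    return call


def run_tool(command: str, timeout: float = 10.0) -> str:
    """Hypothetical tool call that just reports its effective timeout."""
    return f"{command} (timeout={timeout})"


# New style: wrapper options first, then the function's own arguments.
plain = compute_sketch(run_tool)("make", timeout=5.0)
overridden = compute_sketch(run_tool, override_args={"timeout": 60.0})(
    "make", timeout=5.0
)
```

In the old single-call form, every keyword argument would be forwarded to the wrapped function, leaving no unambiguous place for wrapper-level options.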
v0.12.0
- Add a `take` test instruction to the demonstration language for manually specifying an action to take at a given node. This ensures that the demonstration language is complete (any success node can be reached via a demonstration).
- Breaking: improve navigation function for `Abduction` nodes to make it (conditionally) invertible.
- Add `workers_setup` argument to `Experiment`.
- Experimental: add support for external answer sources in demonstrations and in the `run_strategy` command, and for hindsight feedback.
- Fix path bug in experiment launcher.