|
3 | 3 | All notable changes to this project will be documented in this file. |
4 | 4 |
|
5 | 5 |
|
| 6 | +## [0.4] - 2025-05-21 |
| 7 | + |
| 8 | +### 🚀 Features |
| 9 | + |
| 10 | +- Support thinking model in judge |
| 11 | +- Add nb_tool_call as ops metrics + add MCP_BRIFGE_URL + format |
| 12 | +- Parquet dataset support + ocr metrics + notebook demo |
| 13 | +- Add and handle new with_vision and prelude_prompt attribute |
| 14 | +- Calculation of the environmental impact of models for the response generation part. |
| 15 | +- Creation of two new environmental metrics: energy_consumption and gwp_consumption. |
| 16 | + |
| 17 | +### 🔧 Improvements |
| 18 | + |
| 19 | +- [UI] display of the environmental brick in the OPS pane and experiments_set metric results. |
| 20 | + |
| 21 | +### 🐛 Bug Fixes |
| 22 | + |
| 23 | +- *(runners)* Nb_tool_calls metrics computation |
| 24 | +- Improve log level warning. |
| 25 | +- *(ui)* Use two point float precision in score table. |
| 26 | +- Temporary url for MCP bridge |
| 27 | +- Tool activation and rag metrics error handling |
| 28 | +- *(ui)* Show dataset name and all model paramsi in expeset overview |
| 29 | +- *(mcp)* Allow tool_choice tuning |
| 30 | +- *(clients)* Add support for aliases models in v1/models |
| 31 | +- *(api)* Judge model must be unique in a set. |
| 32 | +- Strip answer + think |
| 33 | +- Fix multi-step agent loop generation if max_steps is reached. |
| 34 | +- Remove rerun_metric in patch exp + better handle error in patch expset route + fix format |
| 35 | +- Disallow model_judge patch for experiment and experiment_set |
| 36 | +- Parquet support and schemas |
| 37 | +- *(schema)* Rename prompt_system to system_prompt |
| 38 | +- Columns_map for ocr marker demo dataset |
| 39 | +- Dataset views |
| 40 | +- *(tasks)* Empty query |
| 41 | + |
| 42 | +### 💼 Other |
| 43 | + |
| 44 | +- Strip answer + not test on integrer !!! |
| 45 | +- *(runner)* Limit deep search steps + tool_choice 'none'. |
| 46 | +- Import collections |
| 47 | +- *(mcp)* Fix the multi-step loop |
| 48 | +- Unbound variables |
| 49 | +- Non blocking model sync |
| 50 | + |
| 51 | + |
6 | 52 | ## [0.3.1] - 2025-04-02 |
7 | 53 |
|
8 | 54 | ### 🚀 Features |
|
0 commit comments