Commit 6d8f631
fix(core): extract pi-cli tool calls from streaming events (#782)
* fix(core): extract pi-cli tool calls from streaming events for skill-trigger
Pi CLI emits tool_execution_start/end events in JSONL output, but the
provider only extracted tool calls from message content arrays. This
caused the skill-trigger evaluator to miss pi's skill file reads.
Now extractMessages() also scans for tool_execution_start/end events
and injects reconstructed tool calls into assistant messages. Also
handles tool_call (snake_case) content type variant.
Closes #780
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix(core): avoid mutating readonly Message in injectEventToolCalls
Replace target message with a new object instead of casting to bypass
readonly constraint, per code review feedback.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix(evals): restore skill-trigger assertion for agent-plugin-review eval
Re-adds the skill-trigger assertion that was removed as a workaround
for #780. Now that pi-cli tool call extraction is fixed, the evaluator
can detect when pi loads the agent-plugin-review skill.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* fix(evals): configure pi-cli target with model and remove workers: 1
Pi-cli target needs subprovider/model/api_key to produce meaningful
output. Without them, pi uses its default which returns empty responses.
Also removes workers: 1 from agent-plugin-review eval since all test
cases are read-only reviews that can safely run in parallel.
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
* style: fix formatting in package.json files
Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
---------
Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>1 parent c5c7a11 commit 6d8f631
File tree
7 files changed
+351
-27
lines changed- .agentv
- apps/cli
- evals/agentic-engineering
- packages
- core
- src/evaluation/providers
- test/evaluation/providers
- eval
7 files changed
+351
-27
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
13 | 13 | | |
14 | 14 | | |
15 | 15 | | |
| 16 | + | |
| 17 | + | |
16 | 18 | | |
17 | 19 | | |
18 | 20 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
17 | | - | |
18 | | - | |
19 | | - | |
20 | | - | |
| 17 | + | |
21 | 18 | | |
22 | 19 | | |
23 | 20 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
3 | 3 | | |
4 | 4 | | |
5 | 5 | | |
6 | | - | |
7 | 6 | | |
8 | 7 | | |
9 | 8 | | |
| |||
20 | 19 | | |
21 | 20 | | |
22 | 21 | | |
| 22 | + | |
| 23 | + | |
23 | 24 | | |
24 | 25 | | |
25 | 26 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
38 | 38 | | |
39 | 39 | | |
40 | 40 | | |
41 | | - | |
42 | | - | |
43 | | - | |
44 | | - | |
| 41 | + | |
45 | 42 | | |
46 | 43 | | |
47 | 44 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
539 | 539 | | |
540 | 540 | | |
541 | 541 | | |
| 542 | + | |
| 543 | + | |
| 544 | + | |
| 545 | + | |
542 | 546 | | |
543 | 547 | | |
544 | 548 | | |
| |||
580 | 584 | | |
581 | 585 | | |
582 | 586 | | |
| 587 | + | |
| 588 | + | |
583 | 589 | | |
584 | 590 | | |
585 | 591 | | |
586 | 592 | | |
587 | 593 | | |
588 | 594 | | |
589 | | - | |
590 | | - | |
| 595 | + | |
| 596 | + | |
591 | 597 | | |
592 | | - | |
| 598 | + | |
| 599 | + | |
593 | 600 | | |
594 | 601 | | |
595 | | - | |
| 602 | + | |
| 603 | + | |
| 604 | + | |
| 605 | + | |
| 606 | + | |
| 607 | + | |
| 608 | + | |
| 609 | + | |
| 610 | + | |
| 611 | + | |
| 612 | + | |
| 613 | + | |
| 614 | + | |
| 615 | + | |
| 616 | + | |
| 617 | + | |
| 618 | + | |
| 619 | + | |
| 620 | + | |
| 621 | + | |
| 622 | + | |
| 623 | + | |
| 624 | + | |
| 625 | + | |
| 626 | + | |
| 627 | + | |
| 628 | + | |
| 629 | + | |
| 630 | + | |
| 631 | + | |
| 632 | + | |
| 633 | + | |
596 | 634 | | |
597 | 635 | | |
598 | | - | |
599 | | - | |
600 | | - | |
601 | | - | |
| 636 | + | |
| 637 | + | |
| 638 | + | |
| 639 | + | |
| 640 | + | |
| 641 | + | |
| 642 | + | |
| 643 | + | |
| 644 | + | |
| 645 | + | |
| 646 | + | |
| 647 | + | |
| 648 | + | |
| 649 | + | |
| 650 | + | |
| 651 | + | |
| 652 | + | |
| 653 | + | |
| 654 | + | |
| 655 | + | |
| 656 | + | |
| 657 | + | |
| 658 | + | |
| 659 | + | |
| 660 | + | |
| 661 | + | |
| 662 | + | |
| 663 | + | |
| 664 | + | |
| 665 | + | |
| 666 | + | |
| 667 | + | |
| 668 | + | |
| 669 | + | |
| 670 | + | |
| 671 | + | |
| 672 | + | |
| 673 | + | |
| 674 | + | |
| 675 | + | |
| 676 | + | |
| 677 | + | |
| 678 | + | |
| 679 | + | |
| 680 | + | |
| 681 | + | |
| 682 | + | |
| 683 | + | |
| 684 | + | |
| 685 | + | |
| 686 | + | |
| 687 | + | |
| 688 | + | |
| 689 | + | |
602 | 690 | | |
603 | 691 | | |
604 | 692 | | |
605 | | - | |
| 693 | + | |
| 694 | + | |
| 695 | + | |
| 696 | + | |
| 697 | + | |
| 698 | + | |
| 699 | + | |
606 | 700 | | |
607 | 701 | | |
608 | 702 | | |
| |||
720 | 814 | | |
721 | 815 | | |
722 | 816 | | |
723 | | - | |
724 | | - | |
| 817 | + | |
725 | 818 | | |
726 | 819 | | |
727 | | - | |
| 820 | + | |
728 | 821 | | |
729 | 822 | | |
730 | | - | |
731 | | - | |
| 823 | + | |
732 | 824 | | |
733 | 825 | | |
734 | 826 | | |
| |||
830 | 922 | | |
831 | 923 | | |
832 | 924 | | |
| 925 | + | |
| 926 | + | |
| 927 | + | |
| 928 | + | |
| 929 | + | |
| 930 | + | |
| 931 | + | |
0 commit comments