-
Notifications
You must be signed in to change notification settings - Fork 0
Add swe-bench results for qwen3-coder-next #539
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Add swe-bench results for qwen3-coder-next #539
Conversation
📊 Progress Report❌ Schema ValidationThis report measures progress towards the 3D array goal (benchmarks × models × metrics) as described in #2. |
|
@OpenHands search the web and add fields for "country", "release_date", "parameter_count_b", "active_parameter_count_b" show your work add a comment in the PR with links proving each of the fields. |
|
I'm on it! juanmichelini can track my progress at all-hands.dev |
…b to qwen3-coder-next metadata - country: cn (Alibaba is a Chinese company) - release_date: 2026-02-02 (from official Qwen blog) - parameter_count_b: 80 (80B total parameters) - active_parameter_count_b: 3 (3B active parameters per token) Co-authored-by: openhands <[email protected]>
Sources for Added Metadata FieldsI have added the following fields to
Additional Supporting Sources:
|
|
Looks like there are a few issues preventing this PR from being merged!
If you'd like me to help, just leave a comment, like Feel free to include any additional details that might help me get this PR into a better state. You can manage your notification settings |
SummaryI have completed the requested task to add metadata fields for the Changes MadeUpdated
Checklist
PR CommentA detailed comment with source links has been added to PR #539: #539 (comment) The comment includes:
|
juanmichelini
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
|
@OpenHands fix the validation errors! |
|
I'm on it! juanmichelini can track my progress at all-hands.dev |
|
https://github.com/OpenHands fix the validation errors! |
Evaluation Results
Model:
qwen3-coder-nextBenchmark:
swe-benchAgent Version:
v1.11.1Results
Report Summary
Additional Metadata
48920This PR was automatically created by the evaluation pipeline.