Skip to content

Pull requests: OpenEuroLLM/JudgeArena

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Add paper experiment scripts
#47 opened May 19, 2026 by ErlisLushtaku Collaborator Loading…
Add Elo sampling and Gemini safety handling
#46 opened May 19, 2026 by ErlisLushtaku Collaborator Loading…
Add localized judge prompt presets
#45 opened May 19, 2026 by ErlisLushtaku Collaborator Loading…
Add inference metadata and thinking support
#44 opened May 19, 2026 by ErlisLushtaku Collaborator Loading…
Add native baselines and judge controls
#43 opened May 19, 2026 by ErlisLushtaku Collaborator Loading…
Integration of Soft-ELO
#42 opened May 12, 2026 by kargibora Collaborator Loading…
Per-role GenerationConfig and backend plumbing
#41 opened Apr 29, 2026 by alexrs-cohere Loading…
3 of 5 tasks
Add judge-prompt registry with per-task defaults
#40 opened Apr 29, 2026 by alexrs-cohere Loading…
3 of 4 tasks
Pin dataset revisions for reproducibility
#39 opened Apr 29, 2026 by alexrs-cohere Loading…
1 of 3 tasks
feat: add support for mt-bench-101
#22 opened Mar 9, 2026 by ErlisLushtaku Collaborator Loading…
ProTip! Follow long discussions with comments:>50.