Datasets
| Dataset | Tasks |
|---|---|
| Dataset | Tasks |
|---|---|
| Dataset | Tasks |
|---|---|
openthoughts/tasktrove-nemotron-gym-agent-calendar | 3,358 |
openthoughts/tasktrove-nemotron-gym-agent-workplace-v2 | 297 |
openthoughts/tasktrove-nemotron-gym-competitive-coding | 15,713 |
openthoughts/tasktrove-nemotron-gym-identity-following-v2 | 21,660 |
openthoughts/tasktrove-nemotron-gym-instruction-following-adversarial-v3 | 1,000 |
openthoughts/tasktrove-nemotron-gym-instruction-following-calendar | 8,387 |
openthoughts/tasktrove-nemotron-gym-instruction-following-structured | 9,437 |
openthoughts/tasktrove-nemotron-gym-instruction-following-v2 | 46,391 |
openthoughts/tasktrove-nemotron-gym-knowledge-web-search-mcqa | 2,915 |
openthoughts/tasktrove-nemotron-gym-math-advanced-calculations-v3 | 5,291 |
openthoughts/tasktrove-nl2bash-tasks-cleaned-oracle | 1,570 |
openthoughts/tasktrove-openswe-tasks-patched-v5-oracle-success | 17,504 |
openthoughts/tasktrove-r2egym-patched-full-oracle | 3,328 |
openthoughts/tasktrove-selfinstruct-naive-sandboxes-2-verified | 9,638 |
openthoughts/tasktrove-swe-rebench-patched-oracle | 3,787 |
openthoughts/tasktrove-swe-rebench-v2-patched-oracle | 18,341 |
openthoughts/tasktrove-swegym-tasks-patched-validated-v2 | 989 |
openthoughts/tasktrove-swesmith-oracle-filtered | 12,942 |
pgcodellm/rebench-v2-test | 20 |
qcircuitbench/qcircuitbench | 28 |
quesma/compilebench | 15 |
quesma/otel-bench | 26 |
quixbugs/quixbugs | 80 |
reasoning-gym/reasoning-gym-easy | 288 |
reasoning-gym/reasoning-gym-hard | 288 |
replicationbench/replicationbench | 90 |
rexbench/rexbench | 2 |
satbench/satbench | 2,100 |
scale-ai/hil-bench | 600 |
scale-ai/swe-atlas-qna | 124 |
204 datasets