Datasets
| Dataset | Tasks |
|---|---|
| shogunpurple/9CBCC967-BDAF-43D0-9485-52EBBE4A5094 | 156 |
| openthoughts/tasktrove-swesmith-sandboxes-with-tests-gpt-5-mini-passed | 7,089 |
| terminal-bench/terminal-bench-2-1 | 89 |
| kgmon/deepsearchqa | 900 |
| openthoughts/tasktrove-nl2bash-tasks-cleaned-oracle | 1,570 |
| openthoughts/tasktrove-nemotron-prismmath-sandboxes-1 | 10,000 |
| openthoughts/tasktrove-r2egym-numpy-test | 5 |
| openthoughts/tasktrove-swe-rebench-patched | 6,542 |
| openthoughts/tasktrove-code-contests-noblock | 8,728 |
| openthoughts/tasktrove-exp-llmve-llm-verifier-clean-sandboxes-eval-set | 90 |
| openthoughts/tasktrove-dclm-baseline-terminal-sandboxes | 10,000 |
| openthoughts/tasktrove-swegym-tasks | 2,438 |
| openthoughts/tasktrove-exp-llmve-llm-verifier-dcagent-dev-set | 70 |
| openthoughts/tasktrove-exp-llmve-llm-verifier-code-contests | 10,000 |
| openthoughts/tasktrove-freelancer-projects-sandboxes-ta-rl-gpt-5-mini | 9,999 |
| openthoughts/tasktrove-exp-llmve-llm-verifier-freelancer-sandboxes | 8,580 |
| openthoughts/tasktrove-harbor-devel-sandboxes-skywork-response | 5 |
| openthoughts/tasktrove-llm-verifier-code-contests-noblock | 16 |
| openthoughts/tasktrove-llm-verifier-code-contests | 16 |
| openthoughts/tasktrove-swesmith-sandboxes-with-tests | 10,000 |
| openthoughts/tasktrove-llm-verifier-freelancer | 10,000 |
| openthoughts/tasktrove-r2egym-patched-codex-solved | 1,785 |
| openthoughts/tasktrove-r2egym-patched-full-oracle | 3,328 |
| openthoughts/tasktrove-swe-rebench-patched-oracle | 3,787 |
| openthoughts/tasktrove-swegym-tasks-patched-validated | 989 |
| openthoughts/tasktrove-swesmith-datascience-skorch-sandboxes | 20,000 |
| openthoughts/tasktrove-swegym-tasks-patched | 2,438 |
| openthoughts/tasktrove-nl2bash-verified-cleaned | 10,172 |
| openthoughts/tasktrove-wikitable-format-conversion | 1,397 |
| openthoughts/tasktrove-wikitable-format-conversion-qwen3-coder-480b-a35b-instruct-awq | 1,948 |

193 datasets