Datasets

Dataset | Tasks
shogunpurple/9CBCC967-BDAF-43D0-9485-52EBBE4A5094 | 156
openthoughts/tasktrove-swesmith-sandboxes-with-tests-gpt-5-mini-passed | 7,089
terminal-bench/terminal-bench-2-1 | 89
kgmon/deepsearchqa | 900
openthoughts/tasktrove-nl2bash-tasks-cleaned-oracle | 1,570
openthoughts/tasktrove-nemotron-prismmath-sandboxes-1 | 10,000
openthoughts/tasktrove-r2egym-numpy-test | 5
openthoughts/tasktrove-swe-rebench-patched | 6,542
openthoughts/tasktrove-code-contests-noblock | 8,728
openthoughts/tasktrove-exp-llmve-llm-verifier-clean-sandboxes-eval-set | 90
openthoughts/tasktrove-dclm-baseline-terminal-sandboxes | 10,000
openthoughts/tasktrove-swegym-tasks | 2,438
openthoughts/tasktrove-exp-llmve-llm-verifier-dcagent-dev-set | 70
openthoughts/tasktrove-exp-llmve-llm-verifier-code-contests | 10,000
openthoughts/tasktrove-freelancer-projects-sandboxes-ta-rl-gpt-5-mini | 9,999
openthoughts/tasktrove-exp-llmve-llm-verifier-freelancer-sandboxes | 8,580
openthoughts/tasktrove-harbor-devel-sandboxes-skywork-response | 5
openthoughts/tasktrove-llm-verifier-code-contests-noblock | 16
openthoughts/tasktrove-llm-verifier-code-contests | 16
openthoughts/tasktrove-swesmith-sandboxes-with-tests | 10,000
openthoughts/tasktrove-llm-verifier-freelancer | 10,000
openthoughts/tasktrove-r2egym-patched-codex-solved | 1,785
openthoughts/tasktrove-r2egym-patched-full-oracle | 3,328
openthoughts/tasktrove-swe-rebench-patched-oracle | 3,787
openthoughts/tasktrove-swegym-tasks-patched-validated | 989
openthoughts/tasktrove-swesmith-datascience-skorch-sandboxes | 20,000
openthoughts/tasktrove-swegym-tasks-patched | 2,438
openthoughts/tasktrove-nl2bash-verified-cleaned | 10,172
openthoughts/tasktrove-wikitable-format-conversion | 1,397
openthoughts/tasktrove-wikitable-format-conversion-qwen3-coder-480b-a35b-instruct-awq | 1,948