Datasets
| Dataset | Tasks |
|---|---|
| Dataset | Tasks |
|---|---|
| Dataset | Tasks |
|---|---|
lica-world/gdb | 33,786 |
LiteCoder/LiteCoder-rl | 602 |
livecodebench/livecodebench | 100 |
maxbittker/runebench | 32 |
meta/mlgym-bench | 12 |
MichaelY310/devopsgym | 728 |
minnesotanlp/aar | 1,400 |
mmtb/multimedia-terminalbench | 105 |
NovitaAI/tb21-code-debug | 32 |
NovitaAI/tb21-data-science | 26 |
NovitaAI/tb21-file-recovery | 9 |
NovitaAI/tb21-systems-security | 22 |
openai/mmmlu | 150 |
openai/simpleqa | 4,326 |
openai/swe-lancer-diamond-all | 463 |
openai/swe-lancer-diamond-ic | 198 |
openai/swe-lancer-diamond-manager | 265 |
openthoughts/openthoughts-tblite | 100 |
openthoughts/tasktrove-code-contests-noblock | 8,728 |
openthoughts/tasktrove-exp-flat25-pseudocode-v2 | 728 |
openthoughts/tasktrove-exp-flat25-speed-bonus-v2 | 764 |
openthoughts/tasktrove-exp-flat25-stackoverflow-v2 | 765 |
openthoughts/tasktrove-exp-flat25-subtle-debug-v3 | 289 |
openthoughts/tasktrove-exp-rle-adversarial | 5,000 |
openthoughts/tasktrove-exp-rle-detailed-v3 | 413 |
openthoughts/tasktrove-exp-rle-error-report-v3 | 261 |
openthoughts/tasktrove-exp-rle-github-issue-v3 | 264 |
openthoughts/tasktrove-exp-rle-heavy-padding-v2 | 784 |
openthoughts/tasktrove-exp-rle-minimal-instructions-v3 | 233 |
openthoughts/tasktrove-exp-rpt-codenet-python-v2 | 10,000 |
204 datasets