Datasets
| Dataset | Tasks |
|---|---|
| Dataset | Tasks |
|---|---|
Terminal-Bench 2.1 subset for file operations, data manipulation, document processing, log analysis, and recovery tasks, curated for Novita Sandbox Hackathon.
harbor run -d NovitaAI/tb21-file-recoveryFile operations, data manipulation, document processing, log analysis, extraction, and recovery tasks from Terminal-Bench 2.1.
This public Harbor dataset is curated by NovitaAI for a Novita Sandbox hackathon track. It is a category-based subset of Terminal-Bench 2.1. Agents and models are not fixed; the only required runtime environment for the hackathon is Novita Sandbox.
NovitaAI/tb21-file-recovery-e novitaRun the full track once:
harbor run \
-d NovitaAI/tb21-file-recovery \
-a <agent> \
-m <model> \
-e novita \
-k 1 \
-n 1 \
-y
Run a small smoke test from the track:
harbor run \
-d NovitaAI/tb21-file-recovery \
-a <agent> \
-m <model> \
-e novita \
-l 1 \
-k 1 \
-n 1 \
-y
Upload a public result for the hackathon leaderboard:
harbor upload jobs/<job_name> --public
Submit the resulting Harbor Hub job link to the hackathon leaderboard form.
A valid track submission should satisfy:
NovitaAI/tb21-file-recovery.environment.type = "novita".Suggested ranking fields:
terminal-bench/db-wal-recoveryterminal-bench/extract-elfterminal-bench/extract-moves-from-videoterminal-bench/financial-document-processorterminal-bench/gcode-to-textterminal-bench/large-scale-text-editingterminal-bench/log-summary-date-rangesterminal-bench/multi-source-data-mergerterminal-bench/regex-log