Datasets
| Dataset | Tasks |
|---|---|
Terminal-Bench 2.1 subset covering data science, scientific computing, machine learning, model training, optimization, querying, and video processing tasks, curated for the Novita Sandbox Hackathon.
harbor run -d NovitaAI/tb21-data-science

Data science, scientific computing, machine learning, model training, optimization, querying, and video processing tasks from Terminal-Bench 2.1.
This public Harbor dataset is curated by NovitaAI for a Novita Sandbox hackathon track. It is a category-based subset of Terminal-Bench 2.1. Agents and models are not fixed; the only required runtime environment for the hackathon is Novita Sandbox.
- Dataset: NovitaAI/tb21-data-science
- Required environment flag: -e novita

Run the full track once:
harbor run \
-d NovitaAI/tb21-data-science \
-a <agent> \
-m <model> \
-e novita \
-k 1 \
-n 1 \
-y
Run a small smoke test from the track:
harbor run \
-d NovitaAI/tb21-data-science \
-a <agent> \
-m <model> \
-e novita \
-l 1 \
-k 1 \
-n 1 \
-y
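Since the full run and the smoke test differ only in the `-l 1` flag, the two commands above can be generated from one small helper. The following is a minimal sketch, not part of Harbor: the function name `tb21_cmd` and its argument order are hypothetical, and it only prints the command so it can be reviewed before being run. It uses only the flags that appear in the commands above.

```shell
# Hypothetical helper (not part of Harbor): print the harbor command for this track.
# Arguments: <agent> <model> [limit]
# Only flags that already appear in the commands above are used.
tb21_cmd() {
  agent="$1"
  model="$2"
  limit="${3:-}"

  # The dataset and the Novita environment are fixed for this track.
  cmd="harbor run -d NovitaAI/tb21-data-science -a $agent -m $model -e novita"

  # Add -l only when a limit is given, mirroring the smoke-test command.
  if [ -n "$limit" ]; then
    cmd="$cmd -l $limit"
  fi

  printf '%s\n' "$cmd -k 1 -n 1 -y"
}
```

Calling `tb21_cmd <agent> <model>` prints the full-track command, and `tb21_cmd <agent> <model> 1` prints the smoke-test variant. Agent and model names containing spaces are not handled by this sketch.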
Upload a public result for the hackathon leaderboard:
harbor upload jobs/<job_name> --public
Submit the resulting Harbor Hub job link to the hackathon leaderboard form.
A valid track submission should satisfy:
- The dataset is NovitaAI/tb21-data-science.
- The job uses environment.type = "novita".

Suggested ranking fields:
Tasks:

- terminal-bench/adaptive-rejection-sampler
- terminal-bench/bn-fit-modify
- terminal-bench/caffe-cifar-10
- terminal-bench/count-dataset-tokens
- terminal-bench/distribution-search
- terminal-bench/dna-assembly
- terminal-bench/dna-insert
- terminal-bench/hf-model-inference
- terminal-bench/llm-inference-batching-scheduler
- terminal-bench/mcmc-sampling-stan
- terminal-bench/modernize-scientific-stack
- terminal-bench/mteb-leaderboard
- terminal-bench/mteb-retrieve
- terminal-bench/portfolio-optimization
- terminal-bench/protein-assembly
- terminal-bench/pytorch-model-cli
- terminal-bench/pytorch-model-recovery
- terminal-bench/query-optimize
- terminal-bench/raman-fitting
- terminal-bench/reshard-c4-data
- terminal-bench/rstan-to-pystan
- terminal-bench/sam-cell-seg
- terminal-bench/sparql-university
- terminal-bench/train-fasttext
- terminal-bench/tune-mjcf
- terminal-bench/video-processing
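The two validity conditions for a track submission (the dataset name and `environment.type = "novita"`) can be checked mechanically before submitting. The sketch below is illustrative only: `is_valid_submission` is a hypothetical function, and it takes the two values as plain arguments because the layout of a Harbor job's configuration file is not described here. How those values are read from a job is left as an assumption.

```shell
# Hypothetical pre-submission check (not part of Harbor).
# Arguments: <dataset> <environment_type>
# Returns 0 only if both track conditions hold.
is_valid_submission() {
  dataset="$1"
  env_type="$2"

  # Condition 1: the run used this track's dataset.
  [ "$dataset" = "NovitaAI/tb21-data-science" ] || return 1

  # Condition 2: the run used the Novita environment.
  [ "$env_type" = "novita" ] || return 1

  return 0
}
```

For example, `is_valid_submission "NovitaAI/tb21-data-science" "novita" && echo ok` prints `ok`, while any other dataset or environment type makes the function return a non-zero status.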