openthoughts/tasktrove-a1-codeactinstruct
10,000 a1 codeactinstruct tasks, part of OpenThoughts TaskTrove (see https://huggingface.co/datasets/open-thoughts/TaskTrove). Source: DCAgent/a1_codeactinstruct. A Reward Kit LLM-judge verifier is injected, so consumers must set ANTHROPIC_API_KEY in their environment before running trials.
harbor run -d openthoughts/tasktrove-a1-codeactinstruct
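Because the LLM-judge verifier needs ANTHROPIC_API_KEY, it can help to check for the key before starting a run. The following is a minimal sketch, not part of harbor: `check_judge_key` is a hypothetical helper name, and it only tests that the variable is non-empty, not that the key is valid.

```shell
# Hypothetical preflight helper (an assumption, not a harbor feature):
# return non-zero if ANTHROPIC_API_KEY is unset or empty.
check_judge_key() {
  if [ -z "${ANTHROPIC_API_KEY:-}" ]; then
    echo "ANTHROPIC_API_KEY is not set; the LLM-judge verifier cannot run" >&2
    return 1
  fi
  return 0
}
```

With the helper defined, the run command can be guarded as `check_judge_key && harbor run -d openthoughts/tasktrove-a1-codeactinstruct`, so the run starts only when the key is present.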