openthoughts/tasktrove-a1-codeactinstruct

a1 codeactinstruct tasks (10000 tasks). Part of OpenThoughts TaskTrove. See https://huggingface.co/datasets/open-thoughts/TaskTrove. Source: DCAgent/a1_codeactinstruct. Reward Kit LLM-judge verifier injected; consumers must set ANTHROPIC_API_KEY in their environment to run trials.

harbor run -d openthoughts/tasktrove-a1-codeactinstruct

openthoughts/tasktrove-a1-codeactinstruct