openthoughts/tasktrove-a1-agenttuning-kg
a1 agenttuning kg tasks (10000 tasks). Part of OpenThoughts TaskTrove. See https://huggingface.co/datasets/open-thoughts/TaskTrove. Source: DCAgent/a1_agenttuning_kg. Reward Kit LLM-judge verifier injected; consumers must set ANTHROPIC_API_KEY in their environment to run trials.
harbor run -d openthoughts/tasktrove-a1-agenttuning-kg