openthoughts/tasktrove-a1-agenttuning-alfworld

a1 agenttuning alfworld tasks (10000 tasks). Part of OpenThoughts TaskTrove. See https://huggingface.co/datasets/open-thoughts/TaskTrove. Source: DCAgent/a1_agenttuning_alfworld. Reward Kit LLM-judge verifier injected; consumers must set ANTHROPIC_API_KEY in their environment to run trials.

harbor run -d openthoughts/tasktrove-a1-agenttuning-alfworld

openthoughts/tasktrove-a1-agenttuning-alfworld