nvats/codeskills-bench

A small set of real-life programming tasks: bug fixes, merge-conflict resolution, dependency cleanup, API migration, and performance regressions across compact Python repositories.

harbor run -d nvats/codeskills-bench
Task
nvats/bench-mutable-default-argument
nvats/bench-flask-2-to-3-middleware-break
nvats/bench-api-contract-backwards-compat
nvats/bench-pagination-last-page-off-by-one
nvats/bench-circular-import-cold-load
nvats/bench-pydantic-v1-to-v2-migration
nvats/bench-missing-teardown-fixture-leak
nvats/bench-datetime-utcnow-deprecation
nvats/bench-test-mocks-wrong-module-path
nvats/bench-cache-stale-after-write
nvats/bench-rebase-dropped-security-fix
nvats/bench-hidden-global-now-drift
nvats/bench-feature-flag-removal-with-sideeffect
nvats/bench-flaky-test-order-dependent
nvats/bench-merge-conflict-parser-features
nvats/bench-regex-greedy-cross-line
nvats/bench-missing-lock-on-counter
nvats/bench-concurrent-dict-key-conflict
nvats/bench-timezone-dst-boundary
nvats/bench-pytest-asyncio-strict-mode
nvats/bench-n-plus-one-db-lookup
nvats/bench-config-default-precedence
nvats/bench-sqlalchemy-legacy-to-2.0

Displaying 23 of 23 tasks