nvats/codeskills-bench
A small set of real-life programming tasks: bug fixes, merge-conflict resolution, dependency cleanup, API migration, and performance regressions across compact Python repositories.
harbor run -d nvats/codeskills-bench

| Task |
|---|
| nvats/bench-mutable-default-argument |
| nvats/bench-flask-2-to-3-middleware-break |
| nvats/bench-api-contract-backwards-compat |
| nvats/bench-pagination-last-page-off-by-one |
| nvats/bench-circular-import-cold-load |
| nvats/bench-pydantic-v1-to-v2-migration |
| nvats/bench-missing-teardown-fixture-leak |
| nvats/bench-datetime-utcnow-deprecation |
| nvats/bench-test-mocks-wrong-module-path |
| nvats/bench-cache-stale-after-write |
| nvats/bench-rebase-dropped-security-fix |
| nvats/bench-hidden-global-now-drift |
| nvats/bench-feature-flag-removal-with-sideeffect |
| nvats/bench-flaky-test-order-dependent |
| nvats/bench-merge-conflict-parser-features |
| nvats/bench-regex-greedy-cross-line |
| nvats/bench-missing-lock-on-counter |
| nvats/bench-concurrent-dict-key-conflict |
| nvats/bench-timezone-dst-boundary |
| nvats/bench-pytest-asyncio-strict-mode |
| nvats/bench-n-plus-one-db-lookup |
| nvats/bench-config-default-precedence |
| nvats/bench-sqlalchemy-legacy-to-2.0 |
Displaying 23 of 23 tasks
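
The listing does not show what any task contains. As a rough illustration of the bug-fix category only, here is the general Python pitfall that the name `nvats/bench-mutable-default-argument` alludes to. The function names `append_buggy` and `append_fixed` are hypothetical and are not taken from the benchmark repositories; the actual task may look quite different.

```python
def append_buggy(item, bucket=[]):
    # The default list is created once, when the function is defined,
    # so every call that omits `bucket` shares the same list object.
    bucket.append(item)
    return bucket


def append_fixed(item, bucket=None):
    # Use None as a sentinel and build a fresh list on each call.
    if bucket is None:
        bucket = []
    bucket.append(item)
    return bucket
```

Calling `append_buggy` twice without a second argument returns the same list object both times, so items accumulate across calls, while `append_fixed` returns a new one-element list on each call.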