harb/tools
openhands a23064f576 fix: batch-eval.sh aborts entire generation on single candidate compile failure (#901)
- Add skip_candidate() helper that emits fitness=0 JSON to stdout and
  tracks the failed score for the output-dir file, satisfying the
  downstream scorer's expectation of one JSON line per candidate.
- Unify all failure paths (transpile, forge build, bytecode extract,
  empty bytecode) through skip_candidate() with a distinct error key.
- Log message now reads "WARNING: <id> compile failed — scoring as 0"
  as required by the acceptance criteria.
- Output-dir scores.jsonl now merges successful + failed scores so the
  file is complete even when some candidates fail to compile.
- All-candidates-fail path (COMPILED_COUNT=0) still exits 2 (no viable
  population); true infra errors (missing tool, bad RPC) unchanged.
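
The behaviour described above can be sketched roughly as follows. This is a minimal illustration, not the actual contents of batch-eval.sh: only `skip_candidate` and `COMPILED_COUNT` are named in the commit message, while `FAILED_SCORES`, `merge_scores`, `check_population`, the `candidate_id` field, and the example error keys are hypothetical names chosen for the sketch.

```shell
# Sketch only: skip_candidate and COMPILED_COUNT come from the commit message;
# every other name and the JSON field layout are assumptions.

NL='
'
FAILED_SCORES=""    # newline-terminated JSON lines for failed candidates
COMPILED_COUNT=0    # number of candidates that compiled successfully

# Score a failed candidate as 0 instead of aborting the whole generation.
# $1 = candidate id, $2 = error key naming the stage that failed.
skip_candidate() {
  local id="$1" error_key="$2" line
  line=$(printf '{"candidate_id":"%s","fitness":0,"error":"%s"}' "$id" "$error_key")
  echo "WARNING: $id compile failed — scoring as 0" >&2
  printf '%s\n' "$line"                          # one JSON line per candidate on stdout
  FAILED_SCORES="${FAILED_SCORES}${line}${NL}"   # kept for the output-dir file
}

# Write successful and failed scores into a single scores.jsonl.
# $1 = file holding successful score lines, $2 = destination file.
merge_scores() {
  local ok_file="$1" out_file="$2"
  : > "$out_file"
  if [ -s "$ok_file" ]; then
    cat "$ok_file" >> "$out_file"
  fi
  printf '%s' "$FAILED_SCORES" >> "$out_file"
}

# Return 2 when nothing compiled (no viable population), 0 otherwise.
check_population() {
  if [ "$COMPILED_COUNT" -eq 0 ]; then
    return 2
  fi
  return 0
}
```

In a loop over candidates, each failure path (transpile, forge build, bytecode extract, empty bytecode) would call `skip_candidate "$id" <error_key>` and continue with the next candidate. The function should be called directly rather than inside `$(...)`, since a command substitution runs in a subshell and the update to `FAILED_SCORES` would be lost.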

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-17 06:09:18 +00:00
push3-evolution fix: batch-eval.sh aborts entire generation on single candidate compile failure (#901) 2026-03-17 06:09:18 +00:00
push3-transpiler fix: package.json missing 'type': 'module' inconsistent with AGENTS.md (#850) 2026-03-16 06:35:05 +00:00
red-team fix: feat: persist red-team cross-patterns in repo for continuity across runs (#853) 2026-03-16 12:39:39 +00:00
deploy-optimizer.sh fix: add --tc DeployLocal to forge script invocations 2026-03-12 23:12:25 +00:00