harb/scripts/harb-evaluator
openhands 4d0390c4fa fix: address review findings for cross-candidate red-team sweep (#822)
- red-team-sweep.sh: reset CROSS_PATTERNS_FILE at sweep start to prevent
  stale patterns from prior invocations contaminating a fresh run
- red-team-sweep.sh: wrap pattern-extraction Python in set +e/set -e and
  capture output so log() prefix is applied; move memory truncation outside
  the if-block so it runs unconditionally even if Python fails
- red-team.sh: filter entries where candidate == current_candidate before
  grouping, removing self-referential cross-candidate evidence
- red-team.sh: skip entries with empty pattern key (both pattern and
  strategy fields empty) to prevent spurious bucket merging

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-15 17:02:19 +00:00
..
helpers fix: Stale JSDoc in navigateToStakePage refers to '/stake' not '/app/stake' (#509) 2026-03-13 10:37:14 +00:00
scenarios/passive-confidence fix: correct buyKrk call sites for new opts param, add eslint-disable for polling loop 2026-03-05 05:53:19 +00:00
bootstrap-light.sh fix: DeployLocal.sol feeDest 0xf6a3... may have code on Base Sepolia fork (#760) 2026-03-14 20:58:34 +00:00
evaluate.sh fix: wait_healthy does not fail fast when a service exits or crashes during the health-check window (#387) 2026-03-06 11:20:54 +00:00
export-attacks.py fix: red-team.sh and export-attacks.py use Base Sepolia addresses labeled as mainnet (#794) 2026-03-15 06:48:16 +00:00
holdout.config.ts fix: address PR #438 review findings 2026-03-04 08:20:11 +00:00
red-team-sweep.sh fix: address review findings for cross-candidate red-team sweep (#822) 2026-03-15 17:02:19 +00:00
red-team.sh fix: address review findings for cross-candidate red-team sweep (#822) 2026-03-15 17:02:19 +00:00