harb/scripts/harb-evaluator
openhands fe3a3d7d94 fix: feat: persist red-team cross-patterns in repo for continuity across runs (#853)
- Move CROSS_PATTERNS_FILE from /tmp/red-team-cross-patterns.jsonl to
  tools/red-team/cross-patterns.jsonl (repo-tracked path)
- Remove the reset (> file) at sweep start so patterns accumulate across runs
- Generate a SWEEP_ID (sweep-YYYYMMDD-HHMMSS) at sweep start and stamp
  each new entry with sweep_id for traceability
- Deduplicate on (pattern, candidate, result): entries already present in
  the file are skipped; intra-batch duplicates are also suppressed
- Create tools/red-team/ directory with .gitkeep
- Add mkdir -p guards in both scripts so the directory is created on first run

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-16 12:39:39 +00:00
..
helpers fix: Stale JSDoc in navigateToStakePage refers to '/stake' not '/app/stake' (#509) 2026-03-13 10:37:14 +00:00
scenarios/passive-confidence fix: correct buyKrk call sites for new opts param, add eslint-disable for polling loop 2026-03-05 05:53:19 +00:00
bootstrap-light.sh fix: fix: red-team.sh V3_FACTORY hardcodes Base mainnet address instead of Sepolia (#854) 2026-03-16 12:02:17 +00:00
evaluate.sh fix: wait_healthy does not fail fast when a service exits or crashes during the health-check window (#387) 2026-03-06 11:20:54 +00:00
export-attacks.py fix: red-team.sh and export-attacks.py use Base Sepolia addresses labeled as mainnet (#794) 2026-03-15 06:48:16 +00:00
holdout.config.ts fix: address PR #438 review findings 2026-03-04 08:20:11 +00:00
red-team-program.md fix: red-team-program.md taxRate naming inconsistency (pre-existing) (#835) 2026-03-16 09:46:55 +00:00
red-team-sweep.sh fix: feat: persist red-team cross-patterns in repo for continuity across runs (#853) 2026-03-16 12:39:39 +00:00
red-team.sh fix: feat: persist red-team cross-patterns in repo for continuity across runs (#853) 2026-03-16 12:39:39 +00:00