harb/scripts/harb-evaluator
openhands f3fb1c3db0 fix: fix: red-team cross-pattern export records intermediate states as DECREASED (#852)
The extract_memory regex previously matched any "lm.?eth" mention,
including mid-execution "Total LM ETH: X wei" output lines produced by
the agent's cast check commands.  During a staking step these lines
reflect an intermediate chain state (ETH temporarily locked/moved)
rather than the final reverted state, causing strategies to be recorded
as DECREASED even when the runner confirmed ETH_SAFE.

Fix: narrow the capture to the structured `lm_eth_after: <value>`
label that the agent writes in its final RED-TEAM REPORT block.
Mid-execution total-ETH lines no longer match and cannot corrupt the
per-strategy result in memory or the cross-patterns file.
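
A minimal sketch of why the narrower capture helps. The real expressions in red-team.sh are not shown on this page, so both patterns below, the `first_match` helper, and the sample log text are illustrative assumptions rather than the actual implementation:

```python
import re

# Hypothetical stand-in for the old, broad pattern: any "lm.?eth" mention
# followed by a number is accepted.
BROAD = re.compile(r"lm.?eth\D*(\d+)", re.IGNORECASE)

# Hypothetical stand-in for the narrowed pattern: only the structured
# `lm_eth_after: <value>` label at the start of a line is accepted.
NARROW = re.compile(r"^\s*lm_eth_after:\s*(\d+)", re.MULTILINE)

# Made-up agent output: one mid-execution check during staking, then the
# final report block after the state has reverted.
SAMPLE_LOG = """\
Total LM ETH: 900 wei
RED-TEAM REPORT
lm_eth_after: 1000
"""


def first_match(pattern, text):
    """Return the first captured integer, or None if nothing matches."""
    m = pattern.search(text)
    return int(m.group(1)) if m else None


broad_value = first_match(BROAD, SAMPLE_LOG)    # picks up the intermediate line
narrow_value = first_match(NARROW, SAMPLE_LOG)  # picks up only the report label

print(broad_value, narrow_value)
```

In this sketch the broad pattern captures the intermediate 900 wei reading, which would look like a decrease, while the narrow pattern captures only the final reported value.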

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
2026-03-16 13:19:03 +00:00
helpers fix: Stale JSDoc in navigateToStakePage refers to '/stake' not '/app/stake' (#509) 2026-03-13 10:37:14 +00:00
scenarios/passive-confidence fix: correct buyKrk call sites for new opts param, add eslint-disable for polling loop 2026-03-05 05:53:19 +00:00
bootstrap-light.sh fix: fix: red-team.sh V3_FACTORY hardcodes Base mainnet address instead of Sepolia (#854) 2026-03-16 12:02:17 +00:00
evaluate.sh fix: wait_healthy does not fail fast when a service exits or crashes during the health-check window (#387) 2026-03-06 11:20:54 +00:00
export-attacks.py fix: red-team.sh and export-attacks.py use Base Sepolia addresses labeled as mainnet (#794) 2026-03-15 06:48:16 +00:00
holdout.config.ts fix: address PR #438 review findings 2026-03-04 08:20:11 +00:00
red-team-program.md fix: red-team-program.md taxRate naming inconsistency (pre-existing) (#835) 2026-03-16 09:46:55 +00:00
red-team-sweep.sh fix: feat: persist red-team cross-patterns in repo for continuity across runs (#853) 2026-03-16 12:39:39 +00:00
red-team.sh fix: fix: red-team cross-pattern export records intermediate states as DECREASED (#852) 2026-03-16 13:19:03 +00:00