johba/harb - Forgejo: Beyond coding. We forge.

johba/harb

Author	SHA1	Message	Date
johba	ff86b3691d	chore: extract shared inject.sh, add red-team-sweep.sh (#806 ) ## What - `tools/push3-transpiler/inject.sh` — shared transpile+inject logic used by both batch-eval and red-team-sweep - `batch-eval.sh` — replaced inline 60-line Python block with `inject.sh` call - `scripts/harb-evaluator/red-team-sweep.sh` — red-teams each kindergarten seed using existing `red-team.sh`, with random smoke test gate ## Why Sweep script kept breaking because I rewrote the injection logic instead of reusing batch-eval's proven Python. Now there's one copy. ## Testing - inject.sh tested manually on DO box with optimizer_v3 seed - Smoke test picks random seed, injects + compiles before starting sweep Co-authored-by: openhands <openhands@all-hands.dev> Reviewed-on: https://codeberg.org/johba/harb/pulls/806 Reviewed-by: review_bot <review_bot@noreply.codeberg.org>	2026-03-15 10:24:03 +01:00
openhands	7618309db5	fix: red-team.sh and export-attacks.py use Base Sepolia addresses labeled as mainnet (#794 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-15 06:48:16 +00:00
openhands	0e33d6cbba	fix: DeployLocal.sol feeDest 0xf6a3... may have code on Base Sepolia fork (#760 )	2026-03-14 20:58:34 +00:00
openhands	e9397891ed	fix: remove setRecenterAccess from red-team.sh — recenter() is now public	2026-03-14 15:10:59 +00:00
openhands	dbf78de793	fix: bootstrap + red-team on forked networks Bootstrap fixes: - Idempotency check: skip if Kraiken already deployed on Anvil - anvil_setCode to strip ERC-4337 code from deployer + feeDest - DeployLocal.sol: feeDest derived from keccak256('harb.local.feeDest') Red-team fixes: - New bootstrap-light.sh: Anvil-only, ~30s deploy - red-team.sh uses bootstrap-light instead of full docker compose - anvil_setBalance for feeDest before impersonation - forge --color never, path resolution, docker chown Address fixes (all Base mainnet, in both FitnessEvaluator + AttackRunner): - V3_FACTORY: 0x33128a8fC17869897dcE68Ed026d694621f6FDfD - SWAP_ROUTER: 0x2626664c2603336E57B271c5C0b26F421741e481 - NPM_ADDR: 0x03a520b32C04BF3bEEf7BEb72E919cf822Ed34f1	2026-03-14 13:31:23 +00:00
johba	6ff8282a7e	Merge pull request 'fix: Remove recenterAccess — make recenter() public with TWAP enforcement (#706 )' (#713 ) from fix/issue-706 into master	2026-03-14 10:48:59 +01:00
openhands	52ed8ef233	fix: red-team.sh sudo strips FORK_URL before docker compose sees it (#729 ) red-team.sh called bare `sudo docker compose up/down` which applies env_reset and drops FORK_URL before anvil-entrypoint.sh can read it. Change both calls to `sudo -E` so the caller's FORK_URL override is propagated to docker-compose and into the anvil container. Update ENVIRONMENT.md to reflect that a plain `FORK_URL=... bash red-team.sh` invocation now works correctly. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 08:30:49 +00:00
openhands	44df166b73	fix: Bare integer interpolation in agent-prompt heredoc at line 494 (#671 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 03:07:55 +00:00
openhands	cbab4c36da	fix: NPM_ADDR may be Base Sepolia address in both files (#686 ) Replace 0x27F971cb582BF9E50F397e4d29a5C7A34f11faA2 (Base Sepolia NonfungiblePositionManager) with the correct Base mainnet address 0x03a520B32c04bf3beef7BEb72E919cF822Ed34F3 in all four files that referenced it, and add an inline comment citing the chain and source. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 02:22:51 +00:00
openhands	1a410a30b7	fix: Remove recenterAccess — make recenter() public with TWAP enforcement (#706 )	2026-03-13 22:32:53 +00:00
openhands	a18512a644	fix: Stale JSDoc in navigateToStakePage refers to '/stake' not '/app/stake' (#509 )	2026-03-13 10:37:14 +00:00
openhands	659044e2d1	fix: claude subprocess not killed on INT/TERM in cleanup trap (#530 ) Track CLAUDE_PID before launching the claude subprocess so cleanup() can kill it before reverting Anvil state. Running claude via `&` + `wait` lets the trap fire immediately on INT/TERM, killing the subprocess and preventing it from making calls against an already-reverted chain. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-13 09:48:34 +00:00
openhands	2ae07e7a49	fix: $FLOOR_BEFORE/$FLOOR_AFTER unquoted inside python3 -c string (#531 )	2026-03-13 08:28:26 +00:00
openhands	6924cb03f3	fix: Protocol Mechanics section in agent prompt still exposes ethPerToken formula (#550 )	2026-03-13 07:47:35 +00:00
openhands	b902b89e3b	fix: address review findings — CREATE2 guard, transition test, docs - LiquidityManager.setFeeDestination: add CREATE2 bypass guard — also blocks re-assignment when the current feeDestination has since acquired bytecode (was a plain address when set, contract deployed to it later) - LiquidityManager.setFeeDestination: expand NatSpec to document the EOA-mutability trade-off and the CREATE2 guard explicitly - Test: add testSetFeeDestinationEOAToContract_Locks covering the realistic EOA→contract transition (the primary lock-activation path) - red-team.sh: add comment that DEPLOYER_PK is Anvil account-0 and must only be used against a local ephemeral Anvil instance - ARCHITECTURE.md: document feeDestination conditional-lock semantics and contrast with Kraiken's strictly set-once liquidityManager/stakingPool Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-12 17:13:50 +00:00
openhands	512640226b	fix: fix: Conditional lock on feeDestination — lock when set to contract (#580 ) (#580 ) - Add `feeDestinationLocked` bool to LiquidityManager - Replace one-shot setter with conditional trapdoor: EOAs may be set repeatedly, but setting a contract address locks permanently - Remove `AddressAlreadySet` error (superseded by the new lock mechanic) - Replace fragile SLOT7 storage hack in red-team.sh with a proper `setFeeDestination()` call using the deployer key - Update tests: replace AddressAlreadySet test with three new tests covering EOA multi-set, contract lock, and locked revert Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-12 16:13:44 +00:00
johba	514a55a1ac	Merge pull request 'fix: Backtesting: replay red-team attack sequences against optimizer candidates (#536 )' (#565 ) from fix/issue-536 into master	2026-03-11 19:24:27 +01:00
openhands	58729b98b4	fix: fix: strip cast formatted annotations from red-team.sh (#577 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-11 10:19:14 +00:00
openhands	0834433db1	Fix PR #540 review findings Critical fixes: - LmTotalEth.s.sol: Fix imports to use @aperture/uni-v3-lib/ (lines 8-9) - red-team.sh: Update memory regex to match lm.?eth pattern (line 266) Additional improvements: - red-team.sh: Update adversary balance claim to ~9000 ETH (after funding LM) - red-team.sh: Add --no-color to forge invocation + emptiness guard - red-team.sh: Document feeDestination storage slot 7 fragility Tested: - Regex pattern matches all expected formats (lm_eth, lmeth, LM-ETH, etc.) - Import paths align with remappings.txt	2026-03-11 06:28:02 +00:00
openhands	0ddc1ccd80	fix: Red-team: replace ethPerToken with exact total-LM-ETH metric (#539 ) Replace the ethPerToken metric (free balance / adjusted supply) with total LM ETH (free + WETH + position-locked) using a forge script with exact Uni V3 integer math. Collapses 4+ RPC calls and Python float approximation into a single forge script call using LiquidityAmounts + TickMath. Also updates red-team prompt, report format, memory extraction, and adds roadmap items for #536-#538 (backtesting pipeline, Push3 evolution). Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-11 06:28:02 +00:00
openhands	c8453f6a33	fix: Backtesting: replay red-team attack sequences against optimizer candidates (#536 ) - Add AttackRunner.s.sol: structured forge script that reads attack ops from a JSONL file (ATTACK_FILE env), executes them against the local Anvil deployment, and emits full state snapshots (tick, positions, VWAP, optimizer output, adversary balances) as JSON lines after every recenter and at start/end. - Add 5 canonical attack files in onchain/script/backtesting/attacks/: * il-crystallization-15.jsonl — 15 buy-recenter cycles + sell (extraction) * il-crystallization-80.jsonl — 80 buy-recenter cycles + sell (extraction) * fee-drain-oscillation.jsonl — buy-recenter-sell-recenter oscillation * round-trip-safe.jsonl — 20 full round-trips (regression: safe) * staking-safe.jsonl — staking manipulation (regression: safe) - Add scripts/harb-evaluator/export-attacks.py: parses red-team-stream.jsonl for tool_use Bash blocks containing cast send commands and converts them to AttackRunner-compatible JSONL (buy/sell/recenter/stake/unstake/mint_lp/burn_lp). - Update scripts/harb-evaluator/red-team.sh: after each agent run, automatically exports the attack sequence via export-attacks.py and replays it with AttackRunner to capture structured snapshots in tmp/red-team-snapshots.jsonl. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-11 02:08:06 +00:00
openhands	816b211c2b	fix: address review findings in red-team memory (#528 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-09 10:00:56 +00:00
openhands	c1db4cb93e	fix: Red-team memory: persistent cross-run learning for adversarial agent (#528 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-09 09:23:37 +00:00
openhands	ea53e4cfce	fix: address review findings in red-team.sh (#520 ) - Move snapshot to after setRecenterAccess so agent reverts restore recenterAccess for account 2 on every retry - Read feeDestination() dynamically from LM (removes hardcoded constant) and add \|\| die guards on impersonation calls - Add EXIT/INT/TERM cleanup trap that reverts to the baseline snapshot - Fix agent floor-check snippet: add FEE_DEST/FEE_BAL reads so formula matches compute_eth_per_token (adj=s-f-k, not adj=s-k) - Use `timeout "$CLAUDE_TIMEOUT"` to enforce wall-clock process limit - Correct taxRateIndex range: 0-29 (30-element TAX_RATES array) - Fix outstandingSupply() description: excludes LM-held KRK, not all KRK Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-09 03:59:12 +00:00
openhands	23d460542b	fix: feat: Red-team agent runner — adversarial floor attack (#520 ) Adds scripts/harb-evaluator/red-team.sh which: - Verifies the Anvil stack is running and deployments exist - Grants recenterAccess to account 2 (impersonating feeDestination) - Takes an Anvil snapshot as the clean baseline - Computes ethPerToken before the agent run (mirrors floor.ts logic) - Builds a self-contained prompt with contract addresses, account keys, protocol mechanics, copy-paste cast command patterns, snapshot/revert instructions, and structured rules for the agent - Spawns `claude -p --dangerously-skip-permissions` with a 2-hour timeout - Captures output to tmp/red-team-report.txt - Computes ethPerToken after the agent run and reports pass/fail Exit code 0 = floor held, exit code 1 = floor broken, exit code 2 = infra error. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-09 03:28:10 +00:00
openhands	722ecaaa0e	fix: address review findings in stake-rpc.ts (#518 ) - Remove dead krkAddress field from UnstakeRpcConfig (bug) - Drop swap.js import to avoid transitive Playwright dependency; fix header comment to accurately describe the module boundary (warning) - Inline pollReceipt() returning TxReceipt so snatch receipt is reused for log parsing without a second round-trip (warning) - Use ZeroAddress from ethers instead of manual constant (info) - Add comment on fromBlock '0x0' genesis-scan caveat (info) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-09 02:48:51 +00:00
openhands	fd44fa0bcf	fix: feat: RPC-only staking helpers for red-team agent (#518 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-09 02:06:41 +00:00
openhands	e01ef23560	fix: feat: Anvil snapshot/revert and ethPerToken helpers (#519 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-09 01:23:36 +00:00
openhands	866474510b	fix: feat: Anvil snapshot/revert and ethPerToken helpers (#519 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-09 00:53:17 +00:00
openhands	b2db3c7ae5	fix: feat: Anvil snapshot/revert and ethPerToken helpers (#519 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-09 00:12:49 +00:00
openhands	962b126e8b	fix: networkidle still used in stake.ts login flow (#501 ) Replace waitForLoadState('networkidle') in the post-login redirect with waitForURL('/app/stake'). Persistent WebSocket connections prevent networkidle from ever firing, mirroring the same fix applied to navigate.ts. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-06 16:11:24 +00:00
openhands	c943db379f	fix: wait_healthy does not fail fast when a service exits or crashes during the health-check window (#387 )	2026-03-06 11:20:54 +00:00
johba	a937f1cb4c	docs: scope engineering principles to infra/tests, not frontend polling (#470 ) Clarifies that the event-driven engineering principles apply to infrastructure (Docker, scripts, startup/teardown) and test/scenario execution — NOT frontend HTTP API polling. Frontend polling (e.g. LiveStats → Ponder GraphQL every 30s) is fine. The scalability solution is caching at the proxy layer (`Cache-Control` headers via Caddy), not WebSocket subscriptions. Relates to #447 Co-authored-by: openhands <openhands@all-hands.dev> Reviewed-on: https://codeberg.org/johba/harb/pulls/470	2026-03-06 11:17:50 +01:00
openhands	d0e651ffc9	fix: sellAllKrk uses amountOutMinimum: 0n with no throw on 0 output (#450 ) Replace console.warn with a thrown Error when wethReceived <= 0n so any caller without a return-value check is protected, not just always-leave.spec.ts. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-06 01:51:23 +00:00
openhands	4e6182acc6	fix: evaluator: add stakeKrk and unstakeKrk browser helpers (#460 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-05 15:09:54 +00:00
openhands	b7bbbb9b89	fix: evaluator: add stakeKrk and unstakeKrk browser helpers (#460 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-05 14:37:56 +00:00
openhands	c891b3c617	fix: address review findings in sellKrk helper - Fix max-button race: wait for input to be non-empty after clicking Max (setMax is async, composable calls loadKrkBalance() before setting value) - Add on-chain confirmation via WETH Transfer event polling (mirrors buyKrk) so balance query happens after the swap is mined, not just UI-idle - Use Pick<SellConfig, 'rpcUrl' \| 'accountAddress'> since krkAddress is unused - Add page heading assertion after navigate (consistent with buyKrk) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-05 13:58:14 +00:00
openhands	61a9fd7e58	fix: evaluator: add sellKrk browser helper (uses sell widget from #456 ) (#461 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-05 13:13:04 +00:00
openhands	9cda5beb4a	fix: address review findings in market and recenter helpers - recenter.ts: parse isUp from Recentered event logs instead of a follow-up eth_call that would decode wrong post-recenter state - recenter.ts: remove hardcoded private key from comment; add blocks>0 guard in mineBlocks; call provider.destroy() to prevent leaked intervals - market.ts: snapshot KRK balance before buy to compute krkBought as delta instead of cumulative total; call provider.destroy() on exit; remove unused withdraw entry from WETH_ABI Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-05 11:27:31 +00:00
openhands	1973ccf25b	fix: evaluator: add market simulation and recenter helpers (#455 ) - Export waitForReceipt from swap.ts so market.ts and recenter.ts can reuse it - Add market.ts with roundTripSwap: direct-RPC buy+sell round-trip using ethers Wallet - Add recenter.ts with triggerRecenter (calls LiquidityManager.recenter()) and mineBlocks (anvil_mine) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-05 10:52:28 +00:00
openhands	cd459bb9b0	fix: correct buyKrk call sites for new opts param, add eslint-disable for polling loop - no-dilution.spec.ts: pass undefined for opts, screenshotPrefix as 4th arg - swap.ts: add eslint-disable-next-line for eth_getFilterLogs polling delay	2026-03-05 05:53:19 +00:00
openhands	f214ac8587	fix: address PR #437 review findings - Fix price impact formula: (10000n - ...) instead of (1n - ...) - Extract ETH_AMOUNT constant in always-leave to avoid duplication - Add screenshotPrefix param to buyKrk for unique screenshot paths	2026-03-05 05:51:08 +00:00
openhands	c25c757024	feat(holdout): add passive-confidence/no-dilution scenario Verifies that passive holders are not diluted when new buyers enter. - Two wallets (Anvil accounts 4 & 5) buy KRK sequentially - First buyer's balance must remain unchanged after second buy - Second buyer receives fewer tokens per ETH due to AMM price impact - Tests core protocol invariant: holding KRK does not dilute position	2026-03-05 05:51:03 +00:00
openhands	f6fe37dcc0	fix: address PR #438 review findings - Fix HOLDOUT_SCENARIOS_DIR to use absolute path (resolves Playwright testDir issue) - Remove dead SCENARIOS_DIR variable - Replace fallback with explicit error in holdout.config.ts - Add SSH key requirement comment	2026-03-04 08:20:11 +00:00
openhands	69f6a87e20	Move holdout scenarios to separate repo - Updated holdout.config.ts to use HOLDOUT_SCENARIOS_DIR env var - Modified evaluate.sh to clone harb-holdout-scenarios repo at runtime - Deleted scripts/harb-evaluator/scenarios/ directory - Added .holdout-scenarios/ to .gitignore - Holdout scenarios are now cloned into .holdout-scenarios/ during evaluation - This prevents dev-agent from seeing the holdout test set	2026-03-04 08:20:11 +00:00
johba	b2594a28b3	Merge pull request 'fix: lint: Ban waitForTimeout, setTimeout-as-delay, and fixed sleep patterns (#442 )' (#443 ) from fix/issue-442 into master	2026-03-03 23:37:46 +01:00
johba	05191bb15f	Merge pull request 'feat(holdout): Add reasonable slippage assertion to always-leave scenario' (#436 ) from fix/holdout-slippage-check into master	2026-03-03 22:41:23 +01:00
johba	16abdcbefb	fix: add RPC propagation delay after browser-initiated swap (#434 ) After `buyKrk()` completes (swap widget returns to idle), the Anvil RPC may briefly return stale balance state. Adds 2s delay to ensure `getKrkBalance` reads post-swap state. Without this fix, the holdout scenario reports identical KRK balance before and after swap despite the transaction succeeding (success toast visible). Co-authored-by: openhands <openhands@all-hands.dev> Reviewed-on: https://codeberg.org/johba/harb/pulls/434	2026-03-03 22:20:02 +01:00
openhands	748557bc83	fix: lint: Ban waitForTimeout, setTimeout-as-delay, and fixed sleep patterns (#442 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-03 20:58:01 +00:00
openhands	74a043262d	feat(holdout): Add reasonable slippage assertion to always-leave scenario - Modified sellAllKrk helper to return WETH delta received - Added assertion: WETH received >= 90% of ETH spent (0.09 ETH minimum) - Added log showing actual slippage percentage - This proves 'always leave' with reasonable slippage, not just exit ability	2026-03-03 19:45:46 +00:00

1 2

60 commits