johba/harb - Forgejo: Beyond coding. We forge.

johba/harb

Author	SHA1	Message	Date
openhands	5d369cfab6	fix: address review findings for gas-limit fitness pressure (#637 ) - Optimizer.sol: move CALCULATE_PARAMS_GAS_LIMIT constant to top of contract (after error declaration) to avoid mid-contract placement. Expand natspec with EIP-150 63/64 note: callers need ~203 175 gas to deliver the full 200 000 budget to the inner staticcall. - Optimizer.sol: add ret.length < 128 guard before abi.decode in getLiquidityParams(). Malformed return data (truncated / wrong ABI) from an evolved program now falls back to _bearDefaults() instead of propagating an unhandled revert. The 128-byte minimum is the ABI encoding of (uint256, uint256, uint24, uint256) — four 32-byte slots. - Optimizer.sol: add cross-reference comment to _bearDefaults() noting that its values must stay in sync with LiquidityManager.recenter()'s catch block to prevent silent divergence. - FitnessEvaluator.t.sol: add CALCULATE_PARAMS_GAS_LIMIT mirror constant (must match Optimizer.sol). Disqualify candidates whose measured gas exceeds the production cap with fitness=0 and error="gas_over_limit" — prevents the pipeline from selecting programs that are functionally dead on-chain (would always produce bear defaults in production). - batch-eval.sh: update output format comment to document the gas_used field and over-gas-limit error object added by this feature. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-13 01:05:37 +00:00
openhands	64f1af3041	fix: feat: Push3 evolution — elitism (top N survive unchanged) (#640 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-12 22:29:23 +00:00
openhands	87bb5859e2	fix: revm evaluator — UUPS bypass, deployedBytecode, graceful attack ops - Skip UUPS upgradeTo: etch + vm.store ERC1967 implementation slot directly (OptimizerV3Push3 is standalone, no UUPS inheritance needed for evolution) - Use deployedBytecode (runtime) instead of bytecode (creation) for vm.etch - Inject transpiled body into OptimizerV3.sol (has getLiquidityParams via Optimizer) instead of using standalone OptimizerV3Push3.sol - Wrap buy/sell/stake/unstake in try/catch — attack ops should not abort the batch - Add /tmp read to fs_permissions for batch-eval manifest files - Bootstrap recenter returns bool instead of reverting (soft-fail per candidate)	2026-03-12 19:54:58 +00:00
openhands	403a304c98	fix: tighten stack-depth guard to !== 4 to catch overflow (#584 ) Reviewer noted that `< 4` only catches underflow; programs leaving 5+ values on the DYADIC stack silently passed isValid(). Change the guard to `!== 4` so both under- and overflow are rejected, matching the documented 'exactly 4 outputs' contract. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-12 14:48:15 +00:00
openhands	e770191e56	fix: transpile() does not throw on <4 stack outputs (#584 ) Replace silent ?? '0' fallbacks with an explicit length check that throws when the DYADIC stack holds fewer than 4 values at program termination. isValid() in the evolution pipeline now correctly rejects underflow programs instead of silently scoring them as valid with zeroed outputs. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-12 14:07:32 +00:00
openhands	26b8876691	fix: feat: revm-based fitness evaluator for evolution at scale (#604 ) Replace per-candidate Anvil+forge-script pipeline with in-process EVM execution using Foundry's native revm backend, achieving 10-100× speedup for evolutionary search at scale. New files: - onchain/test/FitnessEvaluator.t.sol — Forge test that forks Base once, deploys the full KRAIKEN stack, then for each candidate uses vm.etch to inject the compiled optimizer bytecode, UUPS-upgrades the proxy, runs all attack sequences with in-memory vm.snapshot/revertTo (no RPC overhead), and emits one {"candidate_id","fitness"} JSON line per candidate. Skips gracefully when BASE_RPC_URL is unset (CI-safe). - tools/push3-evolution/revm-evaluator/batch-eval.sh — Wrapper that transpiles+compiles each candidate sequentially, writes a two-file manifest (ids.txt + bytecodes.txt), then invokes FitnessEvaluator.t.sol in a single forge test run and parses the score JSON from stdout. Modified: - tools/push3-evolution/evolve.sh — Adds EVAL_MODE env var (anvil\|revm). When EVAL_MODE=revm, batch-scores every candidate in a generation with one batch-eval.sh call instead of N sequential fitness.sh processes; scores are looked up from the JSONL output in the per-candidate loop. Default remains EVAL_MODE=anvil for backward compatibility. Key design decisions: - Per-candidate Solidity compilation is unavoidable (each Push3 candidate produces different Solidity); the speedup is in the evaluation phase. - vm.snapshot/revertTo in forge test are O(1) memory operations (true revm), not RPC calls — this is the core speedup vs Anvil. - recenterAccess is set in bootstrap so TWAP stability checks are bypassed during attack sequences (mirrors the existing fitness.sh bootstrap). - Test skips cleanly when BASE_RPC_URL is absent, keeping CI green. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-12 11:54:41 +00:00
openhands	ade7e2033a	fix: Evolution pipeline UUPS upgrade + Foundry PATH (#593 ) - Add virtual to Optimizer.calculateParams() for UUPS override - Create OptimizerV3.sol: UUPS-upgradeable optimizer with transpiled Push3 logic - Update deploy-optimizer.sh to deploy OptimizerV3 instead of Optimizer - Add ~/.foundry/bin to PATH in evolve.sh, fitness.sh, deploy-optimizer.sh	2026-03-12 06:47:35 +00:00
openhands	0496c94681	fix: address review findings in evolve.sh (#546 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-11 22:06:18 +00:00
openhands	2ee7feb621	fix: address review findings in evolve.sh (#546 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-11 21:29:14 +00:00
openhands	547e8beae8	fix: Push3 evolution: selection loop orchestrator (#546 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-11 20:56:19 +00:00
openhands	4564637f85	fix: Push3 evolution: fitness scoring wrapper (transpile → deploy → attack → score) (#545 ) Address round-2 review findings: - Move BASELINE_SNAP before deploy-optimizer.sh so cleanup fully reverts the deploy on a shared Anvil; fixes nonce/address collision when a second sequential evaluation reuses the same chain - Revert deploy output to capture-and-suppress on success / surface on failure; removes per-candidate stderr noise in evolution loop batch runs - Fix cast rpc anvil_mine arg order to match all other cast rpc calls in script Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-11 20:16:54 +00:00
openhands	0f91234dbe	fix: Push3 evolution: fitness scoring wrapper (transpile → deploy → attack → score) (#545 ) Address review findings: - Bug: add BASELINE_SNAP before bootstrap; cleanup reverts it on shared Anvil to undo setRecenterAccess/WETH-funding/recenter mutations (was dead code before) - Bug: require ANVIL_FORK_URL when cold-starting Anvil — DeployLocal.sol needs live Base contracts (Uniswap V3 Factory, WETH) that don't exist on a plain fork - Warning: flag DIRTY and emit warning when anvil_revert fails instead of \|\| true - Warning: tee deploy-optimizer.sh output to both log file and stderr so progress is visible and preserved for post-failure diagnosis - Nit: replace 50×evm_mine loop with single anvil_mine 0x32 (49 fewer RTTs) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-11 19:41:06 +00:00
openhands	a8db761de8	fix: Push3 evolution: fitness scoring wrapper (transpile → deploy → attack → score) (#545 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-11 19:02:00 +00:00
openhands	a6b64d3219	fix: Push3 evolution: mutation operators for bytecode programs (#544 ) Implements the five Push3 mutation operators and the meta-operator for the optimizer evolution pipeline: - mutateConstant: shifts a random integer literal by ±δ (clamped to 0) - swapOperator: swaps ADD↔SUB, MUL↔DIV, GT↔LT, GTE↔LTE - deleteInstruction: removes a random non-EXEC.IF instr; validates result - insertInstruction: inserts stack-neutral pair (push 0 + DYADIC.POP) - crossover: single-point crossover of two programs at instruction boundaries - mutate: applies N random mutations from the four single-program operators All mutations validate output via transpile() symbolic stack simulation. Invalid mutations silently return the original program. 35 unit tests cover all operators, edge cases (empty program, single instruction, deep stack), and the acceptance criterion that mutate(optimizer_v3, 3) produces ≥10 distinct valid variants in 20 trials. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-11 16:24:24 +00:00
openhands	5d204e5649	fix: Push3 optimizer: dyadic rational input interface (8 slots) + 4-output redesign (#548 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-11 15:23:36 +00:00
openhands	3244c0a975	fix: Unified Push3 → deploy pipeline: transpile, compile, upgrade in one command (#538 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-10 20:21:54 +00:00
openhands	5e8a94b7a9	feat: Push3 → Solidity transpiler + OptimizerV3 port	2026-02-23 14:47:38 +00:00

17 commits