johba/harb - Forgejo: Beyond coding. We forge.

johba/harb

Author	SHA1	Message	Date
openhands	acda1f72bb	fix: add sleep before continue in stale-patch error path to avoid busy loop (#866 ) When git apply --check fails, the daemon now sleeps 300s before retrying, preventing a tight busy loop that would hammer the git remote indefinitely. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 18:49:23 +00:00
openhands	57b83b6fe9	fix: evolution.patch has no apply-validation step in CI or evolve.sh (#866 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 18:49:23 +00:00
openhands	7949640b04	fix: feat: LLM seed — Balanced Adaptive optimizer (#676 ) Add llm_balanced.push3: arithmetic-only optimizer that keeps all outputs in a balanced mid-range. anchorShare=40-60% (linear with percentageStaked), anchorWidth=10-200 ticks (linear with taxRate), discoveryDepth=30-50% (linear with percentageStaked), ci=0. No EXEC.IF branches — all transitions via multiplication and division. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-18 14:10:36 +00:00
openhands	26df0a15dc	fix: evo_run004_champion fitness also stale after #655 (#847 ) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-18 00:17:46 +00:00
openhands	a23064f576	fix: batch-eval.sh aborts entire generation on single candidate compile failure (#901 ) - Add skip_candidate() helper that emits fitness=0 JSON to stdout and tracks the failed score for the output-dir file, satisfying the downstream scorer's expectation of one JSON line per candidate. - Unify all failure paths (transpile, forge build, bytecode extract, empty bytecode) through skip_candidate() with a distinct error key. - Log message now reads "WARNING: <id> compile failed — scoring as 0" as required by the acceptance criteria. - Output-dir scores.jsonl now merges successful + failed scores so the file is complete even when some candidates fail to compile. - All-candidates-fail path (COMPILED_COUNT=0) still exits 2 (no viable population); true infra errors (missing tool, bad RPC) unchanged. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-17 06:09:18 +00:00
openhands	777bec8563	fix: evolution.patch references removed LiquidityManager constant (pre-existing structural debt) (#842 ) Extend the patch to also replace the NatSpec comments above MAX_ANCHOR_WIDTH, which became misleading after switching to type(uint24).max. The old comments claimed overflow-safety ("fits in int24"); the new comments document that the production cap is 1233, that values above 123358 overflow int24 and revert, and that this is tolerable in the evolution context where reverts score zero fitness. The patch now correctly updates both the constant and its documentation. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-16 09:20:41 +00:00
openhands	c0fa8c064f	fix: evolution.patch references removed LiquidityManager constant (pre-existing structural debt) (#842 ) Regenerate evolution.patch from the current ThreePositionStrategy.sol. The old patch had a corrupt hunk header (@@ -33,7 +33,7 @@ claiming 7 lines but only supplying 4) and placeholder index hashes (0000000..0000000), causing `git apply` to reject it with "corrupt patch". MAX_ANCHOR_WIDTH still exists in the file at value 1233; the patch correctly overrides it to type(uint24).max for unbounded evolution runs. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-16 08:53:33 +00:00
openhands	79bcb81b81	fix: Fitness re-evaluation for fixed evo_run007_champion (#811 ) Null out the stale fitness score (7116531284966772550194) for evo_run007_champion.push3, which was recorded against the buggy processExecIf interpreter (pre-#655 fix). Setting fitness to null marks the entry for re-scoring by evaluate-seeds.sh once a valid ANVIL_FORK_URL is available. Updated the note field to document why the fitness was cleared. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-15 23:21:04 +00:00
openhands	aa274fd8ed	fix: address review findings for anchorWidth guard (#817 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-15 22:04:13 +00:00
johba	ff86b3691d	chore: extract shared inject.sh, add red-team-sweep.sh (#806 ) ## What - `tools/push3-transpiler/inject.sh` — shared transpile+inject logic used by both batch-eval and red-team-sweep - `batch-eval.sh` — replaced inline 60-line Python block with `inject.sh` call - `scripts/harb-evaluator/red-team-sweep.sh` — red-teams each kindergarten seed using existing `red-team.sh`, with random smoke test gate ## Why Sweep script kept breaking because I rewrote the injection logic instead of reusing batch-eval's proven Python. Now there's one copy. ## Testing - inject.sh tested manually on DO box with optimizer_v3 seed - Smoke test picks random seed, injects + compiles before starting sweep Co-authored-by: openhands <openhands@all-hands.dev> Reviewed-on: https://codeberg.org/johba/harb/pulls/806 Reviewed-by: review_bot <review_bot@noreply.codeberg.org>	2026-03-15 10:24:03 +01:00
openhands	d8a109baf8	fix: evo_run007_champion.push3 always returns fixed params regardless of staking (#791 ) Replace the broken EXEC.IF branches where TRUE was ( ) and FALSE was 0 DYADIC.POP, causing the trailing push sequence to execute unconditionally. Now EXEC.IF correctly branches on STAKED > 88%: - TRUE (staked > 88%): bear defaults ( 0 0 0 0 ) — CI=0, AW=0, AS=0, DD=0 - FALSE (staked ≤ 88%): ( 200000000000000000 153 200000000000000000 0 ) — CI=0, AW=153, AS=20%, DD=20% Also correct the manifest.jsonl run 7 note which had CI and DD inverted (CI=20%/DD=0 → CI=0/DD=20%). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-15 07:30:45 +00:00
openhands	70ef0eb1bc	fix: Old-format CIDs are warned but still silently dropped from the pool (#801 ) - Change WARNING to explicitly state "legacy CID format ... migration not supported, skipping" - Expand comment near the startswith('candidate_') guard to document the CID format contract and explain why re-admission is intentionally out of scope (no surviving generation_N.jsonl files from runs 1-6 exist in the repo) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-15 06:17:12 +00:00
openhands	4a47e8e2d1	fix: evolve.sh does not write \`note\` field — schema drift between hand-written and evolved entries (#719 ) - Pass seed basename into the admission Python block as argv[7] - Add \`note\` field to every new evolved entry: "Evolved from <seed> (run<N> gen<G>)" - Add migration comment noting entries admitted before this fix may have note: null Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-15 04:57:58 +00:00
openhands	6694b2daa8	fix: CID format change silently drops historical generation JSONL on re-admission (#757 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-15 04:27:38 +00:00
openhands	2aad9e98f1	fix: manifest.jsonl schema has no canonical machine-readable definition (#720 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-15 03:57:31 +00:00
openhands	c508efa31f	fix: address review findings for evaluate-seeds.sh (#724 ) - Replace unquoted heredoc (shell-injection path) with a temp file: the shell loop now appends tab-separated filename/score lines to a temp file, which is passed as a plain path argument to the Python manifest- rewrite block. Python reads only file contents, never executes shell- expanded strings. - Add early abort on fitness.sh exit code 2 (infra error: Anvil down, missing tool). Iterating past an infra failure produces no useful results; aborting immediately surfaces the real problem. - Remove unused `os` import from the manifest-rewrite Python block. - Fix inaccurate comment in evolve.sh --diverse-seeds sampling: the pool sampler does a flat random shuffle with no fitness weighting; null- fitness seeds are not "treated as 0" — they are sampled with equal probability to any other seed. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-15 03:29:47 +00:00
openhands	cb6e6708b6	fix: \`llm\`-origin entries in manifest have null fitness and no evaluation path (#724 ) - Add evaluate-seeds.sh: standalone script that reads manifest.jsonl, finds every entry with fitness: null, runs fitness.sh against each seed file, and atomically writes results back to manifest.jsonl. Supports --dry-run to preview without evaluating. - Add comment to --diverse-seeds sampling in evolve.sh documenting that null-fitness seeds are included with effective_fitness=0 and that evaluate-seeds.sh should be run to score them. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-15 03:08:29 +00:00
openhands	273615cfed	fix: No generic flag dispatch: only \`token_value_inflation\` is ever zero-rated (#723 ) Define ZERO_RATED_FLAGS set near effective_fitness and check each flag with any(...in flags...) instead of a single hard-coded substring test. token_value_inflation behaviour is preserved; new flags can be added to the set without touching the dispatch logic. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-15 02:36:57 +00:00
openhands	7930770570	fix: feat: add evolution run 8 champion to seed pool (#781 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-15 01:31:06 +00:00
openhands	56aedfae49	fix: feat: add evolution run 8 champion to seed pool (#781 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-15 01:07:12 +00:00
openhands	7a9b4206ae	fix: llm_contrarian.push3 AW=150/250 clamped to 100 — three rounds unaddressed (#756 ) Replace AW=250 (VERY AGGRESSIVE) with 100 and AW=150 (AGGRESSIVE) with 80 so neither value is silently clamped by LiquidityManager.MAX_ANCHOR_WIDTH=100. Update header comment block to match the corrected values. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 21:40:31 +00:00
openhands	17c904aaa3	fix: batch-eval.sh MANIFEST_DIR (mktemp -d) has no cleanup trap (#763 )	2026-03-14 19:46:50 +00:00
openhands	ab40930812	fix: fitness.sh individual-scoring path still silences errors (#766 )	2026-03-14 19:07:17 +00:00
openhands	524a05286e	fix: address review feedback on evolution-daemon.sh (#748 ) - Stream evolve.sh output directly to stderr instead of buffering in a command substitution; long runs (tens of minutes) are now visible live. - Use an array (EVOLVE_ARGS) for evolve.sh arguments instead of an unquoted DIVERSE_FLAG string variable. - Abort the current run (continue to next loop iteration) when the patch fails to apply, rather than silently running with wrong evaluation semantics. - Fix notify() to pass the message via stdin to avoid SSH single-quote interpolation breakage on messages containing special characters. - Fix step comment/counter mismatch: "Step 7" comment now reads "Step 6" to match the [6/7] log label for the summary-write step. - Clarify in evolution.conf that GAS_LIMIT and ANCHOR_WIDTH_UNBOUNDED are documentation-only (they document what evolution.patch does); editing them has no runtime effect. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 17:39:20 +00:00
openhands	bbf3b871b3	fix: feat: evolution-daemon.sh — perpetual evolution loop on DO box (#748 ) - Add tools/push3-evolution/evolution-daemon.sh: single-command daemon that runs git-pull → apply-patch → clean-tmpdirs → evolve.sh → summary → notify → revert-patch → loop, handling SIGINT/SIGTERM cleanly. - Add tools/push3-evolution/evolution.conf: config file (EVAL_MODE, BASE_RPC_URL, POPULATION=20, GENERATIONS=30, MUTATION_RATE=1, ELITES=2, DIVERSE_SEEDS=true, GAS_LIMIT=500000, ANCHOR_WIDTH_UNBOUNDED=true). - Add tools/push3-evolution/evolution.patch: overrides CALCULATE_PARAMS_GAS_LIMIT 200k→500k in Optimizer.sol + FitnessEvaluator.t.sol, and removes MAX_ANCHOR_WIDTH=100 cap in LiquidityManager.sol for unbounded AW exploration. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 17:21:51 +00:00
openhands	f355974cc8	fix: fix: evolve.sh silences all batch-eval errors with 2>/dev/null (#749 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 16:51:04 +00:00
openhands	89a9d3e575	fix: fix: evolve.sh silences all batch-eval errors with 2>/dev/null (#749 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 16:27:09 +00:00
johba	cf94d4c342	Merge pull request 'fix: fix: evolve.sh stale tmpdirs break subsequent runs (#750 )' (#762 ) from fix/issue-750 into master	2026-03-14 17:19:42 +01:00
openhands	b168a05930	fix: fix: evolve.sh stale tmpdirs break subsequent runs (#750 ) Replace `mktemp -d` with a fixed working directory `evolved/.work/` that is wiped at startup. Stale `/tmp/tmp.*` directories from killed runs can no longer interfere with batch-eval.sh path resolution. Run outputs are already preserved in `evolved/run_NNN/` before the work dir is cleaned. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 15:48:07 +00:00
openhands	564f0c5c69	fix: add fitness_flags to evo_run004, fix run007 note accuracy - evo_run004_champion: added missing fitness_flags field - evo_run007_champion: clarified both branches (staked<=88% vs >88%)	2026-03-14 15:43:58 +00:00
openhands	37ecf413d8	fix: resolve manifest.jsonl conflict markers	2026-03-14 15:43:58 +00:00
openhands	648e247ce3	feat: add run 7 champion to kindergarten evo_run007_champion: fitness 7.117e21, anchorWidth=153 (unbounded), discoveryDepth=0. Simplified to single percentageStaked>88% threshold. Evolved under IL crystallization attack pressure.	2026-03-14 15:43:58 +00:00
openhands	34f142ae17	feat: add run7 evolution champion to seed pool	2026-03-14 15:43:58 +00:00
openhands	5f7d002e2a	feat: add recovered LLM seeds (floor hugger + contrarian) Recovered from reflog after rebase accident destroyed PRs #692, #699. Balanced Adaptive (#688) was garbage collected — will be regenerated. Kindergarten (#683) needs fresh implementation due to evolve.sh conflicts. Closes #672, #675.	2026-03-14 15:43:58 +00:00
openhands	fafe317fa5	fix: feat: LLM seed — Defensive Floor Hugger optimizer (#672 ) Add llm_floor_hugger.push3: pure-constant Push3 optimizer that keeps anchorShare=0.05e18, anchorWidth=5 ticks, discoveryDepth=0.05e18, CI=0. Ignores all staking/tax inputs — floor position is always maximised. Transpiles without error; manifest.jsonl updated. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 12:48:22 +00:00
openhands	cd86774ac8	fix: address review findings for #751 — STATE.md and script header docs Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 12:17:23 +00:00
openhands	83ab1683f5	fix: fix: EVAL_MODE defaults to anvil — should default to revm (#751 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 11:56:52 +00:00
openhands	266500fde1	fix: address review findings for #752 — regex and STATE.md cleanup - Fix run_NNN scan regex: r'run(\d+)' → r'run_(\d+)' so it correctly matches the underscore-separated directory names the script creates (previously always resolved to 001, overwriting the same dir each run) - Remove [in-progress] tag from STATE.md entry for #752 Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 11:27:53 +00:00
openhands	b5bf53b010	fix: feat: evolve.sh auto-incrementing per-run results directory (#752 ) - --output now accepts a base dir (default: evolved/) instead of requiring an explicit path each run - On each invocation, scan base dir for existing run_NNN/ subdirectories, find the highest N, and create run_(N+1)/ for this run's outputs - All generation JSONL files, best.push3, diff.txt, and evolution.log are written to the new run dir — previous runs are never overwritten - Log header now shows both Base dir and Output (run dir) for clarity Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 11:08:04 +00:00
openhands	b6c07b1d93	fix: generation_N.jsonl candidate_id format mismatch vs filenames (#669 )	2026-03-14 04:27:59 +00:00
openhands	0aa819f168	fix: generation_N.jsonl candidate_id format mismatch vs filenames (#669 )	2026-03-14 04:07:00 +00:00
openhands	958b8cfaa0	fix: batch-eval.sh header comment claims wrong candidate_id format (#668 )	2026-03-14 03:36:43 +00:00
openhands	c42a1ca768	fix: evo_run004_champion fitness inflated by token value (#670 ) (#704 ) - Add fitness_flags="token_value_inflation" to evo_run004_champion in manifest.jsonl so callers can detect the inflated value without discarding the entry entirely. - Add effective_fitness() helper in evolve.sh pool admission (step 5) that returns 0 for any entry with a token_value_inflation flag, preventing inflated scores from biasing the top-100 evolved pool ranking or eviction decisions. - Document in evolve.sh that raw fitness values are only comparable within the same evaluation run. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-14 01:08:13 +00:00
openhands	b7d0b63ca1	fix: int(e.get('fitness', 0)) crashes on null-fitness manifest entries (#711 )	2026-03-13 23:37:00 +00:00
johba	0c4cd23dfa	fix: feat: Seed kindergarten — persistent top-100 candidate pool (#667 ) (#683 ) Fixes #667 ## Changes ## Summary Implemented persistent top-100 candidate pool in `tools/push3-evolution/evolve.sh`: ### Changes `--run-id <N>` flag (line 96) - Optional integer; auto-increments from highest `run` field in `manifest.jsonl` when omitted - Zero-padded to 3 digits (`001`, `002`, …) Seeds pool constants (after path canonicalization) - `SEEDS_DIR` → `$SCRIPT_DIR/seeds/` - `POOL_MANIFEST` → `seeds/manifest.jsonl` - `ADMISSION_THRESHOLD` → `6000000000000000000000` (6e21 wei) `--diverse-seeds` mode now has two paths: 1. Pool mode (pool non-empty): random-shuffles the pool and takes up to `POPULATION` candidates — real evolved diversity, not parametric clones 2. Fallback (pool empty): original `seed-gen-cli` parametric variant behavior - Both paths fall back to mutating `--seed` to fill any shortfall Step 5 — End-of-run admission (after the diff step): 1. Scans all `generation_*.jsonl` in `OUTPUT_DIR` for candidates with `fitness ≥ 6e21` 2. Maps `candidate_id` (e.g. `gen2_c005`) back to `.push3` files in `WORK_DIR` (still exists since cleanup fires on EXIT) 3. Deduplicates by SHA-256 content hash against existing pool 4. Names new files `run{RUN_ID}_gen{N}_c{MMM}.push3` 5. Merges with existing pool, sorts by fitness descending, keeps top 100 6. Copies admitted files to `seeds/`, removes evicted evolved files (never hand-written), rewrites `manifest.jsonl` Co-authored-by: openhands <openhands@all-hands.dev> Reviewed-on: https://codeberg.org/johba/harb/pulls/683 Reviewed-by: review_bot <review_bot@noreply.codeberg.org>	2026-03-13 20:45:03 +01:00
johba	e230a62e11	fix: feat: LLM seed — Fee Maximizer optimizer (#673 ) (#696 ) Fixes #673 ## Changes Done. Here's a summary of what was implemented: ## Changes `tools/push3-evolution/seeds/llm_fee_maximizer.push3` (new file) A Push3 optimizer seed implementing the "every trade pays us" philosophy. It reads `percentageStaked` (slot 0) and `averageTaxRate` (slot 1), then branches on two thresholds: - Staking threshold: 60% (bullish vs. neutral sentiment) - Tax threshold: 10% of 1e18 (high vs. low swap volume) Strategy matrix: \| \| tax < 10% \| tax ≥ 10% \| \|---\|---\|---\| \| staked < 60% \| AS=0.70, AW=60, DD=0.50 \| AS=0.80, AW=80, DD=0.60 \| \| staked ≥ 60% \| AS=0.90, AW=40, DD=0.80 \| AS=0.95, AW=50, DD=0.90 \| CI is always 0. Anchor share is always ≥ 0.70e18 (capital stays in fee-earning zone). High staking shifts discovery depth up; high tax widens the anchor to capture more swap volume. `tools/push3-evolution/seeds/manifest.jsonl` — new entry for `llm_fee_maximizer.push3` with `origin=llm`. Transpiled successfully: 48-line Solidity function body, outputs correctly bound to `ci`, `anchorShare`, `anchorWidth`, `discoveryDepth`. Co-authored-by: openhands <openhands@all-hands.dev> Reviewed-on: https://codeberg.org/johba/harb/pulls/696 Reviewed-by: review_bot <review_bot@noreply.codeberg.org>	2026-03-13 19:56:39 +01:00
openhands	24c4e94a6b	fix: feat: LLM seed — Momentum Follower optimizer (#674 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-13 14:50:26 +00:00
openhands	c7196aa2b0	feat: seed kindergarten — initial population with OG + first evolution champion seeds/optimizer_v3.push3 — hand-written original (8.26e21) seeds/evo_run004_champion.push3 — first evolution winner (2.31e24, gen 3) seeds/manifest.jsonl — fitness + provenance tracking Note: champion fitness inflated by token value (#670). Strategy is 'always bull' — wide positions, max anchor share. Pending ETH-only metric fix to validate.	2026-03-13 10:32:35 +00:00
johba	3f435f8459	fix: evolution scoring — 3 bugs made all candidates report fitness=0 (#665 ) ## Three bugs in evolve.sh 1. Heredoc stdin conflict — `py_stats()` used `<<PYEOF` heredoc which stole stdin from the pipe, so python never received score values → stats always `min=0 max=0 mean=0` 2. Bash integer overflow — global best comparison used `[ $MAX -gt $GLOBAL_BEST_FITNESS ]` which overflows on uint256 wei values (>9.2e18) → best always tracked as 0 3. candidate_id mismatch — evolve.sh looked up `gen0_c000` but batch-eval produces `candidate_000` (derived from filename) → score lookup always returned default 0 All 3 previous evolution runs (150+ candidates) reported all zeros despite batch-eval correctly scoring them at ~8.26e21 wei. ## Fix - `py_stats`: heredoc → `python3 -c` inline - Global best: bash `[ -gt ]` → `python3` big number comparison - Score lookup: use `basename $CAND_FILE` instead of synthetic CID Co-authored-by: root <root@debian-g-2vcpu-8gb-ams3-01> Reviewed-on: https://codeberg.org/johba/harb/pulls/665 Reviewed-by: review_bot <review_bot@noreply.codeberg.org>	2026-03-13 10:02:24 +01:00
openhands	f8b765a9f8	fix: feat: Push3 evolution — crossover operator (#639 ) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-13 05:54:48 +00:00

1 2

64 commits