Files

Motiejus bcf6dcdf71 stage0: add analyzeMemoizedStateC during module loading + fix CG builtin ns

Three fixes toward closing the 64-entry IP gap for return_integer.zig:

1. Call analyzeMemoizedStateC() after the full module chain is loaded.
   This creates CallingConvention, Signedness, AddressSpace, and other
   builtin type entries that the Zig compiler creates during its
   analyzeMemoizedState(.main) call chain.

2. Fix ensureNavValUpToDate CG builtin namespace collision: when
   std/builtin.zig and the CG builtin module share the same namespace
   index, check has_zir to distinguish them. Without this, builtins
   like CallingConvention (which have ZIR in std/builtin.zig) were
   incorrectly routed to the CG builtin resolution path and returned
   IP_INDEX_NONE.

3. Limit memoized state resolution to the first 6 direct std.builtin
   declarations (Signedness through SourceLocation). Skip Type and its
   21 children (indices 15-35) — the C sema's resolveTypeFullyC is too
   aggressive for these complex nested types.

Gap reduced from 64 to 3 entries. Remaining gap is from module chain
entry ordering (C sema creates struct types and ptr_navs in batches,
reference interleaves them as pairs).

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-01 19:24:42 +00:00

sema_tests

stage0: add anti-pattern rule against filling the IP gap dishonestly

2026-03-01 16:40:47 +00:00

.clang-format

Add 'stage0/' from commit 'b3d106ec971300a9c745f4681fab3df7518c4346'

2026-02-13 23:32:08 +02:00

.gitignore

Add 'stage0/' from commit 'b3d106ec971300a9c745f4681fab3df7518c4346'

2026-02-13 23:32:08 +02:00

air.c

stage0: add Sema data structures (Phase A)

2026-02-17 19:42:24 +00:00

air.h

fmt and remove some comments

2026-02-20 12:33:48 +02:00

ast.c

sema: fix memory leaks, shift UB, cppcheck warnings, format

2026-02-21 20:00:39 +00:00

ast.h

stage0: remove GNU C extensions for strict C11 compliance

2026-02-25 22:32:10 +00:00

astgen_test.zig

simplify

2026-02-26 00:45:27 +02:00

astgen.c

cppcheck: remove all suppressions, fix all warnings

2026-02-17 18:13:52 +00:00

astgen.h

Add 'stage0/' from commit 'b3d106ec971300a9c745f4681fab3df7518c4346'

2026-02-13 23:32:08 +02:00

CLAUDE.md

stage0: add analyzeMemoizedStateC during module loading + fix CG builtin ns

2026-03-01 19:24:42 +00:00

common.h

stage0: remove GNU C extensions for strict C11 compliance

2026-02-25 22:32:10 +00:00

corpus.zig

stage0: handle decl_val in param type body, bump num_passing to 5

2026-03-01 16:13:38 +00:00

intern_pool.c

stage0: add wyhash and replace boost hash combine in InternPool

2026-03-01 17:45:50 +00:00

intern_pool.h

sema: remove all type function cheats, replace with honest comptime eval

2026-02-28 14:20:08 +00:00

main.c

stage0: prepare for IP gap closure (return_integer.zig)

2026-03-01 17:22:17 +00:00

parser_test.zig

use C parser in AstGen

2026-02-17 10:56:11 +00:00

parser.c

stage0: remove GNU C extensions for strict C11 compliance

2026-02-25 22:32:10 +00:00

parser.h

unify error handling: SET_ERROR(ctx, msg) for both parser and astgen

2026-02-17 17:02:00 +00:00

plan-demand-driven-modules.md

docs: mark all demand-driven module loading phases complete

2026-02-28 16:56:45 +00:00

README.md

split README/CLAUDE: human docs vs agent instructions

2026-02-25 08:35:27 +00:00

sema_test.zig

sema: analyze all named functions, remove pass 2c deferral

2026-03-01 14:01:41 +00:00

sema.c

stage0: add analyzeMemoizedStateC during module loading + fix CG builtin ns

2026-03-01 19:24:42 +00:00

sema.h

sema: port zirTypeInfo and zirReify for .int case

2026-02-28 19:40:29 +00:00

stages_test.zig

stage0: prepare for IP gap closure (return_integer.zig)

2026-03-01 17:22:17 +00:00

tokenizer_test.zig

Add 'stage0/' from commit 'b3d106ec971300a9c745f4681fab3df7518c4346'

2026-02-13 23:32:08 +02:00

tokenizer.c

stage0: remove GNU C extensions for strict C11 compliance

2026-02-25 22:32:10 +00:00

tokenizer.h

Add 'stage0/' from commit 'b3d106ec971300a9c745f4681fab3df7518c4346'

2026-02-13 23:32:08 +02:00

type.c

stage0: add Sema data structures (Phase A)

2026-02-17 19:42:24 +00:00

type.h

stage0: add Sema data structures (Phase A)

2026-02-17 19:42:24 +00:00

value.c

stage0: add Sema data structures (Phase A)

2026-02-17 19:42:24 +00:00

value.h

stage0: add Sema data structures (Phase A)

2026-02-17 19:42:24 +00:00

verbose_air.c

sema: resolve cross-module inline return types, memoize type functions; enable mul*c3

2026-02-23 03:46:37 +00:00

verbose_air.h

verbose_air: add human-readable AIR printer and --verbose-air CLI flag

2026-02-21 21:17:06 +02:00

verbose_intern_pool.c

sema: add internStrLit, STR handler, Pass 1b ipForceIntern

2026-02-28 00:23:20 +00:00

verbose_intern_pool.h

stage0: add --verbose-intern-pool flag and IP dumper

2026-02-25 20:37:34 +00:00

wyhash_test.zig

stage0: add wyhash and replace boost hash combine in InternPool

2026-03-01 17:45:50 +00:00

wyhash.c

stage0: add wyhash and replace boost hash combine in InternPool

2026-03-01 17:45:50 +00:00

wyhash.h

stage0: add wyhash and replace boost hash combine in InternPool

2026-03-01 17:45:50 +00:00

zig0.c

stage0: prepare for IP gap closure (return_integer.zig)

2026-03-01 17:22:17 +00:00

zir.c

Add 'stage0/' from commit 'b3d106ec971300a9c745f4681fab3df7518c4346'

2026-02-13 23:32:08 +02:00

zir.h

Merge commit '6204bb245b4a05e0f4f00bb48d83b76ebcd899e2' into zig0-0.15.2

2026-02-14 10:05:42 +02:00

README.md

About

zig0 aspires to be an interpreter of zig 0.15.2 written in C.

Except for the lexer (written by hand by yours truly), it's been written by an LLM.

The goal of stage0 is to be able to implement enough zig to be able to build zig1.wasm. For that we need:

Lexer: DONE, written by hand by yours truly in late 2024.
Parser: DONE, written mostly by an LLM.
AstGen: DONE, written fully by an LLM.
Sema: in progress.

Testing

Quick test:

zig build fmt-zig0 test-zig0

Static analysis (takes a while, run separately):

zig build lint-zig0

More elaborate (tries all compilers + static analysis + ReleaseSafe):

zig build all-zig0 -Doptimize=ReleaseSafe

Most elaborate, takes >10m:

zig build all-zig0 -Doptimize=ReleaseSafe -Dvalgrind |& grep -v Warning

Debugging tips

Test runs infinitely? Build the test program executable:

$ zig build test-zig0 -Dzig0-no-exec

And then run it, capturing the stack trace:

gdb -batch \
    -ex "python import threading; threading.Timer(1.0, lambda: gdb.post_event(lambda: gdb.execute('interrupt'))).start()" \
    -ex run \
    -ex "bt full" \
    -ex quit \
    zig-out/bin/test

You are welcome to replace -ex "bt full" with anything other of interest.

Float Handling

Float literals are parsed with strtold() (C11 standard, portable). On x86-64 Linux, long double is 80-bit extended precision (63 fraction bits).

When a float doesn't round-trip through f64, it's emitted as f128 (ZIR float128 instruction). The 80-bit extended value is converted to IEEE 754 binary128 encoding by bit manipulation — both formats share the same 15-bit exponent with bias 16383. The top 63 of binary128's 112 fraction bits come from the 80-bit value; the bottom 49 are zero-padded.

This means float128 literals lose ~49 bits of precision compared to the upstream Zig implementation (which uses native f128). This is acceptable because stage0 is a bootstrap tool — the real Zig compiler re-parses all source with full f128 precision in later stages. The test comparison mask in astgen_test.zig skips float128 payloads to account for this.

Previous approach used __float128/strtof128 (GCC/glibc extensions) for full precision, but these are not portable to TCC and other C11 compilers.