motiejus/zig - zig - gitea: Gitea Service

Author	SHA1	Message	Date
Jacob Young	e046977354	codegen: implement output to the `.debug_info` section	2024-09-10 12:27:57 -04:00
mlugg	5e12ca9fe3	compiler: implement labeled switch/continue	2024-09-01 18:30:31 +01:00
mlugg	5fb4a7df38	Air: add explicit `repeat` instruction to repeat loops This commit introduces a new AIR instruction, `repeat`, which causes control flow to move back to the start of a given AIR loop. `loop` instructions will no longer automatically perform this operation after control flow reaches the end of the body. The motivation for making this change now was really just consistency with the upcoming implementation of #8220: it wouldn't make sense to have this feature work significantly differently. However, there were already some TODOs kicking around which wanted this feature. It's useful for two key reasons: * It allows loops over AIR instruction bodies to loop precisely until they reach a `noreturn` instruction. This allows for tail calling a few things, and avoiding a range check on each iteration of a hot path, plus gives a nice assertion that validates AIR structure a little. This is a very minor benefit, which this commit does apply to the LLVM and C backends. * It should allow for more compact ZIR and AIR to be emitted by having AstGen emit `repeat` instructions more often rather than having `continue` statements `break` to a `block` which is followed by a `repeat`. This is done in status quo because `repeat` instructions only ever cause the direct parent block to repeat. Now that AIR is more flexible, this flexibility can be pretty trivially extended to ZIR, and we can then emit better ZIR. This commit does not implement this. Support for this feature is currently regressed on all self-hosted native backends, including x86_64. This support will be added where necessary before this branch is merged.	2024-09-01 18:30:31 +01:00
mlugg	1b000b90c9	Air: direct representation of ranges in switch cases This commit modifies the representation of the AIR `switch_br` instruction to represent ranges in cases. Previously, Sema emitted different AIR in the case of a range, where the `else` branch of the `switch_br` contained a simple `cond_br` for each such case which did a simple range check (`x > a and x < b`). Not only does this add complexity to Sema, which we would like to minimize, but it also gets in the way of the implementation of #8220. That proposal turns certain `switch` statements into a looping construct, and for optimization purposes, we want to lower this to AIR fairly directly (i.e. without involving a `loop` instruction). That means we would ideally like a single instruction to represent the entire `switch` statement, so that we can dispatch back to it with a different operand as in #8220. This is not really possible to do correctly under the status quo system. This commit implements lowering of this new `switch_br` usage in the LLVM and C backends. The C backend just turns any case containing ranges entirely into conditionals, as before. The LLVM backend is a little smarter, and puts scalar items into the `switch` instruction, only using conditionals for the range cases (which direct to the same bb). All remaining self-hosted backends are temporarily regressed in the presence of switch range cases. This functionality will be restored for at least the x86_64 backend before merge.	2024-09-01 18:30:31 +01:00
mlugg	0fe3fd01dd	std: update `std.builtin.Type` fields to follow naming conventions The compiler actually doesn't need any functional changes for this: Sema does reification based on the tag indices of `std.builtin.Type` already! So, no zig1.wasm update is necessary. This change is necessary to disallow name clashes between fields and decls on a type, which is a prerequisite of #9938.	2024-08-28 08:39:59 +01:00
mlugg	6808ce27bd	compiler,lib,test,langref: migrate `@setCold` to `@branchHint`	2024-08-27 00:44:35 +01:00
mlugg	457c94d353	compiler: implement `@branchHint`, replacing `@setCold` Implements the accepted proposal to introduce `@branchHint`. This builtin is permitted as the first statement of a block if that block is the direct body of any of the following: * a function (not a `test`) * either branch of an `if` * the RHS of a `catch` or `orelse` * a `switch` prong * an `or` or `and` expression It lowers to the ZIR instruction `extended(branch_hint(...))`. When Sema encounters this instruction, it sets `sema.branch_hint` appropriately, and `zirCondBr` etc are expected to reset this value as necessary. The state is on `Sema` rather than `Block` to make it automatically propagate up non-conditional blocks without special handling. If `@panic` is reached, the branch hint is set to `.cold` if none was already set; similarly, error branches get a hint of `.unlikely` if no hint is explicitly provided. If a condition is comptime-known, `cold` hints from the taken branch are allowed to propagate up, but other hints are discarded. This is because a `likely`/`unlikely` hint just indicates the direction this branch is likely to go, which is redundant information when the branch is known at comptime; but `cold` hints indicate that control flow is unlikely to ever reach this branch, meaning if the branch is always taken from its parent, then the parent is also unlikely to ever be reached. This branch information is stored in AIR `cond_br` and `switch_br`. In addition, `try` and `try_ptr` instructions have variants `try_cold` and `try_ptr_cold` which indicate that the error case is cold (rather than just unlikely); this is reachable through e.g. `errdefer unreachable` or `errdefer @panic("")`. A new API `unwrapSwitch` is introduced to `Air` to make it more convenient to access `switch_br` instructions. In time, I plan to update all AIR instructions to be accessed via an `unwrap` method which returns a convenient tagged union a la `InternPool.indexToKey`. The LLVM backend lowers branch hints for conditional branches and switches as follows: * If any branch is marked `unpredictable`, the instruction is marked `!unpredictable`. * Any branch which is marked as `cold` gets a `llvm.assume(i1 true) [ "cold"() ]` call to mark the code path cold. * If any branch is marked `likely` or `unlikely`, branch weight metadata is attached with `!prof`. Likely branches get a weight of 2000, and unlikely branches a weight of 1. In `switch` statements, un-annotated branches get a weight of 1000 as a "middle ground" hint, since there could be likely and unlikely and un-annotated branches. For functions, a `cold` hint corresponds to the `cold` function attribute, and other hints are currently ignored -- as far as I can tell LLVM doesn't really have a way to lower them. (Ideally, we would want the branch hint given in the function to propagate to call sites.) The compiler and standard library do not yet use this new builtin. Resolves: #21148	2024-08-27 00:41:49 +01:00
David Rubin	80cd53d3bb	sema: clean-up `{union,struct}FieldAlignment` and friends My main gripes with this design were that it was incorrectly namespaced, the naming was inconsistent and a bit wrong (`fooAlign` vs `fooAlignment`). This commit moves all the logic from `PerThread.zig` to use the zcu + tid system that the previous couple commits introduce. I've organized and merged the functions to be a bit more specific to their own purpose. - `fieldAlignment` takes a struct or union type, an index, and a Zcu (or the Sema version which takes a Pt), and gives you the alignment of the field at the index. - `structFieldAlignment` takes the field type itself, and provides the logic to handle special cases, such as externs. A design goal I had in mind was to avoid using the word 'struct' in the function name, when it worked for things that aren't structs, such as unions.	2024-08-25 15:16:46 -07:00
David Rubin	b4bb64ce78	sema: rework type resolution to use Zcu when possible	2024-08-25 15:16:42 -07:00
Jacob Young	62f7276501	Dwarf: emit info about inline call sites	2024-08-20 08:09:33 -04:00
Jacob Young	ef11bc9899	Dwarf: rework self-hosted debug info from scratch This is in preparation for incremental and actually being able to debug executables built by the x86_64 backend.	2024-08-16 15:22:55 -04:00
Jakub Konka	1bd54a55fa	fix compile errors in other codegen backends	2024-08-13 21:52:40 +02:00
mlugg	548a087faf	compiler: split Decl into Nav and Cau The type `Zcu.Decl` in the compiler is problematic: over time it has gained many responsibilities. Every source declaration, container type, generic instantiation, and `@extern` has a `Decl`. The functions of these `Decl`s are in some cases entirely disjoint. After careful analysis, I determined that the two main responsibilities of `Decl` are as follows: * A `Decl` acts as the "subject" of semantic analysis at comptime. A single unit of analysis is either a runtime function body, or a `Decl`. It registers incremental dependencies, tracks analysis errors, etc. * A `Decl` acts as a "global variable": a pointer to it is consistent, and it may be lowered to a specific symbol by the codegen backend. This commit eliminates `Decl` and introduces new types to model these responsibilities: `Cau` (Comptime Analysis Unit) and `Nav` (Named Addressable Value). Every source declaration, and every container type requiring resolution (so not including `opaque`), has a `Cau`. For a source declaration, this `Cau` performs the resolution of its value. (When #131 is implemented, it is unsolved whether type and value resolution will share a `Cau` or have two distinct `Cau`s.) For a type, this `Cau` is the context in which type resolution occurs. Every non-`comptime` source declaration, every generic instantiation, and every distinct `extern` has a `Nav`. These are sent to codegen/link: the backends by definition do not care about `Cau`s. This commit has some minor technically-breaking changes surrounding `usingnamespace`. I don't think they'll impact anyone, since the changes are fixes around semantics which were previously inconsistent (the behavior changed depending on hashmap iteration order!). Aside from that, this changeset has no significant user-facing changes. Instead, it is an internal refactor which makes it easier to correctly model the responsibilities of different objects, particularly regarding incremental compilation. The performance impact should be negligible, but I will take measurements before merging this work into `master`. Co-authored-by: Jacob Young <jacobly0@users.noreply.github.com> Co-authored-by: Jakub Konka <kubkon@jakubkonka.com>	2024-08-11 07:29:41 +01:00
Jakub Konka	41e9b8b6c8	elf: fix compile errors	2024-08-07 10:21:02 +02:00
mlugg	f93a10f664	Air: store param names directly instead of referencing Zir	2024-07-10 11:20:08 -04:00
Jacob Young	525f341f33	Zcu: introduce `PerThread` and pass to all the functions	2024-07-07 22:59:52 -04:00
Andrew Kelley	30ec43a6c7	Zcu: extract permanent state from File Primarily, this commit removes 2 fields from File, relying on the data being stored in the `files` field, with the key as the path digest, and the value as the struct decl corresponding to the File. This table is serialized into the compiler state that survives between incremental updates. Meanwhile, the File struct remains ephemeral data that can be reconstructed the first time it is needed by the compiler process, as well as operated on by independent worker threads. A key outcome of this commit is that there is now a stable index that can be used to refer to a File. This will be needed when serializing error messages to survive incremental compilation updates.	2024-07-04 17:51:35 -07:00
mlugg	2f0f1efa6f	compiler: type.zig -> Type.zig	2024-07-04 21:01:42 +01:00
mlugg	ded5c759f8	Zcu: store `LazySrcLoc` in error messages This change modifies `Zcu.ErrorMsg` to store a `Zcu.LazySrcLoc` rather than a `Zcu.SrcLoc`. Everything else is dominoes. The reason for this change is incremental compilation. If a failed `AnalUnit` is up-to-date on an update, we want to re-use the old error messages. However, the file containing the error location may have been modified, and `SrcLoc` cannot survive such a modification. `LazySrcLoc` is designed to be correct across incremental updates. Therefore, we defer source location resolution until `Compilation` gathers the compile errors into the `ErrorBundle`.	2024-07-04 21:01:41 +01:00
Andrew Kelley	0fcd59eada	rename src/Module.zig to src/Zcu.zig This patch is a pure rename plus only changing the file path in `@import` sites, so it is expected to not create version control conflicts, even when rebasing.	2024-06-22 22:59:56 -04:00
Andrew Kelley	9be8a9000f	Revert "implement `@expect` builtin (#19658 )" This reverts commit `a7de02e052`. This did not implement the accepted proposal, and I did not sign off on the changes. I would like a chance to review this, please.	2024-05-22 09:57:43 -07:00
David Rubin	a7de02e052	implement `@expect` builtin (#19658 ) * implement `@expect` * add docs * add a second arg for expected bool * fix typo * move `expect` to use BinOp * update to newer langref format	2024-05-22 10:51:16 -05:00
mlugg	a61def10c6	compiler: eliminate most usages of TypedValue	2024-03-26 13:48:07 +00:00
mlugg	c6f3e9d79c	Zcu.Decl: remove `ty` field `Decl` can no longer store un-interned values, so this field is now unnecessary. The type can instead be fetched with the new `typeOf` helper method, which just gets the type of the Decl's `Value`.	2024-03-26 13:48:06 +00:00
Tristan Ross	6067d39522	std.builtin: make atomic order fields lowercase	2024-03-11 07:09:10 -07:00
Jacob Young	aa688567f5	Air: replace `.dbg_inline_*` with `.dbg_inline_block` This prevents the possibility of not emitting a `.dbg_inline_end` instruction and reduces the allocation requirements of the backends. Closes #19093	2024-03-02 21:19:34 -08:00
mlugg	59447e5305	compiler: decide dbg_var scoping based on AIR blocks This commit eliminates the `dbg_block_{begin,end}` instructions from both ZIR and AIR. Instead, lexical scoping of `dbg_var_{ptr,val}` instructions is decided based on the AIR block they exist within. This is a much more robust system, and also results in a huge drop in ZIR bytes - around 7% for Sema.zig. This required some enhancements to Sema to prevent elision of blocks when they are required for debug variable scoping. This can be observed by looking at the AIR for the following simple test program with and without `-fstrip`: ```zig export fn f() void { { var a: u32 = 0; _ = &a; } { var a: u32 = 0; _ = &a; } } ``` When `-fstrip` is passed, no AIR blocks are generated. When `-fno-strip` is passed, the ZIR blocks are lowered to true AIR blocks to give correct lexical scoping to the debug vars. The changes here incidentally reolve #19060. A corresponding behavior test has been added. Resolves: #19060	2024-02-26 13:20:45 +00:00
Andrew Kelley	78f15bc714	compiler: rename value.zig to Value.zig This commit only does the file rename to be friendlier to version control conflicts.	2024-02-05 18:13:07 -07:00
Veikka Tuominen	7d75c3d3b8	llvm: ensure returned undef is 0xaa bytes when runtime safety is enabled Closes #13178	2024-01-29 17:35:07 -08:00
Andrew Kelley	48d5861f92	fix more compilation errors introduced by this branch	2024-01-01 17:51:20 -07:00
Andrew Kelley	c49957dbe8	fix a round of compile errors caused by this branch	2024-01-01 17:51:19 -07:00
Andrew Kelley	bc4d2b646d	compiler: update references to target	2024-01-01 17:51:19 -07:00
Andrew Kelley	f5ddef1e45	update references to module (to be renamed to zcu)	2024-01-01 17:51:19 -07:00
Jacob Young	daf91ed8d1	Air: use typesafe `Air.Inst.Index` I need some indices for a thing...	2023-12-03 02:05:06 -08:00
Techatrix	18608223ef	convert `toType` and `toValue` to `Type.fromInterned` and `Value.fromInterned`	2023-11-25 04:09:53 -05:00
Jakub Konka	9bdbb6312f	elf: move incremental codegen bits into ZigObject.zig	2023-10-30 19:09:13 +01:00
Jakub Konka	2be1250f24	x86_64: no more load/lea_symbol weirdness	2023-10-28 03:48:18 -04:00
Jakub Konka	9a1fbb2705	x86_64: rename load/lea_memory to load/lea_symbol	2023-10-28 03:48:18 -04:00
Jakub Konka	a6a10d9c2b	x86_64: do not hardcode memory passed by Elf linker	2023-10-28 03:48:18 -04:00
Jakub Konka	6993b3e23e	codegen: refactor .actual_got into .extern_got	2023-10-16 19:33:06 +02:00
Jakub Konka	45197ea7ad	codegen+elf: lower imported data refs	2023-10-16 19:33:06 +02:00
Jakub Konka	7be983ac92	elf: create new synthetic section ZigGotSection	2023-10-16 19:33:05 +02:00
antlilja	6a29646a55	Rename `@fabs` to `@abs` and accept integers Replaces the @fabs builtin with a new @abs builtins which accepts floats, signed integers and vectors of said types.	2023-09-27 11:15:53 -07:00
Andrew Kelley	accd5701c2	compiler: move struct types into InternPool proper Structs were previously using `SegmentedList` to be given indexes, but were not actually backed by the InternPool arrays. After this, the only remaining uses of `SegmentedList` in the compiler are `Module.Decl` and `Module.Namespace`. Once those last two are migrated to become backed by InternPool arrays as well, we can introduce state serialization via writing these arrays to disk all at once. Unfortunately there are a lot of source code locations that touch the struct type API, so this commit is still work-in-progress. Once I get it compiling and passing the test suite, I can provide some interesting data points such as how it affected the InternPool memory size and performance comparison against master branch. I also couldn't resist migrating over a bunch of alignment API over to use the log2 Alignment type rather than a mismash of u32 and u64 byte units with 0 meaning something implicitly different and special at every location. Turns out you can do all the math you need directly on the log2 representation of alignments.	2023-09-21 14:48:40 -07:00
Jakub Konka	ce88df497c	elf: do not store Symbol's index in Symbol	2023-09-13 21:51:43 +02:00
Jakub Konka	6ad5db030c	elf: store GOT index in symbol extra array; use GotSection for GOT	2023-09-08 18:12:53 +02:00
Jakub Konka	a9df098cd2	elf: make everything upside down - track by Symbol.Index rather than Atom.Index	2023-09-06 13:14:00 +02:00
Jakub Konka	bc37c95e56	elf: simplify accessors to symbols, atoms, etc	2023-09-04 11:23:19 +02:00
Andrew Kelley	db33ee45b7	rework generic function calls Abridged summary: * Move `Module.Fn` into `InternPool`. * Delete a lot of confusing and problematic `Sema` logic related to generic function calls. This commit removes `Module.Fn` and replaces it with two new `InternPool.Tag` values: * `func_decl` - corresponding to a function declared in the source code. This one contains line/column numbers, zir_body_inst, etc. * `func_instance` - one for each monomorphization of a generic function. Contains a reference to the `func_decl` from whence the instantiation came, along with the `comptime` parameter values (or types in the case of `anytype`) Since `InternPool` provides deduplication on these values, these fields are now deleted from `Module`: * `monomorphed_func_keys` * `monomorphed_funcs` * `align_stack_fns` Instead of these, Sema logic for generic function instantiation now unconditionally evaluates the function prototype expression for every generic callsite. This is technically required in order for type coercions to work. The previous code had some dubious, probably wrong hacks to make things work, such as `hashUncoerced`. I'm not 100% sure how we were able to eliminate that function and still pass all the behavior tests, but I'm pretty sure things were still broken without doing type coercion for every generic function call argument. After the function prototype is evaluated, it produces a deduplicated `func_instance` `InternPool.Index` which can then be used for the generic function call. Some other nice things made by this simplification are the removal of `comptime_args_fn_inst` and `preallocated_new_func` from `Sema`, and the messy logic associated with them. I have not yet been able to measure the perf of this against master branch. On one hand, it reduces memory usage and pointer chasing of the most heavily used `InternPool` Tag - function bodies - but on the other hand, it does evaluate function prototype expressions more than before. We will soon find out.	2023-07-18 19:02:05 -07:00
mlugg	ff37ccd298	Air: store interned values in Air.Inst.Ref Previously, interned values were represented as AIR instructions using the `interned` tag. Now, the AIR ref directly encodes the InternPool index. The encoding works as follows: * If the ref matches one of the static values, it corresponds to the same InternPool index. * Otherwise, if the MSB is 0, the ref corresponds to an InternPool index. * Otherwise, if the MSB is 1, the ref corresponds to an AIR instruction index (after removing the MSB). Note that since most static InternPool indices are low values (the exceptions being `.none` and `.var_args_param_type`), the first rule is almost a nop.	2023-06-27 01:21:32 -07:00

1 2 3 4 5 ...

298 Commits