motiejus/zig - zig - gitea: Gitea Service

Author	SHA1	Message	Date
Jacob Young	3d2dfbe828	InternPool: add `FileIndex` to `*File` mapping	2024-07-10 11:20:08 -04:00
mlugg	f93a10f664	Air: store param names directly instead of referencing Zir	2024-07-10 11:20:08 -04:00
Jacob Young	667b4f9054	Zcu: cache fully qualified name on Decl This avoids needing to mutate the intern pool from backends.	2024-07-10 11:10:49 -04:00
Jacob Young	ca02266157	Zcu: pass `PerThread` to intern pool string functions	2024-07-07 22:59:52 -04:00
Jacob Young	525f341f33	Zcu: introduce `PerThread` and pass to all the functions	2024-07-07 22:59:52 -04:00
Andrew Kelley	30ec43a6c7	Zcu: extract permanent state from File Primarily, this commit removes 2 fields from File, relying on the data being stored in the `files` field, with the key as the path digest, and the value as the struct decl corresponding to the File. This table is serialized into the compiler state that survives between incremental updates. Meanwhile, the File struct remains ephemeral data that can be reconstructed the first time it is needed by the compiler process, as well as operated on by independent worker threads. A key outcome of this commit is that there is now a stable index that can be used to refer to a File. This will be needed when serializing error messages to survive incremental compilation updates.	2024-07-04 17:51:35 -07:00
mlugg	0e5335aaf5	compiler: rework type resolution, fully resolve all types I'm so sorry. This commit was just meant to be making all types fully resolve by queueing resolution at the moment of their creation. Unfortunately, a lot of dominoes ended up falling. Here's what happened: * I added a work queue job to fully resolve a type. * I realised that from here we could eliminate `Sema.types_to_resolve` if we made function codegen a separate job. This is desirable for simplicity of both spec and implementation. * This led to a new AIR traversal to detect whether any required type is unresolved. If a type in the AIR failed to resolve, then we can't run codegen. * Because full type resolution now occurs by the work queue job, a bug was exposed whereby error messages for type resolution were associated with the wrong `Decl`, resulting in duplicate error messages when the type was also resolved "by" its owner `Decl` (which really all resolution should be done on). * A correct fix for this requires using a different `Sema` when performing type resolution: we need a `Sema` owned by the type. Also note that this fix is necessary for incremental compilation. * This means a whole bunch of functions no longer need to take `Sema`s. * First-order effects: `resolveTypeFields`, `resolveTypeLayout`, etc * Second-order effects: `Type.abiAlignmentAdvanced`, `Value.orderAgainstZeroAdvanced`, etc The end result of this is, in short, a more correct compiler and a simpler language specification. This regressed a few error notes in the test cases, but nothing that seems worth blocking this change. Oh, also, I ripped out the old code in `test/src/Cases.zig` which introduced a dependency on `Compilation`. This dependency was problematic at best, and this code has been unused for a while. When we re-enable incremental test cases, we must rewrite their executor to use the compiler server protocol.	2024-07-04 21:01:42 +01:00
mlugg	2f0f1efa6f	compiler: type.zig -> Type.zig	2024-07-04 21:01:42 +01:00
mlugg	ded5c759f8	Zcu: store `LazySrcLoc` in error messages This change modifies `Zcu.ErrorMsg` to store a `Zcu.LazySrcLoc` rather than a `Zcu.SrcLoc`. Everything else is dominoes. The reason for this change is incremental compilation. If a failed `AnalUnit` is up-to-date on an update, we want to re-use the old error messages. However, the file containing the error location may have been modified, and `SrcLoc` cannot survive such a modification. `LazySrcLoc` is designed to be correct across incremental updates. Therefore, we defer source location resolution until `Compilation` gathers the compile errors into the `ErrorBundle`.	2024-07-04 21:01:41 +01:00
mlugg	5f03c02505	Zcu: key compile errors on `AnalUnit` where appropriate This change seeks to more appropriately model the way semantic analysis works by drawing a more clear line between errors emitted by analyzing a `Decl` (in future a `Cau`) and errors emitted by analyzing a runtime function. This does change a few compile errors surrounding compile logs by adding more "also here" notes. The new notes are more technically correct, but perhaps not so helpful. They're not doing enough harm for me to put extensive thought into this for now.	2024-07-04 21:01:41 +01:00
mlugg	7e552dc1e9	Zcu: rework exports This commit reworks our representation of exported Decls and values in Zcu to be memory-optimized and trivially serialized. All exports are now stored in the `all_exports` array on `Zcu`. An `AnalUnit` which performs an export (either through an `export` annotation or by containing an analyzed `@export`) gains an entry into `single_exports` if it performs only one export, or `multi_exports` if it performs multiple. We no longer store a persistent mapping from a `Decl`/value to all exports of that entity; this state is not necessary for the majority of the pipeline. Instead, we construct it in `Zcu.processExports`, just before flush. This does not affect the algorithmic complexity of `processExports`, since this function already iterates all exports in the `Zcu`. The elimination of `decl_exports` and `value_exports` led to a few non-trivial backend changes. The LLVM backend has been wrangled into a more reasonable state in general regarding exports and externs. The C backend is currently disabled in this commit, because its support for `export` was quite broken, and that was exposed by this work -- I'm hoping @jacobly0 will be able to pick this up!	2024-07-04 21:01:41 +01:00
mlugg	5b523d0469	Zir: make `src_line` absolute for `declaration` instructions We need special logic for updating line numbers anyway, so it's fine to just use absolute numbers here. This eliminates a field from `Decl`.	2024-06-26 05:28:03 +01:00
Andrew Kelley	0fcd59eada	rename src/Module.zig to src/Zcu.zig This patch is a pure rename plus only changing the file path in `@import` sites, so it is expected to not create version control conflicts, even when rebasing.	2024-06-22 22:59:56 -04:00
Matthew Lugg	f73be120f4	Merge pull request #20299 from mlugg/the-great-decl-split The Great Decl Split (preliminary work): refactor source locations and eliminate `Sema.Block.src_decl`.	2024-06-20 11:07:17 +01:00
Jakub Konka	a154d8da8e	Merge pull request #20070 from Rexicon226/riscv more RISC-V backend progress	2024-06-19 17:42:08 +02:00
Jacob Young	d208e59496	llvm: avoid debug info for naked function arguments This is not supported by llvm.	2024-06-16 23:30:57 -04:00
mlugg	1eaeb4a0a8	Zcu: rework source locations `LazySrcLoc` now stores a reference to the "base AST node" to which it is relative. The previous tagged union is `LazySrcLoc.Offset`. To make working with this structure convenient, `Sema.Block` contains a convenience `src` method which takes an `Offset` and returns a `LazySrcLoc`. The "base node" of a source location is no longer given by a `Decl`, but rather a `TrackedInst` representing either a `declaration`, `struct_decl`, `union_decl`, `enum_decl`, or `opaque_decl`. This is a more appropriate model, and removes an unnecessary responsibility from `Decl` in preparation for the upcoming refactor which will split it into `Nav` and `Cau`. As a part of these `Decl` reworks, the `src_node` field is eliminated. This change aids incremental compilation, and simplifies `Decl`. In some cases -- particularly in backends -- the source location of a declaration is desired. This was previously `Decl.srcLoc` and worked for any `Decl`. Now, it is `Decl.navSrcLoc` in reference to the upcoming refactor, since the set of `Decl`s this works for precisely corresponds to what will in future become a `Nav` -- that is, source-level declarations and generic function instantiations, but not type owner Decls. This commit introduces more tags to `LazySrcLoc.Offset` so as to eliminate the concept of `error.NeededSourceLocation`. Now, `.unneeded` should only be used to assert that an error path is unreachable. In the future, uses of `.unneeded` can probably be replaced with `undefined`. The `src_decl` field of `Sema.Block` no longer has a role in type resolution. Its main remaining purpose is to handle namespacing of type names. It will be eliminated entirely in a future commit to remove another undue responsibility from `Decl`. It is worth noting that in future, the `Zcu.SrcLoc` type should probably be eliminated entirely in favour of storing `Zcu.LazySrcLoc` values. This is because `Zcu.SrcLoc` is not valid across incremental updates, and we want to be able to reuse error messages from previous updates even if the source file in question changed. The error reporting logic should instead simply resolve the location from the `LazySrcLoc` on the fly.	2024-06-15 00:57:52 +01:00
mlugg	07a24bec9a	compiler: move LazySrcLoc out of std This is in preparation for some upcoming changes to how we represent source locations in the compiler. The bulk of the change here is dealing with the removal of `src()` methods from `Zir` types.	2024-06-15 00:57:52 +01:00
Ryan Liptak	76fb2b685b	std: Convert deprecated aliases to compile errors and fix usages Deprecated aliases that are now compile errors: - `std.fs.MAX_PATH_BYTES` (renamed to `std.fs.max_path_bytes`) - `std.mem.tokenize` (split into `tokenizeAny`, `tokenizeSequence`, `tokenizeScalar`) - `std.mem.split` (split into `splitSequence`, `splitAny`, `splitScalar`) - `std.mem.splitBackwards` (split into `splitBackwardsSequence`, `splitBackwardsAny`, `splitBackwardsScalar`) - `std.unicode` + `utf16leToUtf8Alloc`, `utf16leToUtf8AllocZ`, `utf16leToUtf8`, `fmtUtf16le` (all renamed to have capitalized `Le`) + `utf8ToUtf16LeWithNull` (renamed to `utf8ToUtf16LeAllocZ`) - `std.zig.CrossTarget` (moved to `std.Target.Query`) Deprecated `lib/std/std.zig` decls were deleted instead of made a `@compileError` because the `refAllDecls` in the test block would trigger the `@compileError`. The deleted top-level `std` namespaces are: - `std.rand` (renamed to `std.Random`) - `std.TailQueue` (renamed to `std.DoublyLinkedList`) - `std.ChildProcess` (renamed/moved to `std.process.Child`) This is not exhaustive. Deprecated aliases that I didn't touch: + `std.io.` + `std.Build.` + `std.builtin.Mode` + `std.zig.c_translation.CIntLiteralRadix` + anything in `src/`	2024-06-13 10:18:59 -04:00
David Rubin	b2cb090c37	riscv: float args	2024-06-13 02:21:39 -07:00
Andrew Kelley	8f27fdb84e	Merge pull request #20120 from vahur/move-consts-to-rdata mark anondecls as constants in llvm ir	2024-06-05 14:24:00 -04:00
Andrew Kelley	ac3faa6cb3	LLVM backend: loongarch64 support	2024-06-05 13:43:35 -04:00
Veikka Tuominen	c01d8c8b20	pass `-fno-builtin` when testing `lib/c.zig`, `lib/compiler_rt.zig`	2024-06-02 10:10:35 +03:00
Veikka Tuominen	06d9f88f97	llvm: disable constant anons on Darwin	2024-06-01 19:32:19 +03:00
Vahur Sinijärv	4af163ac94	mark anondecls as constants in llvm ir	2024-05-30 00:32:16 +03:00
Andrew Kelley	9be8a9000f	Revert "implement `@expect` builtin (#19658 )" This reverts commit `a7de02e052`. This did not implement the accepted proposal, and I did not sign off on the changes. I would like a chance to review this, please.	2024-05-22 09:57:43 -07:00
David Rubin	a7de02e052	implement `@expect` builtin (#19658 ) * implement `@expect` * add docs * add a second arg for expected bool * fix typo * move `expect` to use BinOp * update to newer langref format	2024-05-22 10:51:16 -05:00
Veikka Tuominen	0fb2015fd3	llvm: fix `@wasmMemory{Size,Grow}` for wasm64 Closes #19942	2024-05-22 09:48:52 -04:00
Veikka Tuominen	f776e70c39	llvm: fix lowering of packed structs with optional pointers Closes #20022	2024-05-21 19:52:25 +03:00
Veikka Tuominen	167854c19c	llvm: lower ptr to int constants with correct address spaces Closes #19915	2024-05-21 19:52:00 +03:00
Veikka Tuominen	8aae0d87b5	Target: add OpenHarmonyOS ABI Closes #20009	2024-05-20 09:25:52 -04:00
David Rubin	3bf008a3d0	riscv: implement slices	2024-05-11 02:17:11 -07:00
Pyry Kovanen	4163126c01	llvm: always include debug information for global variables	2024-05-09 16:02:03 -07:00
Jakub Konka	d3ba541034	codegen/llvm: handle missing Apple targets	2024-05-09 22:00:17 +02:00
Jakub Konka	2e1fc0dd14	handle visionos target OS tag in the compiler * rename .xros to .visionos as agreed in the tracking issue * add support for VisionOS platform in the MachO linker	2024-05-09 15:04:15 +02:00
Andrew Kelley	e9efed9ed1	LLVM: zeroext/signext does happen on macos Fixes a regression introduced in 3ce7fee9dd8bbb6f56e47758a9a8ada028400c71.	2024-05-08 19:37:29 -07:00
Andrew Kelley	88ada2121f	LLVM: fix x86 and x86_64 datalayout string calculation now it matches clang again	2024-05-08 19:37:29 -07:00
Andrew Kelley	6730b366a0	LLVM backend: no more signext on aarch64 Clang doesn't do it, so Zig must not do it in order to match the C ABI.	2024-05-08 19:37:29 -07:00
Andrew Kelley	6986d2aca9	x86_64 sysv C ABI: fix f128 param and return types Clang 17 passed struct{f128} parameters using rdi and rax, while Clang 18 matches GCC 13.2 behavior, passing them using xmm0. This commit makes Zig's LLVM backend match Clang 18 and GCC 13.2. The commit deletes a hack in x86_64/abi.zig which miscategorized f128 as "memory" which obviously disagreed with the spec.	2024-05-08 19:37:29 -07:00
Andrew Kelley	b7799ef322	std.Target.maxIntAlignment: move to compiler implementation This should not be a public API, and the x86 backend does not support the value 16.	2024-05-08 19:37:29 -07:00
Andrew Kelley	65bea9ac07	LLVM 18 update: avoid passing vectors sometimes LLVM now refuses to lower arguments and return values on x86 targets when the total vector bit size is >= 512. This code detects such a situation and uses byref instead of byval.	2024-05-08 19:37:28 -07:00
Andrew Kelley	d34fae26d5	LLVM 18 std lib updates and fixes * some manual fixes to generated CPU features code. In the future it would be nice to make the script do those automatically. * add to various target OS switches. Some of the values I was unsure of and added TODO panics, for example in the case of spirv CPU arch.	2024-05-08 19:37:28 -07:00
mlugg	db890dbae7	InternPool: eliminate `var_args_param_type` This was a "fake" type used to handle C varargs parameters, much like generic poison. In fact, it is treated identically to generic poison in all cases other than one (the final coercion of a call argument), which is trivially special-cased. Thus, it makes sense to remove this special tag and instead use `generic_poison_type` in its place. This fixes several bugs in Sema related to missing handling of this tag. Resolves: #19781	2024-05-04 22:03:56 +01:00
Anton Lilja	20b9b54e6b	LLVM: Fix panic when using tagged union backed by enum with negative values	2024-05-02 14:15:44 +00:00
Julian	4303400e47	Sema+llvm: properly implement `Interrupt` callconv Co-authored-by: Veikka Tuominen <git@vexu.eu>	2024-04-28 22:43:00 +00:00
antlilja	c231d94960	LLVM: Remove deprecated or soon to be deprecated constant expressions	2024-04-25 22:58:47 -07:00
Jacob Young	5d745d94fb	x86_64: fix C abi for unions Closes #19721	2024-04-22 15:24:29 -07:00
Jacob Young	ebce190321	llvm: fix debug info when running tests	2024-04-19 12:13:29 -07:00
mlugg	d0e74ffe52	compiler: rework comptime pointer representation and access We've got a big one here! This commit reworks how we represent pointers in the InternPool, and rewrites the logic for loading and storing from them at comptime. Firstly, the pointer representation. Previously, pointers were represented in a highly structured manner: pointers to fields, array elements, etc, were explicitly represented. This works well for simple cases, but is quite difficult to handle in the cases of unusual reinterpretations, pointer casts, offsets, etc. Therefore, pointers are now represented in a more "flat" manner. For types without well-defined layouts -- such as comptime-only types, automatic-layout aggregates, and so on -- we still use this "hierarchical" structure. However, for types with well-defined layouts, we use a byte offset associated with the pointer. This allows the comptime pointer access logic to deal with reinterpreted pointers far more gracefully, because the "base address" of a pointer -- for instance a `field` -- is a single value which pointer accesses cannot exceed since the parent has undefined layout. This strategy is also more useful to most backends -- see the updated logic in `codegen.zig` and `codegen/llvm.zig`. For backends which do prefer a chain of field and elements accesses for lowering pointer values, such as SPIR-V, there is a helpful function in `Value` which creates a strategy to derive a pointer value using ideally only field and element accesses. This is actually more correct than the previous logic, since it correctly handles pointer casts which, after the dust has settled, end up referring exactly to an aggregate field or array element. In terms of the pointer access code, it has been rewritten from the ground up. The old logic had become rather a mess of special cases being added whenever bugs were hit, and was still riddled with bugs. The new logic was written to handle the "difficult" cases correctly, the most notable of which is restructuring of a comptime-only array (for instance, converting a `[3][2]comptime_int` to a `[2][3]comptime_int`. Currently, the logic for loading and storing work somewhat differently, but a future change will likely improve the loading logic to bring it more in line with the store strategy. As far as I can tell, the rewrite has fixed all bugs exposed by #19414. As a part of this, the comptime bitcast logic has also been rewritten. Previously, bitcasts simply worked by serializing the entire value into an in-memory buffer, then deserializing it. This strategy has two key weaknesses: pointers, and undefined values. Representations of these values at comptime cannot be easily serialized/deserialized whilst preserving data, which means many bitcasts would become runtime-known if pointers were involved, or would turn `undefined` values into `0xAA`. The new logic works by "flattening" the datastructure to be cast into a sequence of bit-packed atomic values, and then "unflattening" it; using serialization when necessary, but with special handling for `undefined` values and for pointers which align in virtual memory. The resulting code is definitely slower -- more on this later -- but it is correct. The pointer access and bitcast logic required some helper functions and types which are not generally useful elsewhere, so I opted to split them into separate files `Sema/comptime_ptr_access.zig` and `Sema/bitcast.zig`, with simple re-exports in `Sema.zig` for their small public APIs. Whilst working on this branch, I caught various unrelated bugs with transitive Sema errors, and with the handling of `undefined` values. These bugs have been fixed, and corresponding behavior test added. In terms of performance, I do anticipate that this commit will regress performance somewhat, because the new pointer access and bitcast logic is necessarily more complex. I have not yet taken performance measurements, but will do shortly, and post the results in this PR. If the performance regression is severe, I will do work to to optimize the new logic before merge. Resolves: #19452 Resolves: #19460	2024-04-17 13:41:25 +01:00
Jacob Young	7611d90ba0	InternPool: remove slice from byte aggregate keys This deletes a ton of lookups and avoids many UAF bugs. Closes #19485	2024-04-08 13:24:08 -04:00

1 2 3 4 5 ...

915 Commits