motiejus/zig - zig - gitea: Gitea Service

Author	SHA1	Message	Date
Andrew Kelley	94648a0383	fix merge conflicts with updating line numbers	2025-01-15 15:11:36 -08:00
Andrew Kelley	5b18af85cb	type checking for synthetic functions	2025-01-15 15:11:36 -08:00
Andrew Kelley	4fccb5ae7a	wasm linker: improve error messages by making source locations more lazy	2025-01-15 15:11:36 -08:00
Andrew Kelley	1a4c5837fe	wasm linker: fix crashes when parsing compiler_rt	2025-01-15 15:11:36 -08:00
Andrew Kelley	d1cde847a3	implement the prelink phase in the frontend this strategy uses a "postponed" queue to handle codegen tasks that spawn too early. there's probably a better way.	2025-01-15 15:11:36 -08:00
Andrew Kelley	070b973c4a	wasm linker: allow undefined imports when lib name is provided and expose object_host_name as an option for setting the lib name for object files, since the wasm linking standards don't specify a way to do it.	2025-01-15 15:11:36 -08:00
Andrew Kelley	edd592d371	fix compilation when enabling llvm	2025-01-15 15:11:35 -08:00
Andrew Kelley	943dac3e85	compiler: add type safety for export indices	2025-01-15 15:11:35 -08:00
Andrew Kelley	9bf715de74	rework error handling in the backends	2025-01-15 15:11:35 -08:00
Andrew Kelley	77accf597d	elf linker: conform to explicit error sets	2025-01-15 15:11:35 -08:00
Andrew Kelley	da25ed95fc	macho linker conforms to explicit error sets, again	2025-01-15 15:11:35 -08:00
Andrew Kelley	16180f525a	macho linker: conform to explicit error sets Makes linker functions have small error sets, required to report diagnostics properly rather than having a massive error set that has a lot of codes. Other linker implementations are not ported yet. Also the branch is not passing semantic analysis yet.	2025-01-15 15:11:35 -08:00
Andrew Kelley	795e7c64d5	wasm linker: aggressive DODification The goals of this branch are to: * compile faster when using the wasm linker and backend * enable saving compiler state by directly copying in-memory linker state to disk. * more efficient compiler memory utilization * introduce integer type safety to wasm linker code * generate better WebAssembly code * fully participate in incremental compilation * do as much work as possible outside of flush(), while continuing to do linker garbage collection. * avoid unnecessary heap allocations * avoid unnecessary indirect function calls In order to accomplish this goals, this removes the ZigObject abstraction, as well as Symbol and Atom. These abstractions resulted in overly generic code, doing unnecessary work, and needless complications that simply go away by creating a better in-memory data model and emitting more things lazily. For example, this makes wasm codegen emit MIR which is then lowered to wasm code during linking, with optimal function indexes etc, or relocations are emitted if outputting an object. Previously, this would always emit relocations, which are fully unnecessary when emitting an executable, and required all function calls to use the maximum size LEB encoding. This branch introduces the concept of the "prelink" phase which occurs after all object files have been parsed, but before any Zcu updates are sent to the linker. This allows the linker to fully parse all objects into a compact memory model, which is guaranteed to be complete when Zcu code is generated. This commit is not a complete implementation of all these goals; it is not even passing semantic analysis.	2025-01-15 15:11:35 -08:00
mlugg	065e10c95c	link: new incremental line number update API	2025-01-05 02:20:56 +00:00
mlugg	3afda4322c	compiler: analyze type and value of global declaration separately This commit separates semantic analysis of the annotated type vs value of a global declaration, therefore allowing recursive and mutually recursive values to be declared. Every `Nav` which undergoes analysis now has two corresponding `AnalUnit`s: `.{ .nav_val = n }` and `.{ .nav_ty = n }`. The `nav_val` unit is responsible for fully resolving the `Nav`: determining its value, linksection, addrspace, etc. The `nav_ty` unit, on the other hand, resolves only the information necessary to construct a pointer to the `Nav`: its type, addrspace, etc. (It does also analyze its linksection, but that could be moved to `nav_val` I think; it doesn't make any difference). Analyzing a `nav_ty` for a declaration with no type annotation will just mark a dependency on the `nav_val`, analyze it, and finish. Conversely, analyzing a `nav_val` for a declaration with a type annotation will first mark a dependency on the `nav_ty` and analyze it, using this as the result type when evaluating the value body. The `nav_val` and `nav_ty` units always have references to one another: so, if a `Nav`'s type is referenced, its value implicitly is too, and vice versa. However, these dependencies are trivial, so, to save memory, are only known implicitly by logic in `resolveReferences`. In general, analyzing ZIR `decl_val` will only analyze `nav_ty` of the corresponding `Nav`. There are two exceptions to this. If the declaration is an `extern` declaration, then we immediately ensure the `Nav` value is resolved (which doesn't actually require any more analysis, since such a declaration has no value body anyway). Additionally, if the resolved type has type tag `.@"fn"`, we again immediately resolve the `Nav` value. The latter restriction is in place for two reasons: * Functions are special, in that their externs are allowed to trivially alias; i.e. with a declaration `extern fn foo(...)`, you can write `const bar = foo;`. This is not allowed for non-function externs, and it means that function types are the only place where it is possible for a declaration `Nav` to have a `.@"extern"` value without actually being declared `extern`. We need to identify this situation immediately so that the `decl_ref` can create a pointer to the real extern `Nav`, not this alias. * In certain situations, such as taking a pointer to a `Nav`, Sema needs to queue analysis of a runtime function if the value is a function. To do this, the function value needs to be known, so we need to resolve the value immediately upon `&foo` where `foo` is a function. This restriction is simple to codify into the eventual language specification, and doesn't limit the utility of this feature in practice. A consequence of this commit is that codegen and linking logic needs to be more careful when looking at `Nav`s. In general: * When `updateNav` or `updateFunc` is called, it is safe to assume that the `Nav` being updated (the owner `Nav` for `updateFunc`) is fully resolved. * Any `Nav` whose value is/will be an `@"extern"` or a function is fully resolved; see `Nav.getExtern` for a helper for a common case here. * Any other `Nav` may only have its type resolved. This didn't seem to be too tricky to satisfy in any of the existing codegen/linker backends. Resolves: #131	2024-12-24 02:18:41 +00:00
Jacob Young	5c76e08f49	lldb: add pretty printer for intern pool indices	2024-12-20 22:51:20 -05:00
Andrew Kelley	d37ee79535	std.Build.Cache.hit: more discipline in error handling Previous commits `2b0929929d` `4ea2f441df` had this text: > There are no dir components, so you would think that this was > unreachable, however we have observed on macOS two processes racing to > do openat() with O_CREAT manifest in ENOENT. This appears to have been a misunderstanding based on the issue report #12138 and corresponding PR #12139 in which the steps to reproduce removed the cache directory in a loop which also executed detached Zig compiler processes. There is no evidence for the macOS kernel bug however the ENOENT is easily explained by the removal of the cache directory. This commit reverts those commits, ultimately reporting the ENOENT as an error rather than repeating the create file operation. However this commit also adds an explicit error set to `std.Build.Cache.hit` as well as changing the `failed_file_index` to a proper diagnostic field that fully communicates what failed, leading to more informative error messages on failure to check the cache. The equivalent failure when occuring for AstGen performs a fatal process kill, reasoning being that the compiler has an invariant of the cache directory not being yanked out from underneath it while executing. This could be made a more granular error in the future but I suspect such thing is not valuable to pursue. Related to #18340 but does not solve it.	2024-12-10 18:11:12 -08:00
Andrew Kelley	11bf2d92de	diversify "unable to spawn" failure messages to help understand where a spurious failure is occurring	2024-11-26 13:56:40 -08:00
Andrew Kelley	f2dcfe0e40	link.File.Wasm: parse inputs in compilation pipeline Primarily, this moves linker input parsing from flush() into the linker task queue, which is executed simultaneously with the frontend. I also made it avoid redundantly opening the same archive file N times for each object file inside. Furthermore, hard code fixed buffer stream rather than using a generic stream type. Finally, I fixed the error handling of the Wasm.Archive.parse function. Please pay attention to this pattern of returning a struct rather than accepting a mutable struct as an argument. This ensures function-level atomicity and makes resource management straightforward. Deletes the file and path fields from Archive and Object. Removed a well-meaning but ultimately misguided suggestion about how to think about ZigObject since thinking about it that way has led to problematic anti-DOD patterns.	2024-10-30 23:43:53 -07:00
Alex Rønne Petersen	16b87f7082	link: Fix archive format selection for some OSs. * AIX has its own bespoke format. * Handle all Apple platforms. * FreeBSD and OpenBSD both use the GNU format in LLVM. * Windows has since been switched to the COFF format by default in LLVM.	2024-10-31 01:33:49 +01:00
Alex Rønne Petersen	f1f804e532	zig_llvm: Reduce our exposure to LLVM API breakage. LLVM recently introduced new Triple::ArchType members in 19.1.3 which broke our static assertions in zig_llvm.cpp. When implementing a fix for that, I realized that we don't even need a lot of the stuff we have in zig_llvm.(cpp,h) anymore. This commit trims the interface down considerably.	2024-10-31 01:27:22 +01:00
Andrew Kelley	504ad56815	link.flushTaskQueue: move safety lock The safety lock needs to happen after check()	2024-10-23 16:27:39 -07:00
Andrew Kelley	ba71079837	combine codegen work queue and linker task queue these tasks have some shared data dependencies so they cannot be done simultaneously. Future work should untangle these data dependencies so that more can be done in parallel. for now this commit ensures correctness by making linker input parsing and codegen tasks part of the same queue.	2024-10-23 16:27:39 -07:00
Andrew Kelley	336466c9df	glibc sometimes makes archives be ld scripts it is incredible how many bad ideas glibc is bundled into one project.	2024-10-23 16:27:39 -07:00
Andrew Kelley	5d75d8f6fc	also find static libc files on the host and don't look for glibc files on windows	2024-10-23 16:27:39 -07:00
Andrew Kelley	3cc19cd865	better error messages	2024-10-23 16:27:38 -07:00
Andrew Kelley	c2898c436f	branch fixes	2024-10-23 16:27:38 -07:00
Andrew Kelley	5ca54036ca	move linker input file parsing to the compilation pipeline	2024-10-23 16:27:38 -07:00
Andrew Kelley	cbcd67ea90	link.MachO: fix missing input classification	2024-10-23 16:27:38 -07:00
Andrew Kelley	e2a71b37d8	fix MachO linking regression	2024-10-23 16:27:38 -07:00
Andrew Kelley	8cfe303da9	fix resolving link inputs	2024-10-23 16:27:38 -07:00
Andrew Kelley	7656903518	CLI: fix detection of link inputs	2024-10-23 16:27:38 -07:00
Andrew Kelley	e567abb339	rework linker inputs * Compilation.objects changes to Compilation.link_inputs which stores objects, archives, windows resources, shared objects, and strings intended to be put directly into the dynamic section. Order is now preserved between all of these kinds of linker inputs. If it is determined the order does not matter for a particular kind of linker input, that item should be moved to a different array. * rename system_libs to windows_libs * untangle library lookup from CLI types * when doing library lookup, instead of using access syscalls, go ahead and open the files and keep the handles around for passing to the cache system and the linker. * during library lookup and cache file hashing, use positioned reads to avoid affecting the file seek position. * library directories are opened in the CLI and converted to Directory objects, warnings emitted for those that cannot be opened.	2024-10-23 16:27:38 -07:00
Andrew Kelley	5b016e290a	move ld script processing to the frontend along with the relevant logic, making the libraries within subject to the same search criteria as all the other libraries. this unfortunately means doing file system access on all .so files when targeting ELF to determine if they are linker scripts, however, I have a plan to address this.	2024-10-23 16:27:38 -07:00
Andrew Kelley	1ba3fc90be	link.Elf: eliminate an O(N^2) algorithm in flush() Make shared_objects a StringArrayHashMap so that deduping does not need to happen in flush. That deduping code also was using an O(N^2) algorithm, which is not allowed in this codebase. There is another violation of this rule in resolveSymbols but this commit does not address it. This required reworking shared object parsing, breaking it into independent components so that we could access soname earlier. Shared object parsing had a few problems that I noticed and fixed in this commit: * Many instances of incorrect use of align(1). * `shnum * @sizeOf(elf.Elf64_Shdr)` can overflow based on user data. * `@divExact` can cause illegal behavior based on user data. * Strange versyms logic that wasn't present in mold nor lld. The logic was not commented and there is no git blame information in ziglang/zig nor kubkon/zld. I changed it to match mold and lld instead. * Use of ArrayList for slices of memory that are never resized. * finding DT_VERDEFNUM in a different loop than finding DT_SONAME. Ultimately I think we should follow mold's lead and ignore this integer, relying on null termination instead. * Doing logic based on VER_FLG_BASE rather than ignoring it like mold and LLD do. No comment explaining why the behavior is different. * Mutating the original ELF symbols rather than only storing the mangled name on the new Symbol struct. I noticed something that I didn't try to address in this commit: Symbol stores a lot of redundant information that is already present in the ELF symbols. I suspect that the codebase could benefit from reworking Symbol to not store redundant information. Additionally: * Add some type safety to std.elf. * Eliminate 1-3 file system reads for determining the kind of input files, by taking advantage of file name extension and handling error codes properly. * Move more error handling methods to link.Diags and make them infallible and thread-safe * Make the data dependencies obvious in the parameters of parseSharedObject. It's now clear that the first two steps (Header and Parsed) can be done during the main Compilation pipeline, rather than waiting for flush().	2024-10-12 10:44:17 -07:00
Andrew Kelley	13fb68c064	link: consolidate diagnostics By organizing linker diagnostics into this struct, it becomes possible to share more code between linker backends, and more importantly it becomes possible to pass only the Diag struct to some functions, rather than passing the entire linker state object in. This makes data dependencies more obvious, making it easier to rearrange code and to multithread. Also fix MachO code abusing an atomic variable. Not only was it using the wrong atomic operation, it is unnecessary additional state since the state is already being protected by a mutex.	2024-10-11 10:36:19 -07:00
Andrew Kelley	14c8e270bb	link: fix false positive crtbegin/crtend detection Embrace the Path abstraction, doing more operations based on directory handles rather than absolute file paths. Most of the diff noise here comes from this one. Fix sorting of crtbegin/crtend atoms. Previously it would look at all path components for those strings. Make the C runtime path detection partially a pure function, and move some logic to glibc.zig where it belongs.	2024-10-10 14:21:52 -07:00
Andrew Kelley	31d70cb1e1	link.Elf: avoid needless file system reads in flush() flush() must not do anything more than necessary. Determining the type of input files must be done only once, before flush. Fortunately, we don't even need any file system accesses to do this since that information is statically known in most cases, and in the rest of the cases can be determined by file extension alone. This commit also updates the nearby code to conform to the convention for error handling where there is exactly one error code to represent the fact that error messages have already been emitted. This had the side effect of improving the error message for a linker script parse error. "positionals" is not a linker concept; it is a command line interface concept. Zig's linker implementation should not mention "positionals". This commit deletes that array list in favor of directly making function calls, eliminating that heap allocation during flush().	2024-10-08 18:02:59 -07:00
Andrew Kelley	2c41c453b6	link.Elf: avoid converting rpath data in flush() The goal is to minimize as much as possible how much logic is inside flush(). So let's start by moving out obvious stuff. This data can be preformatted before flush().	2024-10-08 18:02:59 -07:00
Alex Rønne Petersen	d1901c744c	std.Target: Remove Cpu.Arch.dxil and ObjectFormat.dxcontainer. See: https://devblogs.microsoft.com/directx/directx-adopting-spir-v Since we never hooked up the (experimental) DirectX LLVM backend, we've never actually supported targeting DXIL in Zig. With Microsoft moving away from DXIL, that seems very unlikely to change.	2024-09-23 17:17:25 -07:00
Jacob Young	e046977354	codegen: implement output to the `.debug_info` section	2024-09-10 12:27:57 -04:00
mlugg	0fe3fd01dd	std: update `std.builtin.Type` fields to follow naming conventions The compiler actually doesn't need any functional changes for this: Sema does reification based on the tag indices of `std.builtin.Type` already! So, no zig1.wasm update is necessary. This change is necessary to disallow name clashes between fields and decls on a type, which is a prerequisite of #9938.	2024-08-28 08:39:59 +01:00
Jacob Young	8c3f6c72c0	Dwarf: fix and test string format	2024-08-27 02:09:59 -04:00
David Rubin	863f74dcd2	comp: rename `module` to `zcu`	2024-08-25 15:17:21 -07:00
Robin Voetter	b4343074d2	replace Compilation.Emit with std.Build.Cache.Path This type is exactly the same as std.Build.Cache.Path, except for one function which is not used anymore. Therefore we can replace it without consequences.	2024-08-19 19:09:12 +02:00
Robin Voetter	43f73af359	fix various issues related to Path handling in the compiler and std A compilation build step for which the binary is not required could not be compiled previously. There were 2 issues that caused this: - The compiler communicated only the results of the emitted binary and did not properly communicate the result if the binary was not emitted. This is fixed by communicating the final hash of the artifact path (the hash of the corresponding /o/<hash> directory) and communicating this instead of the entire path. This changes the zig build --listen protocol to communicate hashes instead of paths, and emit_bin_path is accordingly renamed to emit_digest. - There was an error related to the default llvm object path when CacheUse.Whole was selected. I'm not really sure why this didn't manifest when the binary is also emitted. This was fixed by improving the path handling related to flush() and emitLlvmObject(). In general, this commit also improves some of the path handling throughout the compiler and standard library.	2024-08-19 19:09:11 +02:00
Jacob Young	ef11bc9899	Dwarf: rework self-hosted debug info from scratch This is in preparation for incremental and actually being able to debug executables built by the x86_64 backend.	2024-08-16 15:22:55 -04:00
Alex Rønne Petersen	6316fd9535	std.Target: Add goff and xcoff to ObjectFormat. Also improve the docs a bit, and handle driverkit and dxil in default().	2024-08-11 20:21:28 -07:00
mlugg	548a087faf	compiler: split Decl into Nav and Cau The type `Zcu.Decl` in the compiler is problematic: over time it has gained many responsibilities. Every source declaration, container type, generic instantiation, and `@extern` has a `Decl`. The functions of these `Decl`s are in some cases entirely disjoint. After careful analysis, I determined that the two main responsibilities of `Decl` are as follows: * A `Decl` acts as the "subject" of semantic analysis at comptime. A single unit of analysis is either a runtime function body, or a `Decl`. It registers incremental dependencies, tracks analysis errors, etc. * A `Decl` acts as a "global variable": a pointer to it is consistent, and it may be lowered to a specific symbol by the codegen backend. This commit eliminates `Decl` and introduces new types to model these responsibilities: `Cau` (Comptime Analysis Unit) and `Nav` (Named Addressable Value). Every source declaration, and every container type requiring resolution (so not including `opaque`), has a `Cau`. For a source declaration, this `Cau` performs the resolution of its value. (When #131 is implemented, it is unsolved whether type and value resolution will share a `Cau` or have two distinct `Cau`s.) For a type, this `Cau` is the context in which type resolution occurs. Every non-`comptime` source declaration, every generic instantiation, and every distinct `extern` has a `Nav`. These are sent to codegen/link: the backends by definition do not care about `Cau`s. This commit has some minor technically-breaking changes surrounding `usingnamespace`. I don't think they'll impact anyone, since the changes are fixes around semantics which were previously inconsistent (the behavior changed depending on hashmap iteration order!). Aside from that, this changeset has no significant user-facing changes. Instead, it is an internal refactor which makes it easier to correctly model the responsibilities of different objects, particularly regarding incremental compilation. The performance impact should be negligible, but I will take measurements before merging this work into `master`. Co-authored-by: Jacob Young <jacobly0@users.noreply.github.com> Co-authored-by: Jakub Konka <kubkon@jakubkonka.com>	2024-08-11 07:29:41 +01:00
Jan Hendrik Farr	ca012e5b69	std.posix: read on timerfd can return error.Canceled	2024-08-08 01:12:26 -07:00

1 2 3 4 5 ...

394 Commits