stdenv: allow for jobservers across multiple nix builds #314888

RaitoBezarius · 2024-05-26T16:48:27Z

Description of changes

Retake of #143820 where I unfortunately fucked up trying to debug kernel issues and btrfs.

Original motivations by @pennae:

make -jN -lN in stdenv is a very blunt instrument. it works well when max-jobs=1, but as nix-level paralellism increases it becomes increasingly deficient. starting from a low-load situation we start max-jobs * N compilers, loadavg goes through the roof, the -lN load limit kicks in and inhibits new compilers starting until loadavg has fallen below N—at which point all make instances spawn a lot of new compilers and loadavg goes through the roof again. this oscillation leaves the system underutilized in low phases and overcommitted in high phases.

testing the current stdenv against a jobserver with 26 tokens on a 12C/24T machine shows that parallel builds of llvm_{8..11} run about 7% faster (35:52min for stdenv, 33:30min with jobserver), a larger build of llvm{5..13} is about about 11% faster (1:27h for stdenv, 1:17h with jobserver). (removing the -l from stdenv also improves utilization but is less efficient. preliminary testing here shows that -l${1.5 * N} may be a good alternative to -lN as used currently, #141266 could be a good vector to go for that instead of this whole mess. [more testing says that -l2N would be a minimum to get better utilization, but so far every -l setting we've tried has produced some underutilization except excessive large numbers like 6N or higher])

nothing in here should be regarded as a final suggestion in any way, it's more of a "hey look, this might just work". as such it's extremely rough around the edges, eg to use the jobserver the experimenter currently has to bring a /jobserver fifo filled with tokens into the nix sandbox:

nix-build -E 'with import ./. {}; pkgs.callPackage ./pkgs/os-specific/linux/nixos-jobserver {}'
touch /tmp/jobserver
sudo ./result -t 24 -g nixbld /tmp/jobserver&
nix build --extra-sandbox-paths "/jobserver=/tmp/jobserver" #...

is this something worth pursuing? a 10% speedup for hydra does seem tempting

todos before this is more generally usable:

re-add $NIX_BUILD_CORES support (ioctl on the jobserver fd should do it)
maybe port from cuse to fuse to not require root for everything (mmap problems may be solved by reporting jobserver file size as 0 at all times, needs testing)
usability of tools (error messages, runtime configuration, etc)
add support for other build systems (cargo can honor MAKEFLAGS, maybe others too)
nixos module
make configurable which env var(s) jscall adds jobserver information to
metrics?
<your suggestion here>

Things done

pennae · 2024-05-27T02:32:42Z

doc/stdenv/stdenv.chapter.md

@@ -1355,6 +1355,30 @@ name="/nix/store/9s9r019176g7cvn2nvcw41gsp862y6b4-coreutils-8.24"
 someVar=$(stripHash $name)
 ```

+### `runInJobServer` \<command\> \-\-\-\- \<defArgs\> \-\-\-\- \<args\> {#fun-runInJobServer}


Suggested change

### `runInJobServer` \<command\> \-\-\-\- \<defArgs\> \-\-\-\- \<args\> {#fun-runInJobServer}

### `runInJobserver` \<command\> \-\-\-\- \<defArgs\> \-\-\-\- \<args\> {#fun-runInJobServer}

(multiple places)

pennae · 2024-05-27T02:33:35Z

nixos/modules/services/misc/nixos-jobserver.nix

+  config = mkIf cfg.enable {
+    nix.sandboxPaths = [ "/build-support/jobserver=${tokenFile}?" ];
+
+    systemd.services.nixos-jobserver = {


running this as root is most unwise in practice but running it as another user seems to be nearly impossible. when fuse doesn't fuck us over systemd does.

pennae · 2024-05-27T02:34:04Z

pkgs/by-name/ni/nixos-jobserver/nixos-jobserver.cpp

+
+struct stat jobserver_st = {
+    .st_ino = FUSE_ROOT_ID,
+    .st_mode = S_IFREG | 0660,


Suggested change

.st_mode = S_IFREG | 0660,

.st_mode = S_IFREG | 0666,

won't work otherwise

pennae · 2024-05-27T02:34:49Z

pkgs/stdenv/generic/setup.sh

+        if [[ -n "$enableParallelChecking" ]]; then
+            runInJobserver \
+                make ---- \
+                -j${NIX_BUILD_CORES} -l${NIX_BUILD_CORES} ---- \


Suggested change

-j${NIX_BUILD_CORES} -l${NIX_BUILD_CORES} ---- \

-j${NIX_BUILD_CORES} ---- \

multiple places

pennae · 2024-05-27T19:06:39Z

cargo apparently no longer accepts jobserver fds that aren't pipes. that's somewhat problematic.

initially only make and cargo support using the jobserver. other build systems may follow suit later.

Co-authored-by: Raito Bezarius <[email protected]>

aviallon · 2024-07-30T19:11:01Z

Is this work dead? It looked very promising.

yu-re-ka · 2024-08-12T09:56:10Z

I have some patches lying around locally to make it work with ninja, so anyone interested in picking this up, feel free to write me about it.

RaitoBezarius requested review from zowoq, winterqt, figsoda and Ericson2314 as code owners May 26, 2024 16:48

lilyinstarlight added the significant Novel ideas, large API changes, notable refactorings, issues with RFC potential, etc. label May 26, 2024

ofborg bot added 10.rebuild-darwin-stdenv This PR causes stdenv to rebuild 10.rebuild-linux-stdenv This PR causes stdenv to rebuild 8.has: package (new) This PR adds a new package 10.rebuild-darwin: 501+ 10.rebuild-darwin: 5001+ 10.rebuild-linux: 501+ 10.rebuild-linux: 5001+ labels May 26, 2024

pennae requested changes May 27, 2024

View reviewed changes

stdenv: allow for jobservers across multiple nix builds

742725b

initially only make and cargo support using the jobserver. other build systems may follow suit later.

pennae force-pushed the stdenv-jobserver branch from 8a4a67a to cd4036f Compare May 27, 2024 19:27

nixos/nixos-jobserver: init

e97220e

Co-authored-by: Raito Bezarius <[email protected]>

pennae force-pushed the stdenv-jobserver branch from cd4036f to e97220e Compare May 28, 2024 19:49

h7x4 added the 8.has: module (new) This PR adds a module in `nixos/` label Jun 1, 2024

wegank added the 2.status: merge conflict This PR has merge conflicts with the target branch label Sep 10, 2024

wegank added the 2.status: stale https://github.com/NixOS/nixpkgs/blob/master/.github/STALE-BOT.md label Jan 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

stdenv: allow for jobservers across multiple nix builds #314888

stdenv: allow for jobservers across multiple nix builds #314888

RaitoBezarius commented May 26, 2024

pennae May 27, 2024

pennae May 27, 2024

pennae May 27, 2024

pennae May 27, 2024

pennae commented May 27, 2024

aviallon commented Jul 30, 2024

yu-re-ka commented Aug 12, 2024

	### `runInJobServer` \<command\> \-\-\-\- \<defArgs\> \-\-\-\- \<args\> {#fun-runInJobServer}
	### `runInJobserver` \<command\> \-\-\-\- \<defArgs\> \-\-\-\- \<args\> {#fun-runInJobServer}

	-j${NIX_BUILD_CORES} -l${NIX_BUILD_CORES} ---- \
	-j${NIX_BUILD_CORES} ---- \

stdenv: allow for jobservers across multiple nix builds #314888

Are you sure you want to change the base?

stdenv: allow for jobservers across multiple nix builds #314888

Conversation

RaitoBezarius commented May 26, 2024

Description of changes

Original motivations by @pennae:

Things done

pennae May 27, 2024

Choose a reason for hiding this comment

pennae May 27, 2024

Choose a reason for hiding this comment

pennae May 27, 2024

Choose a reason for hiding this comment

pennae May 27, 2024

Choose a reason for hiding this comment

pennae commented May 27, 2024

aviallon commented Jul 30, 2024

yu-re-ka commented Aug 12, 2024