fetchTree: Allow fetching plain files #6548

Merged
merged 1 commit into from
May 25, 2022
Conversation

thufschmitt
Member

Replace the tarball fetcher with a more generic `url` one, mostly similar except that it takes an extra `unpack` parameter (`false` by default) to specify whether the target is an archive to extract or a plain file.

Also adds a few convolutions for backwards compatibility:

  • The `tarball` fetcher type is kept. It’s now an alias for `url`, but with `unpack` defaulting to `true`.
  • Flake references that used to resolve to a tarball fetcher (i.e. `http(s)://` and `file://` URLs whose path ends with a known archive extension) still do, meaning that
    1. they are still extracted by default (which is both a backwards-compatibility measure and very handy), and
    2. the generated lockfiles remain compatible with older Nix versions that don’t know of the `url` fetcher.

Fix #3785

Originally taken from #4153 (although most of it has been rewritten)

/cc @tomberek

@thufschmitt
Member Author

One thing I’m really unhappy with is the url name. I’ve taken it for consistency with fetchurl, but it’s a highly vague and overloaded term, so I’d happily accept any suggestion for a better one :)

@edolstra
Member

I would remove the "unpack" parameter and have two different input types: "tarball" and "file". Calling it "url" is not very meaningful since most fetchers use a URL, they just differ in what they do with the URL (e.g. in this case, unpack a tarball, or use a plain file).

@thufschmitt
Member Author

I would remove the "unpack" parameter and have two different input types: "tarball" and "file".

That’s actually what I had initially. I moved away from it because it made the URL parsing slightly weird: the tarball URL parser accepted URLs ending with a known archive format, while the file parser accepted every curl-style URL, meaning that they overlapped and only worked properly because they were registered in the right order.

… but now that I write it down, it feels much less dirty than the weird semantics of the current PR, so you’re right, I’ll switch back to that 🙂

Add a new `file` fetcher type, which will fetch a plain file over
http(s), or from the local disk.

Because plain `http(s)://` or `file://` URLs can already correspond to
`tarball` inputs (if the path ends with a known archive extension),
the URL parsing logic is a bit convoluted:

- `{http,https,file}://` URLs will be interpreted as either a `tarball` or a
  `file` input, depending on the extension of the path part (so
  `https://foo.com/bar` will be a `file` input and
  `https://foo.com/bar.tar.gz` a `tarball` input)
- `file+{something}://` URLs will be interpreted as `file` URLs (with
  the `file+` part removed)
- `tarball+{something}://` URLs will be interpreted as `tarball` URLs (with
  the `tarball+` part removed)

Fix #3785

Co-Authored-By: Tony Olagbaiye <[email protected]>
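
The resolution rules in the commit message above can be sketched in Python. This is an illustrative model of the documented behaviour, not Nix's actual C++ implementation; the function name is made up, and the extension list mirrors the one documented elsewhere in this PR:

```python
# Illustrative sketch of the URL -> input-type resolution described above.
# Not Nix's actual code; names here are hypothetical.

ARCHIVE_EXTENSIONS = (
    ".zip", ".tar", ".tgz", ".tar.gz", ".tar.xz", ".tar.bz2", ".tar.zst",
)

def resolve_input_type(url: str) -> tuple[str, str]:
    """Return (input_type, effective_url) for a flake reference URL."""
    # An explicit `file+` or `tarball+` prefix wins, and is stripped.
    for prefix in ("file", "tarball"):
        if url.startswith(prefix + "+"):
            return prefix, url[len(prefix) + 1:]
    # Otherwise, decide based on the extension of the path part.
    scheme, _, rest = url.partition("://")
    if scheme in ("http", "https", "file"):
        path = rest.split("?", 1)[0].split("#", 1)[0]
        if path.endswith(ARCHIVE_EXTENSIONS):
            return "tarball", url
        return "file", url
    raise ValueError(f"unsupported scheme: {scheme}")

# Examples from the commit message:
print(resolve_input_type("https://foo.com/bar"))          # ('file', 'https://foo.com/bar')
print(resolve_input_type("https://foo.com/bar.tar.gz"))   # ('tarball', 'https://foo.com/bar.tar.gz')
print(resolve_input_type("tarball+https://foo.com/bar"))  # ('tarball', 'https://foo.com/bar')
```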
@thufschmitt
Member Author

OK, I’ve made them separate input types, and added the `file+{blah}` and `tarball+{blah}` URL schemes to resolve the ambiguity if need be.

@thufschmitt thufschmitt requested a review from edolstra May 19, 2022 16:27
@tomberek
Contributor

Does a “file” type need to have the “flake” parameter? Or can it default to false, because if you don’t unpack it can’t be a flake at all? Perhaps the file itself could be a raw flake.nix file, but that seems like too rare an edge case; I’d prefer the simpler situation where “file+” by default knows it cannot be a flake.

@thufschmitt
Member Author

Does a “file” type need to have the “flake” parameter? Or can it default to false, because if you don’t unpack it can’t be a flake at all? Perhaps the file itself could be a raw flake.nix file, but that seems like too rare an edge case; I’d prefer the simpler situation where “file+” by default knows it cannot be a flake.

Ideally it probably shouldn’t indeed. In practice I couldn’t find an easy and clean way to do it ;)

Comment on lines +184 to +187
In URL form, the schema must be `tarball+http://`, `tarball+https://` or `tarball+file://`.
If the extension corresponds to a known archive format (`.zip`, `.tar`,
`.tgz`, `.tar.gz`, `.tar.xz`, `.tar.bz2` or `.tar.zst`), then the `tarball+`
can be dropped.
Member


Isn't that change breaking backwards compatibility?

Member Author


Nope, on the contrary: currently, http(s)/file URLs are required to have such an extension and are treated as tarballs. The new behavior makes it so that everything that was already accepted still is, with the same semantics, and the rest is treated as `file` URLs.

Member Author


The problematic bit, though, is that adding a new tarball extension would break backwards compatibility a bit: with this change, https://foo.com/bar.tar.lzma is fetched as a plain file, and if we wanted to add .tar.lzma as a recognized archive extension, it would become a directory tarball.


* `file`: Plain files or directory tarballs, either over http(s) or from the local
disk.
Member


Might as well rename the function to builtins.fetch if it downloads arbitrary things, even non-tree ones.

Member Author


Might be an idea, indeed :)

I’d leave that for later though if you’re OK with that


std::pair<StorePath, Input> fetch(ref<Store> store, const Input & input) override
{
    // Fetch the URL as a plain file and add it to the store.
    auto file = downloadFile(store, getStrAttr(input.attrs, "url"), input.getName(), false);
Member


A problem with the use of downloadFile() is that it uses FileIngestionMethod::Flat instead of FileIngestionMethod::Recursive. Currently it's assumed that all flake inputs use recursive+sha256 with a name of "source". This allows inputs to be substituted using the narHash attribute in the lock file (see Input::computeStorePath()). However, the lazy trees branch will probably remove the ability to substitute inputs anyway...

Member Author


Oh indeed, I didn’t notice the Flat here. Would it be an issue to make downloadFile use Recursive? AFAIK none of the other call sites really care about how the result is hashed, since it’s only used internally by the fetchers and doesn’t leak outside.

(Alternatively I could just wrap that by adding an extra step that would copy the output to a Recursive CA location, but if that can be avoided it’s even better)
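
To illustrate why the ingestion method matters in the exchange above: a flat hash digests the raw file bytes, while a recursive hash digests the file's NAR serialization, so the two never agree for the same file. A rough Python sketch, where the NAR encoder handles only a single non-executable regular file and is a simplification of Nix's actual archive format:

```python
# Hedged sketch: FileIngestionMethod::Flat vs ::Recursive, approximated in
# Python. Flat hashes the raw bytes; Recursive hashes the NAR serialization.
import hashlib
import struct

def nar_string(data: bytes) -> bytes:
    # NAR framed string: 64-bit little-endian length, then the contents
    # zero-padded to a multiple of 8 bytes.
    pad = (8 - len(data) % 8) % 8
    return struct.pack("<Q", len(data)) + data + b"\x00" * pad

def nar_regular_file(contents: bytes) -> bytes:
    # Simplified NAR of one non-executable regular file:
    # "nix-archive-1" "(" "type" "regular" "contents" <data> ")"
    return b"".join(nar_string(t) for t in (
        b"nix-archive-1", b"(", b"type", b"regular",
        b"contents", contents, b")",
    ))

data = b"hello\n"
flat_hash = hashlib.sha256(data).hexdigest()                   # Flat ingestion
nar_hash = hashlib.sha256(nar_regular_file(data)).hexdigest()  # Recursive ingestion
print(flat_hash != nar_hash)  # True: the two methods yield different hashes
```

This is why a lock file recording a `narHash` (recursive+sha256) cannot be used to substitute an input that was ingested flat, as the review comment points out.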

@edolstra edolstra merged commit 89a8955 into master May 25, 2022
@edolstra edolstra deleted the file-fetcher branch May 25, 2022 13:29
@nixos-discourse

This pull request has been mentioned on NixOS Discourse. There might be relevant details there:

https://discourse.nixos.org/t/nix-2-9-0-released/19407/6

@nixos-discourse

This pull request has been mentioned on NixOS Discourse. There might be relevant details there:

https://discourse.nixos.org/t/tweag-nix-dev-update-31/19481/1

@matthewbauer
Member

PR to add support to flake-compat:

edolstra/flake-compat#44

Successfully merging this pull request may close these issues.

flakes: support downloading files (as opposed to github, git and tarball types)
7 participants