tvl-depot

Author	SHA1	Message	Date
Vincent Ambo	e8fd6b6734	refactor(server): Change setup to create new storage backends	2019-10-28 22:31:44 +01:00
Vincent Ambo	20e0ca53cb	feat(server): Implement GCS storage backend with new interface Logical implementation is mostly identical to the previous one, but adhering to the new storage.Backend interface.	2019-10-28 22:31:44 +01:00
Vincent Ambo	f7d16c5d45	refactor(server): Introduce pluggable interface for storage backends This abstracts over the functionality of Google Cloud Storage and other potential underlying storage backends to make it possible to replace these in Nixery. The GCS backend is not yet reimplemented.	2019-10-28 22:31:44 +01:00
Vincent Ambo	3a5db4f9f1	refactor(server): Load GCS signing key from service account key The JSON file generated for service account keys already contains the required information for signing URLs in GCS, thus the environment variables for toggling signing behaviour have been removed. Signing is now enabled automatically in the presence of service account credentials (i.e. `GOOGLE_APPLICATION_CREDENTIALS`).	2019-10-27 13:58:04 +01:00
Vincent Ambo	1853c74998	refactor(server): Only compress symlink forest layer once Instead of compressing & decompressing again to get the underlying tar hash, use a similar mechanism as for store path layers for the symlink layer and only compress it once while uploading.	2019-10-11 12:37:00 +01:00
Vincent Ambo	e22ff5d176	fix(server): Use uncompressed tarball hashes in image config Docker expects hashes of compressed tarballs in the manifest (as these are used to fetch from the content-addressable layer store), but for some reason it expects hashes in the configuration layer to be of uncompressed tarballs. To achieve this an additional SHA256 hash is calculcated while creating the layer tarballs, but before passing them to the gzip writer. In the current constellation the symlink layer is first compressed and then decompressed again to calculate its hash. This can be refactored in a future change.	2019-10-11 12:37:00 +01:00
Vincent Ambo	0693e371d6	feat(server): Apply GZIP compression to all image layers This fixes #62	2019-10-11 12:37:00 +01:00
Vincent Ambo	d7ffbbdea4	refactor(server): Use logrus convenience functions for logs Makes use of the `.WithError` and `.WithField` convenience functions in logrus to simplify log statement construction. This has the added benefit of making it easier to correctly log errors.	2019-10-06 23:05:23 +01:00
Vincent Ambo	6f148f789f	refactor(server): Convert existing log entries to structured format This rewrites all existing log statements into the structured logrus format. For consistency, all errors are always logged separately from the primary message in a field called `error`. Only the "info", "error" and "warn" severities are used.	2019-10-06 23:05:23 +01:00
Vincent Ambo	f77c93b6ae	feat(server): Add log level to severity mapping The output format now writes a `severity` field that follows that format that should be recognised by Stackdriver Logging.	2019-10-06 23:05:23 +01:00
Vincent Ambo	6912658c72	feat(server): Use hash of Nixery source as version Uses a hash of Nixery's sources as the version displayed when Nixery launches or logs an error. This makes it possible to distinguish between errors logged from different versions. The source hashes should be reproducible between different checkouts of the same source tree.	2019-10-06 23:05:23 +01:00
Vincent Ambo	95abb1bcde	feat(server): Initial Stackdriver-compatible log formatter This formatter has basic support for the Stackdriver Error Reporting format, but several things are still lacking: * the service version (preferably git commit?) needs to be included in the server somehow * log streams should be split between stdout/stderr as that is how AppEngine (and several other GCP services?) seemingly differentiate between info/error logs	2019-10-06 23:05:23 +01:00
Vincent Ambo	0642f7044d	fix(server): Amend package path for Go tooling compatibility With these changes it is possible to keep Nixery in $GOPATH and build the server in there, while still having things work correctly via Nix.	2019-10-06 23:05:23 +01:00
Vincent Ambo	f4bf3518f6	refactor(server): Replace log calls with logrus This introduces a structured logging library that can be used (next step) to attach additional metadata to log entries.	2019-10-06 23:05:23 +01:00
Vincent Ambo	d9b329ef59	refactor(server): Always include 'cacert' & 'iana-etc' These two packages almost always end up being required by programs, but people don't necessarily consider them. They will now always be added and their popularity is artificially inflated to ensure they end up at the top of the layer list.	2019-10-03 22:50:02 +01:00
Vincent Ambo	9bb6d0ae25	fix(server): Ensure build cache objects are written to GCS Cache writes might not be flushed without this call.	2019-10-03 22:50:02 +01:00
Vincent Ambo	48a5ecda97	feat(server): Order layers in image manifest based on merge rating Image layers in manifests are now sorted in a stable (descending) order based on their merge rating, meaning that layers more likely to be shared between images come first. The reason for this change is Docker's handling of image layers on overlayfs2: Images are condensed into a single representation on disk after downloading. Due to this Docker will constantly redownload all layers that are applied in a different order in different images (layer order matters in imperatively created images), based on something it calls the 'ChainID'. Sorting the layers this way raises the likelihood of a long chain of matching layers at the beginning of an image. This relates to #39.	2019-10-03 22:50:02 +01:00
Vincent Ambo	6b06fe27be	feat(server): Implement creation of layer tarballs in the server This will create, upload and hash the layer tarballs in one disk read.	2019-10-03 22:29:50 +01:00
Vincent Ambo	1124b8c236	fix(server): Do not invoke layer build if no layers are missing This previously invoked a Nix derivation that spent a few seconds on making an empty object in JSON ...	2019-10-03 13:21:04 +01:00
Vincent Ambo	feba42e409	feat(server): Fetch popularity data on launch The last missing puzzle piece for #50!	2019-10-03 13:21:04 +01:00
Vincent Ambo	43a642435b	feat(server): Reimplement local manifest cache backed by files Implements a local manifest cache that uses the temporary directory to cache manifest builds. This is necessary due to the size of manifests: Keeping them entirely in-memory would quickly balloon the memory usage of Nixery, unless some mechanism for cache eviction is implemented.	2019-10-03 13:21:04 +01:00
Vincent Ambo	313e5d08f1	refactor(builder): Streamline layer creation & reintroduce caching The functions used for layer creation are now easier to follow and have clear points at which the layer cache is checked and populated. This relates to #50.	2019-10-03 13:21:04 +01:00
Vincent Ambo	53906024ff	refactor: Remove remaining MD5-hash mentions and computations	2019-10-03 13:21:04 +01:00
Vincent Ambo	f6b40ed6c7	refactor(server): Cache manifest entries for layer builds MD5 hash checking is no longer performed by Nixery (it does not seem to be necessary), hence the layer cache now only keeps the SHA256 hash and size in the form of the manifest entry. This makes it possible to restructure the builder code to perform cache-fetching and cache-populating for layers in the same place.	2019-10-03 13:21:04 +01:00
Vincent Ambo	355fe3f5ec	feat(server): Reintroduce manifest caching to GCS The new builder now caches and reads cached manifests to/from GCS. The in-memory cache is disabled, as manifests are no longer written to local file and the caching of file paths does not work (unless we reintroduce reading/writing from temp files as part of the local cache).	2019-10-03 13:21:04 +01:00
Vincent Ambo	1308a6e1fd	refactor(server): Clean up cache implementation A couple of minor fixes and improvements to the cache implementation.	2019-10-03 13:21:04 +01:00
Vincent Ambo	0698d7f2aa	chore(server): Remove "layer seen" cache This cache is no longer required as it is implicit because the layer cache (mapping store path hashes to layer hashes) implies that a layer has been seen.	2019-10-03 13:21:04 +01:00
Vincent Ambo	64fca61ea1	fix(server): Upload symlink layer created by first Nix build This layer is needed in addition to those that are built in the second Nix build.	2019-10-03 13:21:04 +01:00
Vincent Ambo	f4f2909573	fix(server): Specify correct authentication scope for GCS When retrieving tokens for service service accounts, some methods of retrieval require a scope to be specified.	2019-10-03 13:21:04 +01:00
Vincent Ambo	aa02ae1421	feat(server): Implement new build process core Implements the new build process to the point where it can actually construct and serve image manifests. It is worth noting that this build process works even if the Nix sandbox is enabled! It is also worth nothing that none of the caching functionality that the new build process enables (such as per-layer build caching) is actually in use yet, hence running Nixery at this commit is prone to doing more work than previously. This relates to #50.	2019-10-03 13:21:04 +01:00
Vincent Ambo	17adda0355	fix(server): Minor fixes to updated new builder code	2019-10-03 13:21:04 +01:00
Vincent Ambo	3f40c0a2d2	feat(server): Implement package for creating image manifests The new manifest package creates image manifests and their configuration. This previously happened in Nix, but is now part of the server's workload. This relates to #50.	2019-10-03 13:21:04 +01:00
Vincent Ambo	87e196757b	feat(server): Reimplement creation & uploading of layers The new build process can now call out to Nix to create layers and upload them to the bucket if necessary. The layer cache is populated, but not yet used.	2019-10-03 13:21:04 +01:00
Vincent Ambo	61269175c0	refactor(server): Introduce a state type to carry runtime state The state type contains things such as the bucket handle and Nixery's configuration which need to be passed around in the builder. This is only added for convenience.	2019-10-03 13:21:04 +01:00
Vincent Ambo	6e2b84f475	feat(server): Add cache for layer builds in GCS & local cache This cache is going to be used for looking up whether a layer build has taken place already (based on a hash of the layer contents). See the caching section in the updated documentation for details. Relates to #50.	2019-10-03 13:21:04 +01:00
Vincent Ambo	e60805c9b2	feat(server): Introduce function to hash contents of a layer This creates a cache key which can be used to check if a layer has already been built.	2019-10-03 13:21:04 +01:00
Vincent Ambo	9c3c622403	refactor(server): Expose layer grouping logic via a function Refactors the layer grouping package (which previously compiled to a separate binary) to expose the layer grouping logic via a function instead. This is the next step towards creating layers inside of the server component instead of in Nix. Relates to #50.	2019-10-03 13:21:04 +01:00
Vincent Ambo	8c79d085ae	chore(server): Import layer grouping logic into server component	2019-10-03 13:21:04 +01:00
Vincent Ambo	0000b956bb	feat(server): Log Nix output live during the builds Instead of dumping all Nix output as one at the end of the build process, stream it live as the lines come in. This is a lot more useful for debugging stuff like where manifest retrievals get stuck.	2019-09-21 15:25:28 +01:00
Vincent Ambo	64f74abc4d	feat: Add configuration option for popularity data URL	2019-09-21 12:44:40 +01:00
Vincent Ambo	5a002fe067	refactor(builder): Calculate image cache key only once	2019-09-10 11:32:37 +01:00
Vincent Ambo	4a58b0ab4d	feat(server): Cache built manifests to the GCS bucket Caches manifests under `manifests/$cacheKey` in the GCS bucket and introduces two-tiered retrieval of manifests from the caches (local first, bucket second). There is some cleanup to be done in this code, but the initial version works.	2019-09-10 11:32:37 +01:00
Vincent Ambo	051eb77b3d	refactor(server): Use package source specific cache keys Use the PackageSource.CacheKey function introduced in the previous commit to determine the key at which a manifest should be cached in the local cache. Due to this change, manifests for moving target sources are no longer cached and the recency threshold logic has been removed.	2019-09-10 11:32:37 +01:00
Vincent Ambo	980f5e2187	refactor(server): Move package source management logic to server Introduces three new types representing each of the possible package sources and moves the logic for specifying the package source to the server. Concrete changes: * Determining whether a specified git reference is a commit vs. a branch/tag is now done in the server, and is done more precisely by using a regular expression. * Package sources now have a new `CacheKey` function which can be used to retrieve a key under which a build manifest can be cached if the package source is not a moving target (i.e. a full git commit hash of either nixpkgs or a private repository). This function is not yet used. * Users must now specify a package source, Nixery no longer defaults to anything and will fail to launch if no source is configured.	2019-09-10 11:32:37 +01:00
Vincent Ambo	32b9b5099e	feat(server): Add configuration option for Nix build timeouts Adds a NIX_TIMEOUT environment variable which can be set to a number of seconds that is the maximum allowed time each Nix builder can run. By default this is set to 60 seconds, which should be plenty for most use-cases as Nixery is not expected to be performing builds of uncached binaries in most production cases. Currently the errors Nix throws on a build timeout are not separated from other types of errors, meaning that users will see a generic 500 server error in case of a timeout. This fixes #47	2019-09-02 23:44:57 +01:00
Vincent Ambo	9a95c4124f	fix(server): Sort requested packages in image name & spec Before this change, Nixery would pass on the image name unmodified to Nix which would lead it to cache-bust the manifest and configuration layers for images that are content-identical but have different package ordering. This fixes #38.	2019-08-17 10:31:41 +01:00
Florian Klink	3b65fc8c72	feat(server): add iana-etc and cacert to the shell convenience package These probably should be part of every container image by default, but adding it to the "shell" convenience name probably is our best bet for now.	2019-08-16 22:47:22 +01:00
Vincent Ambo	85b9c30749	chore(server): Add 'go vet' to build process	2019-08-14 20:18:41 +01:00
Vincent Ambo	36d50d1f19	fix(server): Print correct project name during startup They grow up so fast :')	2019-08-14 20:18:41 +01:00
Vincent Ambo	cf227c153f	feat(builder): Implement build cache for manifests & layers Implements a cache that keeps track of: a) Manifests that have already been built (for up to 6 hours) b) Layers that have already been seen (and uploaded to GCS) This significantly speeds up response times for images that are full or partial matches with previous images served by an instance.	2019-08-14 20:18:41 +01:00

1 2

54 commits