fix(tvix/castore): ensure all directories are present during ingestion

`ingest_entries` requires that all directories referenced by entries in
the ingestion stream have an explicit entry in the stream.

For example, if the stream contains a file with path `foo/bar`, there
must be an entry that comes later in the stream for the directory `foo`.

Change-Id: I61b4fbbb73ea7278715e04271d8073b484e05e61
Reviewed-on: https://cl.tvl.fyi/c/depot/+/11488
Autosubmit: Connor Brewster <cbrewster@hey.com>
Reviewed-by: flokli <flokli@flokli.de>
Tested-by: BuildkiteCI
Reviewed-by: raitobezarius <tvl@lahfa.xyz>
This commit is contained in:
Connor Brewster 2024-04-20 09:43:43 -05:00 committed by clbot
parent 3107961428
commit 18ab59ed70

View file

@ -42,6 +42,9 @@ pub mod fs;
/// - The last entry must be the root node which must have a single path component. /// - The last entry must be the root node which must have a single path component.
/// - Every entry should have a unique path, and only consist of normal components. /// - Every entry should have a unique path, and only consist of normal components.
/// This means, no windows path prefixes, absolute paths, `.` or `..`. /// This means, no windows path prefixes, absolute paths, `.` or `..`.
/// - All referenced directories must have an associated directory entry in the stream.
/// This means if there is a file entry for `foo/bar`, there must also be a `foo` directory
/// entry.
/// ///
/// Internally we maintain a [HashMap] of [PathBuf] to partially populated [Directory] at that /// Internally we maintain a [HashMap] of [PathBuf] to partially populated [Directory] at that
/// path. Once we receive an [IngestionEntry] for the directory itself, we remove it from the /// path. Once we receive an [IngestionEntry] for the directory itself, we remove it from the
@ -141,6 +144,11 @@ where
.add(node); .add(node);
}; };
assert!(
directories.is_empty(),
"Tvix bug: left over directories after processing ingestion stream"
);
// if there were directories uploaded, make sure we flush the putter, so // if there were directories uploaded, make sure we flush the putter, so
// they're all persisted to the backend. // they're all persisted to the backend.
if let Some(mut directory_putter) = maybe_directory_putter { if let Some(mut directory_putter) = maybe_directory_putter {