tvl-depot/users/flokli
Florian Klink 46964f6d8f fix(users/flokli/archaeology): don't use file but column compression
ClickHouse also has column compression for Parquet output, configurable
with the output_format_parquet_compression_method setting.

It defaults to lz4, so the previous setting produced a zstd-compressed
parquet file containing lz4-compressed data.

Set output_format_parquet_compression_method to zstd instead, and sort
by timestamp before assembling the parquet file.

The existing files were updated to the same format with the following query:

```
SELECT * FROM file('bucket_logs_2023-11-11*.pq', 'Parquet', 'auto') ORDER BY timestamp ASC INTO OUTFILE 'bucket_logs_2023-11-11.parquet' SETTINGS output_format_parquet_compression_method = 'zstd'
```
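
To check which codec the columns of the rewritten file actually use, a query along these lines may help. This is a sketch that assumes a ClickHouse version shipping the ParquetMetadata input format; it is not part of the change itself.

```
-- Sketch: dump the Parquet metadata of the rewritten file.
-- Assumes the ParquetMetadata input format is available in this ClickHouse version.
-- Each entry under "columns" carries a "compression" field with the codec in use.
SELECT *
FROM file('bucket_logs_2023-11-11.parquet', 'ParquetMetadata')
FORMAT PrettyJSONEachRow
```

After the rewrite, the compression field of each column should report zstd rather than lz4.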

Change-Id: Id63b14c82e7bf4b9907a500528b569a51e277751
Reviewed-on: https://cl.tvl.fyi/c/depot/+/10008
Reviewed-by: raitobezarius <tvl@lahfa.xyz>
Tested-by: BuildkiteCI
2023-11-11 19:49:13 +00:00
archeology fix(users/flokli/archaeology): don't use file but column compression 2023-11-11 19:49:13 +00:00
nixos feat(users/flokli/nixos/archeology-ec2): add parse-bucket-logs 2023-11-11 12:24:23 +00:00
presentations feat(users/flokli): add ASG Lightning talk presentation 2023-10-16 11:49:06 +00:00
keys.nix feat(ops/nixos/whitby): add flokli user 2021-03-26 20:31:48 +00:00
OWNERS chore(gerrit): migrate OWNERS files to code-owners style 2022-09-19 11:13:28 +00:00