[STORAGE USE REDUCTION] Snapshot page compression #1594

Open
cloutiertyler opened this issue Aug 15, 2024 · 5 comments

@cloutiertyler (Contributor)

No description provided.

@cloutiertyler (Contributor, Author)

Need to ask Phoebe whether we have sufficient metadata to determine if these pages are compressed or not.

@gefjon (Contributor) commented Nov 20, 2024

Filenames of snapshot pages are semantically important. It would be easy to recognize {HASH} as uncompressed and {HASH}.zip as compressed.
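For illustration, a hypothetical check along those lines, keyed purely off the proposed filename convention (`page_is_compressed` is a made-up name, not an existing SpacetimeDB API):

```rust
use std::path::Path;

/// Hypothetical helper: decide whether a snapshot page file is compressed
/// from its name alone, per the proposed `{HASH}` / `{HASH}.zip` convention.
fn page_is_compressed(path: &Path) -> bool {
    path.extension().map_or(false, |ext| ext == "zip")
}
```

For example, `page_is_compressed(Path::new("ab12cd"))` is false while `page_is_compressed(Path::new("ab12cd.zip"))` is true, so no extra metadata is needed beyond the filename itself.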

@gefjon (Contributor) commented Nov 20, 2024

MVP / definition of done as I see it:

  • When taking a snapshot, unconditionally compress all pages and blobs before writing them to disk.
  • When restoring a snapshot, decompress the pages and blobs while reading them back into memory (a rough sketch of both paths follows this list).
  • Benchmark to find out how much slower this is than the uncompressed version, and determine whether that slowdown is acceptable.
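A minimal sketch of the compress-on-write / decompress-on-read pair, assuming the flate2 crate's gzip streams as a stand-in codec (whether the real on-disk format would be zip, gzip, or something else is part of what the benchmark should inform). `write_page` and `read_page` are hypothetical names, not existing SpacetimeDB APIs:

```rust
use std::fs::File;
use std::io::{self, Read, Write};
use std::path::Path;

use flate2::read::GzDecoder;
use flate2::write::GzEncoder;
use flate2::Compression;

/// Compress a page's bytes and write them to `path` in one shot.
fn write_page(path: &Path, page: &[u8]) -> io::Result<()> {
    let file = File::create(path)?;
    let mut encoder = GzEncoder::new(file, Compression::default());
    encoder.write_all(page)?;
    encoder.finish()?;
    Ok(())
}

/// Read a compressed page back into memory, decompressing as we go.
fn read_page(path: &Path) -> io::Result<Vec<u8>> {
    let file = File::open(path)?;
    let mut decoder = GzDecoder::new(file);
    let mut page = Vec::new();
    decoder.read_to_end(&mut page)?;
    Ok(page)
}
```

Since both paths are streaming, the benchmark can vary the compression level (`Compression::fast()` vs. `Compression::best()`) to trade snapshot size against the slowdown being measured.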

Things we could do if the slowdown from the above solution is too high:

  • After taking a snapshot, compress all previous snapshots. Whether each snapshot gets its own archive, or the snapshots get added into a single big archive via something like zip -u, requires experimentation.
  • After taking a snapshot, go into the previous "parent" snapshot and compress any of its pages which are not also present in the new "child" snapshot.
    • Additional complexity: when compressing such a page, you have to examine the chain of "ancestor" snapshots before that one, which may also contain the same page, and fix them up so that they all hold hardlinks to the same compressed archive (see the sketch below). Otherwise you don't save any disk space, as the "grandparent" may still contain an uncompressed version of the page that you compressed within the "parent."
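A rough sketch of that hardlink fix-up, with all names hypothetical: `compressed` is the freshly written `{HASH}.zip` in the parent snapshot, and `ancestors` are the directories of older snapshots that may still hold an uncompressed copy of the same page.

```rust
use std::fs;
use std::io;
use std::path::Path;

/// Hypothetical fix-up: walk the ancestor snapshots and replace any
/// uncompressed copy of the page with a hardlink to the one compressed
/// archive already written into the parent snapshot.
fn fixup_ancestors(hash: &str, compressed: &Path, ancestors: &[&Path]) -> io::Result<()> {
    for dir in ancestors {
        let uncompressed = dir.join(hash);
        if uncompressed.exists() {
            // Drop the uncompressed copy; without this, the grandparent
            // keeps the full-size page and we save no disk space.
            fs::remove_file(&uncompressed)?;
            // Point the ancestor at the shared compressed archive.
            fs::hard_link(compressed, dir.join(format!("{hash}.zip")))?;
        }
    }
    Ok(())
}
```

Because every snapshot in the chain then shares one inode for the page, the compressed bytes are stored exactly once regardless of how many ancestors referenced it.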

@bfops (Collaborator) commented Nov 20, 2024

Leaving this as a P1 for devops's sake, because disks keep filling up.

@bfops (Collaborator) commented Nov 20, 2024

The naive unconditional-compression approach would not be backwards-compatible, but we can do this backwards-compatibly (e.g. by adding .zip to the filename).
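A sketch of what that backwards-compatible read path might look like, under the same assumptions as the earlier sketches (gzip via flate2 as a stand-in codec, hypothetical helper names): prefer the compressed `{HASH}.zip` if present, otherwise fall back to the legacy uncompressed `{HASH}`.

```rust
use std::fs;
use std::io::{self, Read};
use std::path::Path;

use flate2::read::GzDecoder;

/// Hypothetical backwards-compatible reader: new snapshots store
/// `{HASH}.zip`, while snapshots written before compression existed
/// still store a bare `{HASH}` file.
fn read_page_compat(dir: &Path, hash: &str) -> io::Result<Vec<u8>> {
    let compressed = dir.join(format!("{hash}.zip"));
    if compressed.exists() {
        let mut page = Vec::new();
        GzDecoder::new(fs::File::open(compressed)?).read_to_end(&mut page)?;
        Ok(page)
    } else {
        // Legacy path: the page predates compression, read it verbatim.
        fs::read(dir.join(hash))
    }
}
```

With a fallback like this, old snapshots stay readable without any migration step; new writers simply stop producing the bare `{HASH}` form.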
