> Configured a block-level in-memory disk accelerator / cache (fs operations at ...

jacobwg · 2025-03-28T18:24:32 1743186272

Exactly, a ramdisk-backed writeback cache for the root volume for Linux. For macOS we wrote a custom nbd filter to achieve the same thing.

philsnow · 2025-03-28T18:33:40 1743186820

Forgive me, I'm not trying to be argumentative, but doesn't Linux (and presumably all modern OSes) already have a ram-backed writeback cache for filesystems? That sounds exactly like the page cache.

jacobwg · 2025-03-28T19:17:48 1743189468

No worries, entirely valid question. There may be ways to tune page cache to be more like this, but my mental model for what we've done is effectively make reads and writes transparently redirect to the equivalent of a tmpfs, up to a certain size. If you reserve 2GB of memory for the cache, and the CI job's read and written files are less than 2GB, then _everything_ stays in RAM, at RAM throughput/IOPS. When you exceed the limit of the cache, blocks are moved to the physical disk in the background. Feels like we have more direct control here than page cache (and the page cache is still helping out in this scenario too, so it's more that we're using both).

philsnow · 2025-03-28T21:32:40 1743197560

> reads and writes transparently redirect to the equivalent of a tmpfs, up to a certain size

The last bit (emphasis added) sounds novel to me, I don't think I've heard before of anybody doing that. It sounds like an almost-"free" way to get a ton of performance ("almost" because somebody has to figure out the sizing. Though, I bet you could automate that by having your tool export a "desired size" metric that's equal to the high watermark of tmpfs-like storage used during the CI run)

3np · 2025-03-29T07:31:04 1743233464

Just to add, my understanding is that unless you also tune your workload writes, the page cache will not skip backing storage for writes, only for reads. So it does make sense to stack both if you're fine with not being able to rely on peristence of those writes.

nine_k · 2025-03-28T20:43:30 1743194610

No, it's more like swapping pages to disk when RAM is full, or like using RAM when the L2 cache is full.

Linux page cache exists to speed up access to the durable store which is the underlying block device (NVMe, SSD, HDD, etc).

The RAM-backed block device in question here is more like tmpfs, but with an ability to use the disk if, and only if, it overflows. There's no intention or need to store its whole contents on the durable "disk" device.

Hence you can do things entirely in RAM as long as your CI/CD job can fit all the data there, but if it can't fit, the job just gets slower instead of failing.

trillic · 2025-03-28T18:52:26 1743187946

If you clearly understand your access patterns and memory requirements, you can often outperform the default OS page cache.

Consider a scenario where your VM has 4GB of RAM, but your build accesses a total of 6GB worth of files. Suppose your code interacts with 16GB of data, yet at any moment, its active working set is only around 2GB. If you preload all Docker images at the start of your build, they'll initially be cached in RAM. However, as your build progresses, the kernel will begin evicting these cached images to accommodate recently accessed data, potentially even files used infrequently or just once. And that's the key bit, to force caching of files you know are accessed more than once.

By implementing your own caching layer, you gain explicit control, allowing critical data to remain persistently cached in memory. In contrast, the kernel-managed page cache treats cached pages as opportunistic, evicting the least recently used pages whenever new data must be accommodated, even if this new data isn't frequently accessed.

philsnow · 2025-03-28T21:33:53 1743197633

> If you clearly understand your access patterns and memory requirements, you can often outperform the default OS page cache.

I believe various RDBMSs bypass the page cache and use their own strategies for managing caching if you give them access to raw block devices, right?

inkyoto · 2025-03-28T23:32:07 1743204727

That is true and correct, except that Linux does not have raw devices, and O_DIRECT on a file is not a complete replacement for the raw devices (the buffer cache still gets involved as well as the file system).