3/20/2026 at 7:59:11 AM
Oh neat, a post I actually know something about! I worked a lot on userfaultfd performance for GCE's live migration post-copy a couple of years ago. Or, more specifically, I worked on mechanisms to avoid it entirely: due to lock contention in the kernel, faults become veeeerry slow as the number of vcpus scales, and as it happens, VMs these days can have a lot of vcpus.
by anlsh
3/20/2026 at 11:29:14 AM
That's very interesting! I was noticing page fault storms on live migrations as well, and I wonder if that's what you were running into / mentioning here regarding the lock contention.
by shayonj
3/20/2026 at 11:42:23 AM
[dead]
by samsudin