alt.hn

3/3/2026 at 4:33:50 AM

Optimizing Recommendation Systems with JDK's Vector API

https://netflixtechblog.com/optimizing-recommendation-systems-with-jdks-vector-api-30d2830401ec

by mariuz

3/5/2026 at 9:30:04 PM

I had success on a similar problem by allocating native buffers for the matrices, then using a basic CUDA call. The actual work was 100x faster than my CPU baseline.

The bottleneck of course was fetching & loading relevant data to memory to start with.

by lukev

3/5/2026 at 7:29:21 PM

"the remaining 2% were large batch requests", [which made up 50% of the work] .. who really watches that many shows on Netflix? What was in those batches, if someone is watching that much, why bother with serendipity at all? Most serendipitous thing you could do is shut off their subscription.

by aberoham