CXL in Cloud Practice: Practical Lessons for Incrementally Scaling Deployment

  • Karthik Kumar
  • Midhul Vuppalapati
  • Chet Douglas
  • Jesse Sathre
  • Ian N. Robinson
  • Prateek Tandon
  • Mark D. Hill

IEEE Transactions on Computers, Vol 75: pp. 1234-1246

Publication

This paper explores learnings from first-generation Compute Express Link (CXL) memory expansion to accelerate CXL’s journey to broad, robust use. While broad adoption will be a long journey similar to that of RDMA, we argue that the first step—CXL.mem expansion—is viable on today’s hardware. Through an end-to-end analysis, we revisit common showstoppers: we decompose memory access latency and show that CPU and DRAM internals, rather than the CXL protocol, dominate latency and variability, and we demonstrate how system slack absorbs link error rates above nominal specifications. Along the way, we distill practical guidance on device validation, monitoring, failure modes, security, and multi-tenant interference, and we outline a pragmatic adoption pathway: solidify robust expansion first, prototype micro-pooling next, and move to selective sharing as the ecosystem matures.