Gigabyte Technology Adopts CXL Memory Pooling for AI Servers
Gigabyte Technology is integrating Compute Express Link (CXL) memory pooling into its AI server infrastructure to address hardware bottlenecks and enhance performance for large-scale AI workloads.

Gigabyte Technology is incorporating Compute Express Link (CXL) memory pooling to enhance the efficiency and scalability of AI servers. This initiative aims to address the limitations of current compute architectures that struggle with the memory demands of increasingly complex AI models, such as Meta's 405-billion-parameter Llama 3.1.
The traditional approach, where processors and accelerators rely on isolated local memory, often leads to data transfer inefficiencies and performance bottlenecks. CXL technology introduces a shared memory pool, allowing different computing components to dynamically access and share memory resources as needed. This reduces latency and improves overall resource utilization, crucial for efficiently processing vast datasets.
Gigabyte is developing servers that leverage this technology to provide a unified memory resource. This approach can significantly boost memory utilization, potentially by up to 50%, and offers high throughput capabilities, essential for data-intensive AI training, inference, and large-scale simulations. CXL 3.0, for example, provides 128 GB/s of bidirectional bandwidth.
By optimizing data movement and resource allocation, CXL memory pooling not only accelerates AI workloads but also contributes to more sustainable data center operations through improved energy efficiency. Gigabyte's adoption of CXL signals a move towards more flexible and powerful computing infrastructure for the evolving AI landscape.