New technology enables GPUs to use PCIe-attached memory for expanded capacity

In short: GPUs have memory limitations when facing the demands of AI and HPC applications. There are ways around this bottleneck, but the solutions can be expensive and cumbersome. Now, a startup headquartered in Daejeon, South Korea, has developed a new approach: using PCIe-attached memory to expand capacity. Developing this solution required jumping through many technical hoops, and there are still challenges ahead. In particular, will AMD, Intel, and Nvidia support the technology?

Memory requirements stemming from advanced datasets for AI and HPC applications often swamp the memory built into a GPU. Expanding that memory has typically meant installing expensive high-bandwidth memory, which often requires changes to the existing GPU architecture or software.

One solution to this bottleneck is being offered by Panmnesia, a company backed by South Korea’s KAIST research institute, which has introduced new tech that allows GPUs to access system memory directly through a Compute Express Link (CXL) interface. Essentially, it enables GPUs to use system memory as an extension of their own memory.

Called CXL GPU Image, this PCIe-attached memory has a double-digit nanosecond latency that’s significantly lower than that of traditional SSDs, the company says.

Panmnesia had to overcome several technical challenges to develop this system.

CXL is a protocol that works on top of a PCIe link, but the technology has to be recognized by an ASIC and its subsystem. In other words, one cannot simply add a CXL controller to the tech stack, as there is no CXL logic fabric or subsystem that supports DRAM and/or SSD endpoints in GPUs.

Also, GPU cache and memory subsystems do not recognize any expansions except unified virtual memory (UVM), which is not fast enough for AI or HPC. In tests by Panmnesia, UVM performed the worst across all tested GPU kernels. CXL, however, provided direct access to expanded storage via load/store instructions, eliminating the issues hampering UVM, such as overhead from host runtime intervention during page faults and transferring data at the page level.
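
To give a rough sense of why page-level transfers hurt, here is a minimal back-of-the-envelope sketch in Python. It is not Panmnesia's benchmark and does not reproduce its results: the 4 KiB page size, the 64-byte cache line, and the function names are illustrative assumptions, and it models only the worst case where every access touches a different, not-yet-resident page. It also ignores the latency of host runtime intervention on page faults, which the article names as a separate cost.

```python
# Illustrative assumptions, not figures from Panmnesia or any GPU vendor.
PAGE_BYTES = 4096  # assumed page-migration granularity
LINE_BYTES = 64    # assumed cache-line size for a load/store access


def bytes_moved(num_accesses: int, granularity: int) -> int:
    """Bytes transferred if each access pulls in one distinct unit of `granularity` bytes."""
    return num_accesses * granularity


def amplification(num_accesses: int) -> float:
    """Ratio of page-granular traffic to cache-line-granular traffic (sparse worst case)."""
    page_traffic = bytes_moved(num_accesses, PAGE_BYTES)
    line_traffic = bytes_moved(num_accesses, LINE_BYTES)
    return page_traffic / line_traffic


if __name__ == "__main__":
    n = 1000  # hypothetical number of sparse accesses
    print(bytes_moved(n, PAGE_BYTES), "bytes moved at page granularity")
    print(bytes_moved(n, LINE_BYTES), "bytes moved at cache-line granularity")
    print(amplification(n), "x more traffic at page granularity")
```

Under these assumptions, sparse accesses move 64 times more data at page granularity than at cache-line granularity. Workloads with good locality would see a much smaller gap.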

What Panmnesia developed in response is a series of hardware layers that support all of the key CXL protocols, consolidating them into a unified controller.

The CXL 3.1-compliant root complex has multiple root ports supporting external memory over PCIe and a host bridge with a host-managed device memory decoder that connects to the GPU’s system bus and manages the system memory.

There are other challenges facing Panmnesia that are beyond its control, a big one being that AMD and Nvidia must add CXL support to their GPUs. It’s also possible that industry players decide they like the approach of using PCIe-attached memory for GPUs and go on to develop their own technology.
