Samsung’s next-generation DDR5 and HBM3 memory will integrate AI engine
At the beginning of February this year, the news show that Samsung’s new HBM2 memory will integrate an AI engine. The HBM-PIM (Aquabolt-XL) chip can provide up to 1.2 TFLOPS of embedded computing capabilities, allowing the memory chip itself to perform CPU, GPU, ASIC, or FPGA operations. Recently, Samsung introduced a larger future development plan on Hot Chips 33, which will expand PIM (processing-in-memory) technology to DDR4, DDR5, LPDDR5X, GDDR6, and HBM3 memory.
Obviously, PIM technology is very suitable for data center use, but Samsung further shifts the technology to more standard memory. This time Samsung demonstrated AXDIMM memory, which is an optimization of existing memory. The AI engine is integrated into the buffer chip to minimize a large amount of data exchange between CPU and DRAM to improve efficiency. AXDIMM memory can be inserted into a standard DDR4 server memory slot to be used without major changes to the existing hardware architecture.
Samsung’s PIM technology is not limited to relatively high-end areas, but can also be shifted to common products close to consumers, such as laptops, tablets, and even mobile phones. Samsung has planned to introduce PIM technology into LPDDR5 memory, but it is still in the early stages. In the simulation test of LPDDR5-PIM memory (6400 MHz), the performance of its speech recognition workload was increased by 2.3 times, the translation performance based on the converter was increased by 1.8 times, and the GPT-2 text generation ability was increased by 2.4 times. Consumption has been reduced by more than 60%.
Although Samsung’s PIM technology is progressing rapidly and can be used with standard memory controllers, it has not been certified by JEDEC. This is the main obstacle to the widespread use of PIM technology. Samsung hopes that the PIM specification can be standardized to further expand the portfolio of memory products, such as adding it to the HBM3 standard specification. At present, JEDEC has not officially released the HBM3 standard specification, it is still under development.Samsung said that it will move forward from FP16 of HBM2 to FP64 of HBM3, which means that the chip will have expanded functions. FP16 and FP32 will be reserved for data center use, while INT8 and INT16 will serve LPDDR5, DDR5, and GDDR6 memory to better adapt to different application scenarios.