

#### Future of Persistent Memory, DRAM and SSD Form Factors Aligned with New System Architectures

As first presented at SNIA Persistent Memory Summit 2021

#### **Large Datasets Need Memory that Scales**





Processor memory and I/O technologies ...

... are being stretched to their limits

More than 2X digital data will be created over the next five years compared to the combined amount since the advent of digital storage (Source; IDC, Mar 2020)





## **Form Factor Migration**



## Performance



| Direct attached<br>(Parallel Bus)<br>100's of GB<br>DIMM<br>DIMM<br>NVDIMM-N, NVDIMM-P<br>(Persistent) |                       | Serial attached<br>and PCIe attached<br>100's of GB to TB's |                          |                           | Network Attached<br>TB's to PB's |                                             |
|--------------------------------------------------------------------------------------------------------|-----------------------|-------------------------------------------------------------|--------------------------|---------------------------|----------------------------------|---------------------------------------------|
|                                                                                                        |                       | E1.S (x4)                                                   |                          | Memory Module             | OpenCAPI<br>DDIMM                |                                             |
|                                                                                                        | DDR DIMM              | E1.S 1C (x4)                                                | E1.S 2C (x8)             | E3.S   AIC (x16)          | E3 with x8 (2C)                  | Across network                              |
| Current<br>Generation<br>*                                                                             | DDR4@3200<br>25.6GB/s | PCIe-Gen4-x4<br>7.8GB/s                                     | PCIe-Gen4-x8<br>15.7GB/s | PCIe-Gen4-x16<br>31.5GB/s | OMI<br>25.6 GB/s<br>8 Ianes      | RDMA<br>(Fabric and work load<br>dependent) |
| Future<br>Generation<br>**                                                                             | DDR5@4800<br>63.0GB/s | PCIe-Gen5-x4<br>15.7GB/s                                    | PCIe-Gen5-x8<br>31.5GB/s | PCIe-Gen5-x16<br>63.0GB/s | DDR5 DDIMM<br>(TBD) GB/s         | Gen-Z, NVMe-oF (TBD)                        |

\* Source(s): https://en.wikipedia.org/wiki/PCI\_Express#History\_and\_revisions

https://en.wikipedia.org/wiki/DDR4\_SDRAM#JEDEC\_standard\_DDR4\_module

https://www.smartm.com/media/press-releases/smart-modular-to-showcase-its-ddr4-differential-dimm-at-the-flash-memory-summit

\*\* Source(s): https://www.tomshardware.com/news/ddr5-6400-ram-benchmarks-major-performance-gains-ddr4







<100ns

<350ns



## **Form Factors**



Serial attached Direct attached Network Attached and PCIe attached (Parallel Bus) TB's to PB's 100's of GB to TB's 100's of GB EDSFF 3" XMM **DRAM Memory Mo** DIMM SMART いまっ DEFF T AND E3.S (x8) NVDIMM-N, NVDIMM-P E3.S (x16) E1.S (x4) E1.S (x8) E1.S with x4 E3.S with x16 E1.S with x8 (2C) **DDR4 DIMM** E3 with x8 (2C) **Network Card** (1C)(4C) 288 pins 84 pins 84 pins 56 pins 140 pins Media and protocol (64 data, (16 diff-data, (32 diff-data, (64 diff-data, (32 diff-data, Pins specific 87 sideband, rest 16 sideband, 24 18 sideband, 34 24 sideband, 52 18 sideband, 34 power) power) power) power) power) 57.0mm x 35.6mm x Connect 23.8mm x 142.0mm x 6.5mm 35.6mm x 6.0mm SFP/QSFP/... 6.0mm 6.0mm 6.0mm or (LxW) Input voltage Input voltage Input voltage 12V Input voltage=1.2V 12V 12V Input voltage Vaux 3.3 Vendor specific Power **VPP 2.5** Vaux 3.3 Vaux=3.3 12V (optional) (optional) (optional)

# **Bandwidth**

12 Channels 2 DIMMs/Channel 128GB DIMM 3TB Memory





- Theoretical maximum bandwidth of 921.6 GB/S
- 9.6 GB/s per core

PERSISTENT MEMORY + SUMMIT 2021 COMPUTATIONAL STORAGE

8 x8 links 256GB E3.S 2TB Memory



- Theoretical maximum bandwidth of 252.16 GB/s
- 2.6 GB/s per core.

Estimated higher bandwidth of 12.2 GB/s per core

## **E1.S and E1.L for Memory Acceleration**



|                                                                | Feature                | Description                                                                                                                                                                  |
|----------------------------------------------------------------|------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
|                                                                | Host Interface         | <ul> <li>Data: PCIe x4,x8</li> <li>Sideband: SMBus (I2C)</li> <li>Wake-up, Low-power (PWRDIS),</li> </ul>                                                                    |
| interconnect                                                   | Memory                 | 64-128GB with DDR4 or DDR5                                                                                                                                                   |
|                                                                | Protocols              | NVMe, CXL, CCIX, Gen-Z                                                                                                                                                       |
| SMART A REAL PARAMETER AND | Power                  | <ul> <li>Multiple profiles from 12,16, 20, 25W</li> <li>Completely bus powered: 12V (main), 3.3V Aux</li> <li>Supports low power modes (CLKREQ# PWRDIS signaling)</li> </ul> |
|                                                                | Targeted<br>Use Cases  | <ul> <li>Targeted for 1U Servers</li> <li>16 – 32 Slots per 1U Server</li> <li>Improves performance by offloading</li> </ul>                                                 |
|                                                                | Memory<br>Acceleration | fixed functions like encryption,<br>compression or Key-Value semantics<br>to Memory modules.                                                                                 |

Source: https://www.anandtech.com/show/16248/edsff-form-factor-updates

## E3.S and E3.L for Memory Expansion





| EDSFF 3" XMM<br>DRAM Memory Module |
|------------------------------------|
| SMART<br>Kolder Technologies       |
|                                    |

| Feature                                      | Description                                                                                                                                                                        |
|----------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Host Interface                               | <ul> <li>Data: PCIe x16</li> <li>Sideband: SMBus (I2C)</li> <li>Wake-up, Low-power (PWRDIS),</li> </ul>                                                                            |
| Memory                                       | Up to 256GB with DDR4 or DDR5<br>** A Non-volatile persistent memory feature<br>could be support on this form-factor using back-<br>up and restore functionality like in NVDIMM-N. |
| Protocols                                    | NVMe, CXL, CCIX, Gen-Z                                                                                                                                                             |
| Power                                        | <ul> <li>2 profiles 25W (thin) and 40W (thick)</li> <li>Bus powered: 12V (main), 3.3V Aux</li> <li>Supports low power modes (CLKREQ#, PWRDIS signaling)</li> </ul>                 |
| Targeted<br>Use Cases<br>Memory<br>Expansion | Targeted for 2U Server<br>Enables 4TB – 8 TB of Memory expansion with 16<br>E3.S modules in single 2U server, achieving better<br>throughput than direct attached DDR4 DIMM.       |

#### **OpenCAPI High Bandwidth Memory - DDIMM**



COpenCAPI™





| Feature               | Description                                                                                                                                                                                                                                          |
|-----------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Host Interface        | • OpenCAPI                                                                                                                                                                                                                                           |
| Memory                | • Up to 256GB                                                                                                                                                                                                                                        |
| Protocols             | <ul> <li>OMI – Open Memory Interface</li> <li>The memory bus is defined with one read port and<br/>one write port per channel, each having eight<br/>unidirectional differential lanes</li> </ul>                                                    |
| Performance           | <ul> <li>DDR4-3200</li> <li>Latency 40ns</li> <li>Data throughput rate of 25.6GB/s with 8 lanes</li> <li>The DDIMM/OMI approach delivers up to 4TB of memory on a server at about 320GB/second or 512GB at up to 650GB/s sustained rates.</li> </ul> |
| Targeted<br>Use Cases | <ul> <li>Targeted for servers</li> <li>High bandwidth, low latency serial connection for<br/>memory, accelerators, network, storage, and other<br/>devices like ASICs</li> </ul>                                                                     |

## **NVDIMM for Persistent Memory**



#### Key Features of DDR4 and DDR5 NVDIMM-N

- Operation like DRAM
- Fast recovery from system power loss
- Software overhead can be eliminated









Backup Power

| Feature               | Description                                                                                                                                                                                                                                        |
|-----------------------|----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| Host Interface        | • DDR                                                                                                                                                                                                                                              |
| Memory                | • DDR4 16GB, 32GB<br>• DDR5 32GB, 64GB                                                                                                                                                                                                             |
| Protocol              | • JEDEC Compliant DDR4 / DDR5                                                                                                                                                                                                                      |
| Features              | <ul> <li>Throughput of 25.6GB/s (DDR4)</li> <li>Latency ~20ns</li> <li>AES 256 bit Encryption</li> </ul>                                                                                                                                           |
| Targeted<br>Use Cases | <ul> <li>All Flash Arrays, Storage Servers, HPC,<br/>Al Training Servers</li> <li>Needed for very low latency tiering,<br/>caching, write buffering, metadata storage,<br/>checkpointing</li> <li>Needed for Al/ML algorithm processing</li> </ul> |

# **CXL-based NVDIMM (NV-XMM)**





Source: Modified from PIRL 2019, "Accelerate Everything", Stephen Bates, Eideticom

High-speed DRAM with built-in back-up power to back-up data to on-module Flash during power loss.

## Conclusions

Future of Persistent Memory, DRAM and SSD Form Factors





14