Ryotta's Basic

DRAM
๐Ÿง  DRAM ๊ฒ€์ฆ์™„๋ฃŒ

HBM Generation Comparison

HBM DRAM ์„ธ๋Œ€ ๋น„๊ต HBM1 ยท HBM2 ยท HBM3 ยท HBM3E ยท HBM4

dram_0040_hbm_generation_comparison

๊ฐœ์š”

HBM(High Bandwidth Memory)์€ TSV ๊ธฐ๋ฐ˜ 3D ์ ์ธต๊ณผ ์‹ค๋ฆฌ์ฝ˜ ์ธํ„ฐํฌ์ €๋ฅผ ์ด์šฉํ•ด GPU์™€ AI ๊ฐ€์†๊ธฐ ๊ฐ€๊นŒ์ด์—์„œ TB/s๊ธ‰ ๋Œ€์—ญํญ์„ ์ œ๊ณตํ•˜๋Š” DRAM ๊ณ„์—ด์ž…๋‹ˆ๋‹ค. HBM1๊ณผ HBM2๋Š” 1024-bit ์ธํ„ฐํŽ˜์ด์Šค๋ฅผ ์ค‘์‹ฌ์œผ๋กœ ์ถœ๋ฐœํ–ˆ๊ณ , HBM3๋ถ€ํ„ฐ๋Š” pseudo channel๊ณผ ์ฑ„๋„ ๋ถ„ํ• ์„ ํ†ตํ•ด ๋ณ‘๋ ฌ์„ฑ์„ ๋” ์„ธ๋ฐ€ํ•˜๊ฒŒ ๋‚˜๋ˆด์Šต๋‹ˆ๋‹ค.

HBM3E๋Š” HBM3๋ฅผ ๋” ๋†’์€ ํ•€ ์†๋„์™€ ๋” ๋‚ฎ์€ ์ „์••์œผ๋กœ ํ™•์žฅํ•œ ์„ธ๋Œ€์ด๊ณ , HBM4๋Š” 2048-bit ์ธํ„ฐํŽ˜์ด์Šค์™€ ๋” ํฐ I/O ์ง‘์ ์„ ํ†ตํ•ด AI/HPC ์›Œํฌ๋กœ๋“œ์˜ ๋ฉ”๋ชจ๋ฆฌ ๋ณ‘๋ชฉ์„ ์ค„์ด๋Š” ๋ฐฉํ–ฅ์œผ๋กœ ์ง„ํ™”ํ–ˆ์Šต๋‹ˆ๋‹ค. ๊ณต๊ฐœ ์ œํ’ˆ ๊ธฐ์ค€์œผ๋กœ๋Š” Samsung HBM4๊ฐ€ 13 Gbps, 3,300 GB/s, 36 GB 12H๋ฅผ, Micron HBM4๊ฐ€ 11 Gbps ์ด์ƒ, 2.8 TB/s ์ด์ƒ, 2048-pin ๋ฒ„์Šค๋ฅผ ์ œ์‹œํ•ฉ๋‹ˆ๋‹ค.

ํ•ต์‹ฌ ๊ฐœ๋…

Wide interface

HBM์˜ ํ•ต์‹ฌ์€ ๋„“์€ ๋ฒ„์Šค์ž…๋‹ˆ๋‹ค. ์ผ๋ฐ˜ DRAM์ด ์ฑ„๋„ ์ˆ˜๋ฅผ ๋Š˜๋ ค ๋Œ€์—ญํญ์„ ํ™•๋ณดํ•˜๋Š” ๋ฐ˜๋ฉด, HBM์€ ํŒจํ‚ค์ง€ ๋‚ด๋ถ€์—์„œ ๋งค์šฐ ๋„“์€ ์ธํ„ฐํŽ˜์ด์Šค๋ฅผ ์งง์€ ๊ฒฝ๋กœ๋กœ ์—ฐ๊ฒฐํ•ด ํ•€๋‹น ์ „์†ก๋Ÿ‰์„ ํฌ๊ฒŒ ๋Œ์–ด์˜ฌ๋ฆฝ๋‹ˆ๋‹ค.

TSV์™€ ์ธํ„ฐํฌ์ €

HBM์€ ๋‹ค์ด ์‚ฌ์ด๋ฅผ TSV๋กœ ์ˆ˜์ง ์—ฐ๊ฒฐํ•˜๊ณ , GPU๋‚˜ ASIC๊ณผ๋Š” ์‹ค๋ฆฌ์ฝ˜ ์ธํ„ฐํฌ์ €๋กœ ์ด์–ด์ง‘๋‹ˆ๋‹ค. ์ด ๊ตฌ์กฐ ๋•๋ถ„์— ์‹ ํ˜ธ ๊ฒฝ๋กœ๊ฐ€ ์งง์•„์ง€๊ณ  ๋Œ€์—ญํญ ๋ฐ€๋„๊ฐ€ ๋†’์•„์ง€์ง€๋งŒ, ํŒจํ‚ค์ง• ๋‚œ๋„๊ฐ€ ์˜ฌ๋ผ๊ฐ‘๋‹ˆ๋‹ค.

Pseudo channel๊ณผ ์ฑ„๋„ ์„ธ๋ถ„ํ™”

HBM2์˜ pseudo channel์€ ๋‚ด๋ถ€ ๋ณ‘๋ ฌ์„ฑ์„ ๋” ์ž˜ ํ™œ์šฉํ•˜๊ธฐ ์œ„ํ•œ ๋ถ„ํ•  ๋ฐฉ์‹์ž…๋‹ˆ๋‹ค. HBM3 ์ดํ›„์—๋Š” ์ฑ„๋„ ํญ์„ ๋” ์ž˜๊ฒŒ ๋‚˜๋ˆ  ๋ณ‘๋ ฌ ์ ‘๊ทผ์„ฑ๊ณผ ํšจ์œจ์„ ๋†’์˜€๊ณ , HBM4๋Š” 32์ฑ„๋„ ๊ตฌ์กฐ๋กœ ์ด ํ๋ฆ„์„ ๋” ํ™•์žฅํ•ฉ๋‹ˆ๋‹ค.

์‹ ๋ขฐ์„ฑ ๊ธฐ๋Šฅ

์„ธ๋Œ€๊ฐ€ ์˜ฌ๋ผ๊ฐˆ์ˆ˜๋ก on-die ECC, ECS, DRFM ๊ฐ™์€ ๊ธฐ๋Šฅ์ด ๋ถ™์–ด AI/HPC ํ™˜๊ฒฝ์—์„œ์˜ ์žฅ์‹œ๊ฐ„ ๋™์ž‘๊ณผ ์˜ค๋ฅ˜ ๋‚ด์„ฑ์„ ๋ณด๊ฐ•ํ•ฉ๋‹ˆ๋‹ค. HBM์€ ๋‹จ์ˆœํ•œ ์†๋„ ๊ฒฝ์Ÿ์ด ์•„๋‹ˆ๋ผ ํŒจํ‚ค์ง•, ์ „๋ ฅ ๋ฐฐ๋ถ„, ๋ณต์›๋ ฅ์„ ํ•จ๊ป˜ ๋ณด๋Š” ๋ฉ”๋ชจ๋ฆฌ์ž…๋‹ˆ๋‹ค.

๋น„๊ต/๋ถ„์„

์ถœ์ฒ˜: JEDEC JESD235/235D (HBM1, HBM2) ยท JESD238 (HBM3) ยท JESD238A (HBM3E) ยท Samsung HBM4 product page ยท Micron HBM4 product page ยท HBM4 public technical reports

NEW = ํ•ด๋‹น ์„ธ๋Œ€์—์„œ ์ฒ˜์Œ ๋„์ž…. N/A = ํ•ด๋‹น ์—†์Œ. โ€ก ๊ณต๊ฐœ ์ œํ’ˆ ๊ธฐ์ค€ ์ˆ˜์น˜๋กœ, JEDEC baseline spec ์ž์ฒด ์ˆ˜์น˜๋Š” ์•„๋‹˜.

๋Œ€์—ญํญ ๊ณ„์‚ฐ ์˜ˆ์‹œ: HBM1=8ร—16Bร—1G, HBM2=8ร—16Bร—2.4G, HBM3=16ร—8Bร—6.4G, HBM3E=16ร—8Bร—9.6G, HBM4=32ร—8Bร—8G baseline์ž…๋‹ˆ๋‹ค. ๊ณต๊ฐœ ์ œํ’ˆ์€ Samsung 3,300 GB/s, Micron >2.8 TB/s ์ˆ˜์ค€๊นŒ์ง€ ์ œ์‹œ๋ฉ๋‹ˆ๋‹ค.

Parameter HBM1 Legacy Mode HBM2 Pseudo Channel Mode HBM3 Pseudo Channel Mode HBM3E HBM3 Extended HBM4 Pseudo Channel Mode
JEDEC Standard JESD235 (2013) JESD235Aโ€“D JESD238 (Jan 2022) JESD238A (May 2023) JESD270-4 (Apr 2025)
ELECTRICAL ELECTRICAL ELECTRICAL ELECTRICAL ELECTRICAL ELECTRICAL
VDDC (core) 1.2 V 1.2 V 1.1 V 1.1 V (same as HBM3) NEW 1.0 V / 1.05 V (vendor-specific)
VDDQ (I/O) 1.2 V 1.2 V 1.1 V 1.1 V (same as HBM3) NEW 0.7 / 0.75 / 0.8 / 0.9 V (vendor-specific)
Tx driver voltage โ€” โ€” 0.4 V 0.4 V 0.4 V
INTERFACE & TOPOLOGY INTERFACE & TOPOLOGY INTERFACE & TOPOLOGY INTERFACE & TOPOLOGY INTERFACE & TOPOLOGY INTERFACE & TOPOLOGY
Total interface width 8ch ร— 128b = 1024-bit 8ch ร— 128b = 1024-bit 16ch ร— 64b = 1024-bit 16ch ร— 64b = 1024-bit (same as HBM3) NEW 32ch ร— 64b = 2048-bit (doubled vs. HBM3/3E)
Channels / stack Up to 8 Up to 8 Up to 16 Up to 16 (same as HBM3) NEW Up to 32
DQ width / channel 128-bit 128-bit 64-bit 64-bit 64-bit
Pseudo channels / ch. โ€” 2 ร— 64-bit 2 ร— 32-bit 2 ร— 32-bit 2 ร— 32-bit
Total pseudo ch. / stack โ€” 16 32 32 NEW 64
Prefetch / access 256-bit (BL2) 256-bit / PC (BL4) 256-bit / PC (BL8) 256-bit / PC (BL8) 256-bit / PC (BL8)
Separate cmd/data buses โ€” โ€” โ€” โ€” NEW โœ“
SPEED & BANDWIDTH SPEED & BANDWIDTH SPEED & BANDWIDTH SPEED & BANDWIDTH SPEED & BANDWIDTH SPEED & BANDWIDTH
Data rate / pin 1.0 Gbps (target) 1.0 โ€“ 3.6 Gbps/pin 4.8 โ€“ 6.4 Gbps/pin NEW 9.2 โ€“ 9.6 Gbps/pin (JESD238A speed bins) NEW 8.0 Gbps/pinโ€ก baseline; public products 11.0 โ€“ 13.0 Gbpsโ€ก
Peak bandwidth / stack ~128 GB/s (8chร—16Bร—1G) ~307 GB/s (8chร—16Bร—2.4G) ~819 GB/s (16chร—8Bร—6.4G) NEW ~1,229 GB/s (~1.2 TB/s) (16chร—8Bร—9.6G) NEW ~2,048 GB/s (2 TB/s) baseline; public products >2.8 TB/sโ€ก
Bandwidth gain vs. prev. gen baseline ~2.4ร— ~2.7ร— ~1.5ร— ~1.7ร—
ARRAY ORGANIZATION ARRAY ORGANIZATION ARRAY ORGANIZATION ARRAY ORGANIZATION ARRAY ORGANIZATION ARRAY ORGANIZATION
Banks / channel 8 8, 16, 32, or 48 16, 32, 48, or 64 16, 32, 48, or 64 (same as HBM3) TBD (โ‰ฅ HBM3)
Bank groups โ€” โœ“ โœ“ โœ“ โœ“
Page size / pseudo ch. 2 KB / ch. 2 KB / ch. 1 KB / PC 1 KB / PC 1 KB / PC
Die density 1, 2 Gb / die 4, 8, 12, 16 Gb / die 8, 16, 24, 32 Gb / die 16, 24 Gb / die (products: 24GB 8-Hi, 36GB 12-Hi) NEW 24, 32 Gb / die
Stack heights (Hi) 4-Hi 8-Hi (12-Hi opt.) 4 / 8 / 12 / 16-Hi 8-Hi, 12-Hi, 16-Hi 4 / 8 / 12 / 16-Hi
Max stack capacity ~4 GB ~24 GB (8-Hi, 16Gb) ~64 GB (16-Hi, 32Gb) ~36 GBโ€ก (12-Hi, SK Hynix/Micron) NEW ~64 GB (16-Hi, 32Gb)
HBM3 controller compat. โ€” โ€” โ€” โœ“ (HBM3E โ†” HBM3) โœ“ (backward compat. w/ HBM3)
RELIABILITY & RAS RELIABILITY & RAS RELIABILITY & RAS RELIABILITY & RAS RELIABILITY & RAS RELIABILITY & RAS
ECC mode Optional (external pins) Optional (external pins) On-die ECC (mandatory) Symbol-based, 256b+16b On-die ECC (mandatory) (same as HBM3) On-die ECC (mandatory) + enhanced RAS
Error Check & Scrub (ECS) โ€” โ€” โœ“ โœ“ โœ“
Directed Refresh Mgmt. (DRFM) โ€” โ€” โ€” โ€” NEW โœ“ (row-hammer mitigation)
DBIac โ€” โœ“ โœ“ โœ“ โœ“
Interconnect redundancy remapping โœ“ โœ“ โœ“ โœ“ โœ“
Self-refresh modes โœ“ โœ“ โœ“ โœ“ โœ“

๋™์ž‘ ์›๋ฆฌ

HBM ์ปจํŠธ๋กค๋Ÿฌ๋Š” row activate์™€ read/write ๋ช…๋ น์„ ๋‚ด๋ฆฐ ๋’ค, pseudo channel ๋‹จ์œ„๋กœ ๋ฐ์ดํ„ฐ๋ฅผ ๋ถ„์‚ฐํ•ด ๋” ์งง์€ ๋‚ด๋ถ€ ๊ฒฝ๋กœ๋กœ ์ฃผ๊ณ ๋ฐ›์Šต๋‹ˆ๋‹ค. ์ด๋Ÿฐ ๊ตฌ์กฐ๋Š” ๊ฐ™์€ ํ•€ ์ˆ˜์—์„œ ๋” ๋†’์€ ์ฒ˜๋ฆฌ๋Ÿ‰์„ ์–ป๋Š” ๋ฐ ์œ ๋ฆฌํ•˜์ง€๋งŒ, command scheduling๊ณผ refresh ํƒ€์ด๋ฐ์„ ๋” ์ •๊ตํ•˜๊ฒŒ ๋งž์ถฐ์•ผ ํ•ฉ๋‹ˆ๋‹ค.

์ ์ธต๋œ DRAM ๋‹ค์ด์™€ logic die ์‚ฌ์ด์˜ ์—ฐ๊ฒฐ์€ TSV์™€ micro-bump๊ฐ€ ๋‹ด๋‹นํ•˜๊ณ , GPU์™€ HBM ์Šคํƒ ์‚ฌ์ด์˜ ์—ฐ๊ฒฐ์€ ์ธํ„ฐํฌ์ €๊ฐ€ ๋‹ด๋‹นํ•ฉ๋‹ˆ๋‹ค. ๊ทธ๋ž˜์„œ ์„ฑ๋Šฅ์€ ๋ฉ”๋ชจ๋ฆฌ ๋‹ค์ด ์ž์ฒด๋ณด๋‹ค๋„ ํŒจํ‚ค์ง€ ์„ค๊ณ„, ์ „์› ๋ถ„๋ฐฐ, ์—ด ํ™•์‚ฐ, ์ธํ„ฐํฌ์ € ๋ฐฐ์„  ํ’ˆ์งˆ์— ํฌ๊ฒŒ ์ขŒ์šฐ๋ฉ๋‹ˆ๋‹ค.

HBM3E์™€ HBM4์—์„œ๋Š” on-die ECC, ECS, DRFM ๊ฐ™์€ ๊ธฐ๋Šฅ์ด ์˜ค๋ฅ˜ ํšŒ๋ณต๊ณผ ์žฅ๊ธฐ ์•ˆ์ •์„ฑ์„ ๋ณด๊ฐ•ํ•ฉ๋‹ˆ๋‹ค. AI ํ•™์Šต๊ณผ ์ถ”๋ก ์—์„œ๋Š” ๋Œ€์—ญํญ์ด ๊ณง ์ฒ˜๋ฆฌ๋Ÿ‰์ด๋ฏ€๋กœ, HBM์€ "๋น ๋ฅธ ๋ฉ”๋ชจ๋ฆฌ"๋ผ๊ธฐ๋ณด๋‹ค "ํŒจํ‚ค์ง€ ๋‹จ์œ„ ์‹œ์Šคํ…œ ๋ฉ”๋ชจ๋ฆฌ"์— ๊ฐ€๊น๊ฒŒ ๋‹ค๋ค„์ง‘๋‹ˆ๋‹ค.

์žฅ๋‹จ์ 

  • ์žฅ์ : ๋งค์šฐ ๋„“์€ ๋ฒ„์Šค๋กœ TB/s๊ธ‰ ๋Œ€์—ญํญ์„ ๋งŒ๋“ค ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.
  • ์žฅ์ : ์งง์€ ์ธํ„ฐ์ปค๋„ฅํŠธ ๋•๋ถ„์— ํ•€๋‹น ์ „๋ ฅ ํšจ์œจ์ด ๋†’์Šต๋‹ˆ๋‹ค.
  • ์žฅ์ : AI/HPC์šฉ GPU์™€ ASIC์— ํ•„์š”ํ•œ ๋Œ€์šฉ๋Ÿ‰ ๋ณ‘๋ ฌ ์ฒ˜๋ฆฌ์— ์ž˜ ๋งž์Šต๋‹ˆ๋‹ค.
  • ๋‹จ์ : ์ธํ„ฐํฌ์ €, TSV, ์ ์ธต ๊ณต์ • ๋•Œ๋ฌธ์— ์ œ์กฐ ๋น„์šฉ๊ณผ ๋‚œ๋„๊ฐ€ ๋†’์Šต๋‹ˆ๋‹ค.
  • ๋‹จ์ : ์—ด ๋ฐ€๋„๊ฐ€ ๋†’์•„ ํŒจํ‚ค์ง€ ์ˆ˜์ค€์˜ ์—ด ์„ค๊ณ„๊ฐ€ ์ค‘์š”ํ•ฉ๋‹ˆ๋‹ค.
  • ๋‹จ์ : DIMM์ฒ˜๋Ÿผ ์‰ฝ๊ฒŒ ๊ต์ฒดํ•˜๊ฑฐ๋‚˜ ํ™•์žฅํ•˜๊ธฐ ์–ด๋ ต์Šต๋‹ˆ๋‹ค.

๊ด€๋ จ ๊ธฐ์ˆ 

ํ•ต์‹ฌ ์ •๋ฆฌ

HBM ์„ธ๋Œ€ ๋น„๊ต์˜ ํ•ต์‹ฌ์€ ๋ฒ„์Šค ํญ, ํ•€ ์†๋„, ์ ์ธต ๋†’์ด, ์‹ ๋ขฐ์„ฑ ๊ธฐ๋Šฅ์ด ํ•จ๊ป˜ ์ง„ํ™”ํ•œ๋‹ค๋Š” ์ ์ž…๋‹ˆ๋‹ค. HBM1๊ณผ HBM2๋Š” 1024-bit ๊ตฌ์กฐ๋ฅผ, HBM3์™€ HBM3E๋Š” ๋” ์„ธ๋ถ„ํ™”๋œ ์ฑ„๋„๊ณผ ๋†’์€ ํ•€ ์†๋„๋ฅผ, HBM4๋Š” 2048-bit ์ธํ„ฐํŽ˜์ด์Šค์™€ ๊ณต๊ฐœ ์ œํ’ˆ ๊ธฐ์ค€ 3TB/s ์•ˆํŒŽ์˜ ๋Œ€์—ญํญ์„ ๋ณด์—ฌ ์ค๋‹ˆ๋‹ค. ์‹ค์ œ ์„ฑ๋Šฅ์€ ๋ฉ”๋ชจ๋ฆฌ ๋‹ค์ด๋งŒ์ด ์•„๋‹ˆ๋ผ ์ธํ„ฐํฌ์ €, logic die, ์ „๋ ฅ ๋ถ„๋ฐฐ, ๋ฐœ์—ด ๊ด€๋ฆฌ๊นŒ์ง€ ํฌํ•จํ•œ ํŒจํ‚ค์ง€ ์„ค๊ณ„๊ฐ€ ๊ฒฐ์ •ํ•ฉ๋‹ˆ๋‹ค. ์ด ๋ฌธ์„œ๋Š” ํ‘œ์ค€ ์„ธ๋Œ€์™€ ๊ณต๊ฐœ ์ œํ’ˆ ์ •๋ณด๋ฅผ ํ•จ๊ป˜ ๋†“๊ณ  HBM์˜ ์ง„ํ™” ๋ฐฉํ–ฅ์„ ๋น ๋ฅด๊ฒŒ ๋Œ€์กฐํ•  ์ˆ˜ ์žˆ๊ฒŒ ์ •๋ฆฌํ•ฉ๋‹ˆ๋‹ค.