With the rapid development of the artificial intelligence server market, there is an increasing demand for High Bandwidth Memory (HBM). According to research by TrendForce, it is estimated that by 2022, SK Hynix, Samsung, and Micron will occupy approximately 50%, 40%, and 10% of the global HBM market share, respectively.
Furthermore, the demand for high-performance deep learning AI GPUs is driving the upgrade of HBM products. It is expected that in the second half of 2023, with the large-scale application of high-end GPUs such as NVIDIA H100 and AMD MI300, SK Hynix, Samsung, and Micron have already started planning and preparing for mass production of corresponding high-spec HBM3 products. This year, it is expected that more customers will start adopting HBM3 products. With such expectations, as the only supplier capable of mass-producing the new generation HBM3 products, SK Hynix's market share in the HBM market is expected to further increase to 53%. Samsung and Micron are expected to start mass production of HBM3 products around the end of this year, and their market shares are projected to increase to 38% and 9%, respectively.
According to NVIDIA's definition, current deep learning and machine learning (DL/ML) AI servers are typically equipped with 4 or 8 high-end graphics cards, combined with two mainstream x86 server CPUs. These servers are mainly used by American cloud service providers such as Google, AWS, Meta, and Microsoft.
According to statistics from TrendForce, the annual growth rate of high-end AI servers with graphics processing units (GPUs) in 2022 is about 9%, with nearly 80% of the shipments concentrated among the eight major cloud service providers from China and the United States. Looking ahead, companies such as Microsoft, Meta, Baidu, and ByteDance are expected to launch product services based on generative AI, so the annual growth rate of AI server shipments this year is expected to reach 15.4%. From 2023 to 2027, the compound annual growth rate of AI server shipments is projected to be approximately 12.2%. These data reflect the strong growth momentum of the AI server market and the increasing demand for high-end AI servers from major cloud service providers. With the continuous development of AI technology and the expansion of application areas, it is expected that the AI server market will continue to grow at a high speed in the coming years.
The latest research from TrendForce indicates that the rise of AI servers is expected to significantly drive the growth in memory demand. Currently, enterprise-level server configurations typically range from 500GB to 600GB of DRAM, while AI servers tend to use single-module memory ranging from 64GB to 128GB, with an average capacity of 1.2 to 1.7TB.
In terms of solid-state drives (SSDs), due to the higher speed requirements of AI servers, they focus more on meeting the needs of DRAM or high-bandwidth memory (HBM). Therefore, although expanding SSD capacity is not necessary, when it comes to selecting the transmission interface, PCIe 5.0 has become the preferred choice in order to accommodate the demands of high-speed computing.
Compared to traditional servers, AI servers make greater use of graphics processing units (GPUs) for parallel computing. For example, the NVIDIA A100 is configured with 80GB of memory and utilizes 4 or 8 computing cards, with projected HBM usage ranging from 320GB to 640GB.
Looking ahead, as the complexity of AI models continues to increase, it will stimulate further growth in memory demand, thus further driving the demand for Server DRAM, SSDs, and HBM. This trend undoubtedly presents significant business opportunities and challenges in the memory market.