Next-Gen AI Successor To Ponte Vecchio Xe-HPC GPU With Up To 160 Xe Cores, Over 20,000 ALUs, OAM 2.0, Sampling in 2023

Next-Gen AI Successor To Ponte Vecchio Xe-HPC GPU With Up To 160 Xe Cores, Over 20,000 ALUs, OAM 2.0, Sampling in 2023

Intel has officially unveiled its next-generation successor to its flagship Xe GPU, the Ponte Vecchio, known as Rialto Bridge. The new graphics chip is designed for the next generation of AI & HPC data center segment, aiming at AMD’s CDNA & NVIDIA’s CUDA processors.

Intel Rialto Bridge GPU Unveiled: Successor To Ponte Vecchio With 25% More Cores, Increased Flops, Targeting AMD & NVIDIA Data Center GPUs

The Intel Rialto Bridge GPU can be seen as an upgraded version of Ponte Vecchio with more cores, more flops, more bandwidth, and more GT/s. Intel hasn’t disclosed a lot of details but claims Rialto Bridge will feature up to 160 Xe cores. We don’t know yet if these are the same cores as the current Ponte Vecchio GPUs or based on a brand new architecture but it looks like the latter might be true.

Intel Falcon Shores XPU To Push High-Performance Computing To The Extreme With Multi-Tiled x86 CPU & Xe GPU Configurations, Targeting Over 5x Performance Per Watt

Today we are announcing our successor to this powerhouse data center GPU, code-named Rialto Bridge. By evolving the Ponte Vecchio architecture and combining enhanced tiles with next process node technology, Rialto Bridge will offer significantly increased density, performance, & efficiency, while providing software consistency.

Intel’s Rialto Bridge is named after the bridge of the same name which is the oldest of the four bridges spanning the Grand Canal in Venice, Italy. So was the case with Ponte Vecchio & it looks like the next gen that comes after Rialto will also be named after an iconic bridge. According to Intel, its Rialto Bridge GPU will power the next generation of AI & HPC Data Center solutions while aiming at the AMD CDNA & NVIDIA CUDA accelerators.

In terms of specifications, we only know that the Rialto Bridge GPU will house up to 160 Xe cores in its brand new OAM v2 form factor. But besides the unveiling of the specs, Intel also gives us a first look at the chip itself and there are some things we can dissect. The biggest change to the GPU is in its GPU die layout. While Ponte Vecchio has 16 Xe-HPC dies, each with 8 Xe cores for a total of 128 cores or 16,384 ALUs, the Rialto Bridge GPU comes with 8 16 Xe-HPC dies. So that should be 20 Xe cores per die for a total of 160 Xe cores on the 8 dies. That rounds up to 20,480 ALUs which is a 25 percent increase over its predecessor.

The rest of the Rialto Bridge GPU structure is pretty much the same as the Ponte Vecchio GPU with two Xe Link Tiles, eight HBM Tiles (HBM3) with four HBM stacks tied to each compute tile (4 Xe HPC dies), There are also the passive die stiffeners located around the Compute Tiles while the Xe Link & HBM3 Tiles are connected to the Compute Tile using an EMIB Tile. The Foveros chip interconnect is used by the Compute Tile to communicate with the rest of the Xe Dies. We don’t know the actual variation of each tile yet but it should be based on the new Foveros Omni (3rd Gen) design. Also, it looks like the Rambo Cache tile is missing but it is highly possible that given the die size increase of each Compute tile, the cache is now featured on the Compute tile itself rather than have it separate on its own tile.

Intel’s Sapphire Rapids HBM ‘Xeon Scalable’ CPUs With 64 GB HBM2e Memory Offer Up To 3x Performance Increase Over Ice Lake Xeons

As for performance, Intel hasn’t revealed any clear numbers and only stated that we should expect more FLOPs, GT/s, and increased bandwidth. The increased bandwidth should be coming from the upgraded HBM3 memory dies. The Ponte Vecchio GPUs are already equipped with up to 128 GB of VRAM capacities so that should be what we also see on the Rialto Bridge GPUs but Intel could stack it up even higher.

Following is the full Intel Rialto Bridge die configuration that we can dissect at the moment:

  • 8Xe HPC (internal/external)
  • 2 Xe Base (internal)
  • 11 EMIB (internal)
  • 2 Xe Link (external)
  • 8 HBMs (external)

Intel hasn’t given any release time or details regarding the process node for the Rialto Bridge GPU but it is likely that we will hear more about it in mid-2023 when it will be sampled to first customers and a launch that aims either late 2023 or 1H of 2024.

Next-Gen Data Center GPU Accelerators

GPU Name AMD Instinct MI250X NVIDIA Hopper GH100 Intel Ponte Vecchio Intel Rialto Bridge
Packaging Design MCM (Infinity Fabric) Monolithic MCM (EMIB + Foveros) MCM (EMIB + Foveros)
GPU Architecture Aldebaran (CNAD 2) Hopper GH100 Xe-HPC Xe-HPC
GPU Process Node 6nm 4N 7nm (Intel 4) 5nm (Intel 3)?
GPU Cores 14,080 16,896 16,384 ALUs
(128 Xe Cores)
20,480 ALUs
(160 Xe Cores)
GPU Clock Speed 1700MHz ~1780MHz TBA TBA
L2/L3 Cache 2 x 8MB 50MB 2 x 204MB TBA
FP16 Compute 383 TOPs 2000 TFLOPs TBA TBA
FP32 Compute 95.7 TFLOPs 1000 TFLOPs ~45 TFLOPs (A0 Silicon) TBA
FP64 Compute 47.9 TFLOPs 60 TFLOPs TBA TBA
Memory Capacity 128GB HBM2E 80GB HBM3 128 GB HBM2e 128 GB HBM3?
Memory Clock 3.2Gbps 3.2Gbps TBA TBA
Memory Bus 8192-bit 5120-bit 8192-bit 8192-bit
Memory Bandwidth 3.2 TB/s 3.0 TB/s ~3 TB/s ~3 TB/s
FormFactor OAM OAM OAM OAM v2
cooling Passive Cooling
Liquid Cooling
Passive Cooling
Liquid Cooling
Passive Cooling
Liquid Cooling
Passive Cooling
Liquid Cooling
PDT 560W 700W 700W 800W
Launch Q4 2021 2H 2022 2022? 2024?

Leave a Comment

Your email address will not be published. Required fields are marked *