Intel xeon comparison chart: Detailed Specifications of the «Ice Lake SP» Intel Xeon Processor Scalable Family CPUs

Detailed Specifications of the «Ice Lake SP» Intel Xeon Processor Scalable Family CPUs

Articles > Detailed Specifications of the “Ice Lake SP” Intel Xeon Processor Scalable Family CPUs

This article provides in-depth discussion and analysis of the 10nm Intel Xeon Processor Scalable Family (formerly codenamed “Ice Lake-SP” or “Ice Lake Scalable Processor”). These processors replace the previous 14nm “Cascade Lake-SP” microarchitecture and are available for sale as of April 6, 2021.

The “Ice Lake SP” CPUs are the 3rd generation of Intel’s Xeon Scalable Processor family. This generation brings new features, increased performance, and new server/workstation platforms. The Xeon ‘Ice Lake SP’ CPUs cannot be installed into previous-generation systems. Those considering a new deployment are encouraged to review with one of our experts.

Highlights of the features in Xeon Scalable Processor Family “Ice Lake SP” CPUs include:

  • Up to 40 processor cores per socket (with options for 8-, 12-, 16-, 18-, 20-, 24-, 26-, 28-, 32-, 36-, and 38-cores)
  • Up to 38% higher per-core performance through micro-architecture improvements (at same clock speed vs “Cascade Lake SP”)
  • Significant memory performance & capacity increases:
    • Eight-channel memory controller on each CPU (up from six)
    • Support for DDR4 memory speeds up to 3200MHz (up from 2933MHz)
    • Large-memory capacity with Intel Optane Persistent Memory
    • All CPU models support up to 6TB per socket (combined system memory and Optane persistent memory)
  • Increased link speed between CPU sockets: 11. 2GT/s UPI links (up from 10.4GT/s)
  • I/O Performance Improvements – more than twice the throughput of “Cascade Lake SP”:
    • PCI-Express generation 4.0 doubles the throughput of each PCI-E lane (compared to gen 3.0)
    • Support for 64 PCI-E lanes per CPU socket (up from 48 lanes)
  • Continued high performance with the AVX-512 instruction capabilities of the previous generation:
    • AVX-512 instructions (up to 16 double-precision FLOPS per cycle per AVX-512 FMA unit)
    • Two AVX-512 FMA units per CPU core (available in all Ice Lake-SP CPU SKUs)
  • Continued support for deep learning inference with AVX-512 VNNI instruction:
    • Intel Deep Learning Boost (VNNI) provides significant, more efficient deep learning inference acceleration
    • Combines three AVX-512 instructions (VPMADDUBSW, VPMADDWD, VPADDD) into a single VPDPBUSD operation
  • Improvements to Intel Speed Select processor configurability:
    • Performance Profiles: certain processors support three distinct core count/clock speed operating points
    • Base Frequency: specific CPU cores are given higher base clock speeds; the remaining cores run at lower speeds
    • Turbo Frequency: specific CPU cores are given higher turbo-boost speeds; the remaining cores run at lower speeds
    • Core Power: each CPU core is prioritized; when surplus frequency is available, it is given to high-priority cores
  • Integrated hardware-based security improvements and total memory encryption

With a product this complex, it’s very difficult to cover every aspect of the design. Here, we concentrate primarily on the performance of the processors for HPC & AI applications.

Continued Specialization of Xeon CPU SKUs

Those already familiar with Intel Xeon will see this processor family is divided into familiar tiers: Silver, Gold, and Platinum. The Silver and Gold models are in the price/performance range familiar to HPC/AI teams. Platinum models are in a higher price range. The low-end Bronze tier present in previous generations has been dropped.

Further, Intel continues to add new specialized CPU models that are optimized for particular workloads and environments. Many of these specialized SKUs are not relevant to readers here, but we summarize them briefly:

  • N: network function virtualization (NFV) optimized
  • P: virtualization-optimized (with a focus on clock frequency)
  • S: max SGX enclave size
  • T: designed for higher-temperature environments (NEBS)
  • V: virtualization-optimized (with focus on high-density/low-power)

Targeting specific workloads and environments provides the best performance and efficiency for those use cases. However, using these CPUs for other workloads may reduce performance, as the CPU clock frequencies and Turbo Boost speeds are guaranteed only for those specific workloads. Running other workloads on these optimized CPUs will likely lead to CPU throttling, which would be undesirable. Considering these limitations, the above workload-optimized models will not be included in our review.

For those who prefer to see all details in one chart, Intel has provided a summary of all of the Xeon 3rd-generation CPUs (including both the ‘Ice Lake’ CPUs covered here, as well as the ‘Cooper Lake’ CPUs which provide 4-socket and 8-socket capability). The left-most column shows ‘Cooper Lake’ CPU SKUs. The middle and right columns summarize the ‘Ice Lake SP’ CPU SKUs covered in this article.

Four Xeon CPU specializations relevant to HPC & AI use cases

There are several specialized Xeon CPU options which are relevant to high performance computationally-intensive workloads. Each capability is summarized below and included in our analysis.

  • Liquid-cooled – Xeon 8368Q CPU: optimized for liquid-cooled deployment, this CPU SKU offers high core counts along with higher CPU clock frequencies. The high clock frequencies are made possible only through the more effective cooling provided by liquid-cooled datacenters.
  • Media, AI, and HPC – Xeon 8352M CPU: optimized for AVX-heavy vector instruction workloads as found in media processing, AI, and HPC; this CPU SKU offers improved performance per watt.
  • Performance Profiles – Y: a set of CPU SKUs with support for Intel Speed Select Technology – Performance Profiles. These CPUs are indicated with a Y suffix in the model name (e.g., Xeon 8352Y) and provide flexibility for those with mixed workloads. Each CPU supports three different operating profiles with separate CPU core count, base clock and turbo boost frequencies, as well as operating wattages (TDP). In other words, each CPU could be thought of as three different CPUs. Administrators switch between profiles via system BIOS, or through Operating Systems with support for this capability (Intel SST-PP). Note that several of the other specialized CPU SKUs also support multiple Performance Profiles (e.g., Xeon 8352M).
  • Single Socket – U: single-socket optimized. The CPUs designed for a single socket are indicated with a U suffix in the model name (e.g., Xeon 6312U). These CPUs are more cost-effective. However, they do not include UPI links and thus can only be installed in systems with a single processor.

Summary of Xeon “Ice Lake-SP” CPU tiers

With the Bronze CPU tier no longer present, all models in this CPU family are well-suited to HPC and AI (though some will offer more performance than others). Before diving into the details, we provide a high-level summary of this Xeon processor family:

  • Intel Xeon Silver – suitable for entry-level HPC
    The Xeon Silver 4300-series CPU models provide higher core counts and increased memory throughput compared to previous generations. However, their performance is limited compared to Gold and Platinum (particularly on Core Count, Clock Speed, Memory Performance, and UPI speed).
  • Intel Xeon Gold – recommended for most HPC workloads
    Xeon Gold 5300- and 6300-series CPUs provide the best balance of performance and price. In particular, the 6300-series models should be preferred over the 5300-series models, because the 6300-series CPUs offer improved Clock Speeds and Memory Performance.
  • Intel Xeon Platinum – only for specific HPC workloads
    Although 8300-series models provide the highest performance, their higher price makes them suitable only for particular workloads which require their specific capabilities (e.g., highest core count, large L3 cache).

Xeon “Ice Lake SP” Computational Performance

With this new family of Xeon processors, Intel once again delivers unprecedented performance. Nearly every model provides over 1 TFLOPS (one teraflop of double-precision 64-bit performance per second), many models exceed 2 TFLOPS, and a few touch 3 TFLOPS. These performance levels are achieved through high core counts and AVX-512 instructions with FMA (as in the first and second Xeon Scalable generations). The plots in the tabs below compare the performance ranges for these new CPUs:

AVX-512 Instruction Performance

AVX2 Instruction Performance

In the charts above, the shaded/colored bars indicate the expected performance range for each CPU model. The performance is a range rather than a specific value, because CPU clock frequencies scale up and down on a second-by-second basis. The precise achieved performance depends upon a variety of factors including temperature, power envelope, type of cooling technology, the load on each CPU core, and the type(s) of CPU instructions being issued to each core.

The first tab shows performance when using Intel’s AVX-512 instructions with FMA. Note that only a small set of codes will be capable of issuing exclusively AVX-512 FMA instructions (e.g., HPL LINPACK). Most applications issue a mix of instructions and will achieve lower than peak FLOPS. Further, applications which have not been re-compiled with an appropriate compiler will not include AVX-512 instructions and thus achieve lower performance. Computational applications which do not utilize AVX-512 instructions will most likely utilize AVX2 instructions (as shown in the second tab with AVX2 Instruction performance.

The pricing of the 3rd-generation Xeon Processor Scalable Family spans a wide range, so budget must be kept in mind when selecting options. It would be frustrating to plan on 38-core processors when the budget cannot support a price of more than $10,000 per CPU. The plot below compares the prices of the Xeon “Ice Lake SP” processors:

As shown in the above plot, the CPUs in this article have been sorted by tier and by price. Most HPC users are expected to select CPU models from the Gold Xeon 6300-series. These models provide close to peak performance for a price around $3,000 per processor. Certain specialized applications will leverage the Platinum Xeon 8300-series

To ease comparisons, all of the plots in this article are ordered to match the above plot. Keep this pricing in mind as you review this article and plan your system architecture.

Intel “Ice Lake SP” Xeon Processor Scalable Family Specifications

The sets of tabs below compare the features and specifications of this new Xeon processor family. As you will see, the Silver (4300-series) and lower-end Gold (5300-series) CPU models offer fewer capabilities and lower performance. The higher-end Gold (6300-series) and Platinum (8300-series) offer more capabilities and higher performance. Additionally, certain CPU SKUs have special models integrating additional specializations:

  • Enabled for Intel Speed Select Performance Profiles (indicated with a Y suffix on the part number)
  • Designed for single CPU socket systems (indicated with a U suffix on the part number)
  • Requires a liquid-cooled thermal solution in the server (indicated with a Q suffix on the part number)

CPU Cores

Most HPC groups will find that processors with 12 to 32 CPU cores fit within their budget. However, the 36-, 38-, and 40-core models are only available within the Platinum tier and will be at a higher cost than most groups would consider cost-effective.

CPU Speed

The relatively standard base clock speeds of this CPU family demonstrate that this launch focuses most on overall throughput, as there are few options with high base CPU frequencies. There are a few CPU options with 3+ GHz base frequencies, but most models prefer higher core counts with more modest clock speeds.

Memory Speed

Memory performance of Intel Xeon “Ice Lake-SP” is fairly straightforward, with the Silver CPUs providing a lower speed than the Gold and Platinum models. Demanding users should consider the 6300-series as a minimum in order to ensure memory operates at the maximum speed of 3200MT/s. The amount of memory bandwidth available per CPU core is an important factor, but is essentially a function of the number of cores. Users planning to run on CPUs with higher core counts need to ensure that each core won’t be starved for data.

It is important to note that some system platforms support two memory slots per memory channel (a total of 32 DIMMs in a dual-socket system). If both memory slots are populated with memory, the slots may run at a slower speed (depending upon the platform) – this is simply an electrical/signaling limit.

AVX Units

As shown, all of the ‘Ice Lake SP’ Xeon CPUs provide two AVX-512 units per processor core. This is an important change compared to both of the previous generations of Intel Xeon Scalable processors.

L3 Cache

The “Ice Lake-SP” CPUs have been designed to offer at least 1.5MB of L3 cache per processor core. As shown below, there are a few models which feature a larger quantity of L3 per core, but only a modest increase. Remember that each core also has 1.25MB of private L2 cache. In this architecture, the L3 cache is designed as a fallback for data that spills out of L2.

UPI

The UPI capabilities of these CPUs are similar to the previous generations. Each CPU supports two or three UPI links operating at 10.4GT/s or 11.2GT/s. Only the Xeon ‘Gold’ and ‘Platinum’ tiers (5300-, 6300-, and 8300-series) support the higher number of UPI links, which allows greater connectivity between sockets. Dual-socket systems are the most popular configuration for HPC, but not all dual-socket platforms support all three UPI links – review your proposed system architecture.

Socket Count

Although dual-socket systems are the most common for HPC & AI workloads, there are use cases requiring larger or smaller numbers of CPUs. The plot below compares the various CPU socket counts supported by this processor line-up (supporting either one or two sockets). Workloads needing more than two sockets will need to look to other product lines.

TDP

Although there are still processor models in the same power range as previous generations, most models feature TDPs above 140 Watts. More than half of the models require 205+ Watts, and several models consume 250 Watts or more. HPC users must be certain that the systems they select have received thorough thermal validation. Systems which run warm will suffer lower performance. In particular, care is recommended with higher clock speed CPUs (2.5+ GHz) which may reduce their clock speeds more aggressively to remain within thermal limits.

In addition to the specifications called out above, technical readers should note that the “Ice Lake SP” CPU architecture inherits most of the architectural design of the previous “Cascade Lake-SP” architecture, including the mesh processor layout, redesigned L2/L3 caches, greater UPI connectivity between CPU sockets, and improvements to the processor frequency speeds/turbo.

As stated at the top, most of this new CPU family offers excellent performance. However, it is common for HPC sites to set a minimum floor on CPU clock speeds (usually around 2.5GHz), with the intent that no workload suffers too low of a performance. While there are users who would demand even higher clock speeds, experience shows that most groups settle on a minimum clock speed in the 2.5GHz to 2.6GHz range. With that in mind, the comparisons below highlight only those CPU models which offer 2.5+GHz performance.

2.5+GHz Core Counts

AVX-512 Performance

AVX2 Performance

2.5+GHz Cost-Effectiveness

Overview, Features & Comparison – Blog

Product Overview

Based on the latest generation Intel® microarchitecture (codenamed Sandy Bridge), the Intel® Xeon® processor E5-2600 series is the first Intel® platform to offer solutions ranging from eight-core/singlesocket
up to 16-core/two-socket, providing significant performance improvement over previous-generation Intel Xeon processors.

Today’s private clouds enable greater business agility and efficiency by delivering innovative and scalable services that drive value back into the organizations they serve. Building and managing these services require systems that scale to meet your ever changing needs, while delivering solutions to the challenges IT managers face every day. Server and storage platforms based on the Intel® Xeon® processor E5 family are built to be at the heart of today’s data centers and enable powerful, versatile private cloud solutions.

The Intel® Xeon® processor E5 family support key cloud requirements that make your data center agile and more responsive to changing business needs. Maintaining industry leadership for power efficiency and continuing Intel Xeon processor performance advancements, the Intel® Xeon® processor E5 family gives you additional performance when you need it for single- and multi-threaded applications, including increased floating-point performance for technical computing workloads.

Table 1. Intel Xeon processor E5-1600/2600 product families features

Higher performance for peak workloads

• Intel® Turbo Boost Technology 2. 03 takes advantage of power and thermal headroom to increase processor
frequencies for peak workloads
• Provides more and higher performance boosts and improved efficiency versus the previous generation

Higher performance for HPC applications

• Intel® Advanced Vector Extensions (Intel® AVX) accelerates vector and floating point computations by
increasing maximum vector size from 128 to 256 bits
• Provides up to 2x performance boost4 for floating point operations, which can significantly increase
performance for high performance computing (HPC)5 applications

The first integrated storage and server processor

• Supports key storage processor features, including non-transparent bridging to increase scalability; the
ability to connect multiple systems, each with access to the other’s memory window; accelerated RAID,
which eliminates the need for a custom ASIC to perform RAID 5 and 6 operations

Hardened protection for virtual/cloud environments

• Intel® Trusted Execution Technology (Intel® TXT) lets IT establish trusted pools of virtualized resources
for stronger security in virtual and cloud environments

Industry-leading energy-efficiency

• Intel® Intelligent Power Technology11 dynamically manages CPU and memory energy states as workloads vary
to minimize power without slowing performance
• More sensors, finer-grained control, faster control loops, and greater accuracy increase power savings versus
the prior generation

One Solution, Multiple Advantages

This is the first dual-socket server-class Intel® processor series with PCI Express 3. 02 integrated on the processor die, to boost performance, reduce latency and save power. Integrated PCI Express 3.0 saves board real estate over previousgeneration three-chip platforms by eliminating the need for a separate I/O hub.

Developers can create a single design, then deliver a family of products due to a broad range of processor features, including eight cores (16 threads) or six cores (12 threads) and a thermal design power (TDP) range of 70 W to 95 W. Single- or dual-socket configurations, when paired with the Intel® C604 or Intel® C602-J chipset,  maintain compatibility with enterprise platform requirements.

This platform provides the performance, I/O and memory capabilities for a wide range of compute-intensive embedded
and communications applications:
• Communication infrastructure servers, blades and appliances
• Security servers, blades and appliances
• Carrier-grade rack-mount servers
• Proprietary form factors, such as router modules
• AdvancedTCA-based blades
• Medical servers, blades and appliances

Intel Xeon E5-2600 Processors Comparison

Comparison chart

—————————————————

All other brands, names and trademarks are the property of their respective owners.  Intel, the Intel logo,
and Xeon are trademarks of Intel Corporation in the U.S. and other countries.
Sentral IT Ltd is Intel Technology Provider, Gold Partner. Find out about our Intel servers & storage.

Like this:

Like Loading…

Blender 2.8 processor performance table

March 13
2022

Buyer, this is a very important page for you!

This table and chart show the test results of our workstations.

How is testing done?

It’s very simple: on the assembled station, before issuing it to the client, the rendering of the same scene (pavilion barcelona) in Blender 2.8 starts, and the results of rendering the image (1280*720) are recorded in our summary table in seconds.

Thus, you and we see the speed of work in a real application, and the shorter the rendering time, the more efficient the workstation will show itself in this work.

Why is this needed?

WE based on these data will help you choose the right station, but YOU can save some money)

The table and graphs allow you to compare processors with each other, compare them with the power of your current computer, in general, do not buy in a bag. » The data in this table has been collected over the years, updated and added, and there are more and more processors. This update added Xeon Gold processors, expanded the Xeon E5 line and many new desktop processors.

Data update from 10/12/2022.

Processor Render time
2x Xeon Platinum 8176 142
1x Threadripper 3970x 148
2x Xeon Gold 6154 176
2x Xeon E5 2696v4 192
2x Xeon E5 2699v4 192
2x Xeon Gold 6148 199
2x Xeon E5 2673v4 219
4x Xeon E5 4657lv2 221
2x Xeon E5 2697v4 223
2x Xeon E5 2695v4 250
2x Xeon E5 2699v3 253
4x Xeon E5 4650v2 254
Ryzen 9 5950x 270
2x Xeon E5 2680v4 275
Ryzen 9 3950x 286
Core i9 12900KF 290
2x Xeon E5 2697v3 291
2x Xeon E5 2690v3 315
4x Xeon E5 4650 323
1x Xeon Gold 6154 320
Ryzen 9 5900x 326
2x Xeon E5 2678v3 338
Core i7 12700K 353
1x Xeon E5 2696v4 376
2x Xeon E5 2666v3 365
Ryzen 9 3900x 371
Core i7 12700F 376
2x Xeon E5 2660v3 388
2x Xeon E5 2695v2 409
2x Xeon E5 2690v2 410
Core i9 10900KF 414
Core i9 10850KF 430
2x Xeon E5 2670v2 465
Core i5 12600K 445
Core i9 11900KF 475
2x Xeon E5 2640v3 492
Core i7 11700F 527
2x Xeon E5 2690 536
Core i9 9900KF 538
2x Xeon E5 2643v3 537
1x Xeon E5 2680v4 537
Core i7 10700K 518
Core i7 11700K 488
2x Xeon E5 2650v2 558
1x Xeon E5 2697v3 561
Ryzen 7 3700x 563
1x Threadripper 2970WX 562
2x Xeon E5 2670 586
1x Xeon w-2145 638
1x Xeon E5 2680v3 658
Core i9 11900F 686
Core i5 12400F 673
Ryzen 5 5600X 669
1x Xeon E5 2678v3 660
Core i7 9700k@5. 0 732
1x Xeon E5 2660v3 793
Core i7 9700k 782
Core i5 10400 880
Ryzen 5 2600x 901
1xE5 2670 v2 918
Ryzen 5 2600 935
Core i5 11400 948
2x X5675 976
Core i3 12100 972
2x Xeon X5670 1013
2x Xeon E5 2643 1049
1x Xeon E5 2690 1066
1x Xeon E5 1650v3 1084
Core i5 9400f 1156
2x Xeon E5649 1156
1x Xeon E5 2670 1165
2x Xeon X5677 1293
1x Xeon E5 1660 1325
Core i3 10100 1345
1x Xeon E5 1650 1366
Ryzen 5 3400G 1400
2x Xeon X5570 1432
1x Xeon E5 1620v3 1667
Core i3 9100f 1838
1x Xeon E5 2623 v4 1911
1x Xeon X5675 1943
1x Xeon E3 1245 2136
Ryzen 3 2200 2220
1x Xeon E5649 2399
1x Xeon X5677 2589
Pentium G6400 3484

Second part, slower processors.

Share

Tweet

Share

Share

New comment

Sign in with

Submit

Comparative memo on Intel Scalable processors / Sudo Null IT News

exebit

Time to read
2 min

Number of views

16K

Computer hardware

From the sandbox

In fact, the transition of OEM manufacturers to the use of Intel Scalable processors (Skylake-SP) has already been completed. You can learn about new features from the article on the Intel blog. Here we will talk about the user approach in choosing processors when ordering equipment.

How we are offered to make a choice:

Intel Scalable processors are divided into 5 different groups that have different purposes and pricing policies. Bronze for the most budget systems, Silver has the best price/performance ratio, Gold is required for performance or 4 socket servers, and only Platinum supports 8 socket motherboards.

Group Price range* Number of sockets Number of cores Minimum cost per core
Bronze 31XX $213-$316 2 6-8 $35
Silver 41XX $417-$1112 2 4-12 $52
Gold 51XX $1221-$1727 2 4-14 $106
Gold 61XX $1691-$5949 2.4 6-22 $124
Platinum 81XX $3115-$13011 2,4,8 4-28 $195

* Prices taken from the Intel website

The differences between the families are not only in the number of cores and sockets. Also, the marketing alignment was made according to the maximum core frequency, supported memory and the operation of 512-bit FMA instructions.

Base clock core

  • Bronze 1.7 GHz
  • Silver 1.8-2.6 GHz
  • Gold 51XX 1.9-3.6 GHz
  • Gold 61XX 2.0-3.5 GHz
  • Platinum 2.0-3.6 GHz

Max memory

Supported memory models

  • Bronze DDR4-2133
  • Silver DDR4-2400
  • Gold 51XX DDR4-2400
  • Gold 61XX DDR4-2666
  • Platinum DDR4-2666

AVX-512 support

Number of instructions executed by FMA at a time

  • Bronze single
  • Silver one
  • Gold 51XX one
  • Gold 61XX two
  • Platinum two

Limitations in the operation of memory or FMA instructions do not affect the final performance as much as the number of cores, so you can leave them out of the analysis when choosing.