[HH] cores: AMD vs Intel

Tom Metro tmetro+hhacking at gmail.com
Wed Mar 13 03:29:23 EDT 2013


Federico Lucifredi wrote:
> Any nice comparison of per-core performance between AMD and Intel?

Do you literally want a comparison that looks at the performance of an
individual core, in isolation?

The comparisons I've seen uses the usual benchmarks, which attempt to
emulate real-world usage, and thus they're impacted by the imperfect way
tasks are distributed to multiple cores, and memory bandwidth, etc.

What's your application?


> Because, looking at the FX processors six and eight core prices, they
> are nowhere near what a 6 core Intel chip costs… so i'd like to educate
> myself on the why.

The short answer is they're cheaper because they perform considerably
worse. At least that's what I've read.

I don't recall a specific comparison, but at some point I read one at
Anandtech or Tom's Hardware.

Here's one:
http://www.anandtech.com/show/6396/the-vishera-review-amd-fx8350-fx8320-fx6300-and-fx4300-tested

  Last year's launch of AMD's FX processors was honestly disappointing.
  The Bulldozer CPU cores that were bundled into each Zambezi chip were
  hardly power efficient and in many areas couldn't significantly
  outperform AMD's previous generation platform. Look beyond the direct
  AMD comparison and the situation looked even worse. In our conclusion
  to last year's FX-8150 review I wrote the following:

    "Single threaded performance is my biggest concern, and compared to
    Sandy Bridge there's a good 40-50% advantage the i5 2500K enjoys
    over the FX-8150. ..."

Here's a thread linking to reviews comparing the FX-6300 to the i3:
http://www.tomshardware.com/forum/id-1623327/6300-gaming.html


The problem with these sites is that they review with desktop or gaming
performance in mind. What I wanted to know about was VM performance.
Can, for example, you attain better performance per dollar running 8 VMs
on an AMD 8-core CPU than you can on a 4-core i7?


In other CPU news, I see...

Calxeda's ARM server tested
http://www.anandtech.com/show/6757/calxedas-arm-server-tested

  At first sight, the relatively low performance per core of ARM CPUs
  seems like a bad match for servers. The dominant CPU in the server
  market is without doubt Intel's Xeon. The success of the Xeon family
  is largely rooted in its excellent single-threaded (or per core)
  performance at moderate power levels (70-95W). Combine this
  exceptional single-threaded performance with a decent core count and
  you get good performance in almost any kind of application.
  [...]
  Having four 32-bit Cortex A9 cores, each with 32 KB instruction and 32
  KB data L1 per-core caches, the processor block is somewhat similar to
  what we find inside modern smarphones. One difference is that this SoC
  contains a 4MB ECC enabled L2 cache, while most smartphone SoCs have a
  1MB L2 cache.
  [...]
  A dual Xeon E5 or Opteron 6300 server has much more processing power
  than most of us need to run one server application. That is the reason
  why it is not uncommon to see 10, 20 or even more virtual machines
  running on top of them.
  [...]
  Each server node has one quad-core Cortex A9 with 4MB of L2 cache and
  4GB of RAM. With that being the case, the question "what can this
  server node cope with?" is a lot more relevant.
  [...]
  The ARM based server is a pretty bad choice right now for memory
  intensive workloads. Even with four cores and DDR3-1333, the useable
  bandwidth is less than one sixth of what one Xeon core can sustain.

  In a similar vein, the ECX-1000 is not capable of providing more
  bandwidth than an Atom system equipped with DDR2-667. However, both
  the Atom and ARM cores are pretty bad when it comes to bandwidth.
  [...]
  Clock for clock, the out-of-order Cortex-A9 inside the Calxeda
  EXC-1000 beats the in-order Atom core. A single A9 has no trouble
  beating the older Atoms while likewise coming close to the much higher
  clocked N2800. The N2800 and ECX-1000 perform similarly.
  [...]
  Looking at both decompression and compression, it looks like a quad
  ARM A9 is about as fast as one Xeon core (without Hyper-Threading) at
  the same clock. We need about six A9 cores to match the Xeon core with
  Hyper-Threading enabled. The quad-core ECX-1000 1.4GHz is also close
  to the dual-core, four-threaded Atom at 1.86GHz. This bodes well for
  Calxeda as the 6.1W S1240 only runs at 1.6GHz.
  [...]
  We created 24 virtual machines on top of the Xeon server. ...
  [Calxeda's] server gets the same workload, but instead of using
  virtual machines, we used the 24 physical server nodes.
  [...]
  At the low concurrencies, the Intel machine leverages turboboost and
  its exceptionally high per core performance. At the higher web loads,
  the total throughput of the 96 (24x quad-core SoCs) ARM A9 cores is up
  to 50% higher than the low power 32 thread/16 core (2x Octal core)
  Xeons. Even the mighty 2660 cannot beat the herd of ARM SoCs.
  [...]
  ...each server needs about 8.3W (200W/24), measured at the wall. That
  is exactly what Calxeda promised: about 6W (at 1.4GHz) per server node
  (measured internally), up to 8.5W measured at the wall (again at
  1.4GHz). That is nothing short of amazing if you consider the
  performance numbers.
  [...]
  Let's be clear: most applications still run better on the Xeon E5. Our
  CPU benchmarks clearly indicate that any application that accesses the
  memory frequently or that needs high per thread integer processing
  power will run better on the Xeon E5. Compiling and installing
  software simply feels so much faster on the Xeon E5, there is no need
  to benchmark.

(See the article for lots more details.)

The 8-core AMD parts may be useful in a similar way to the ARM parts,
except the AMD parts tend to get dinged for power consumption, even
compared to the higher performing Intel parts. So even if they do make
sense in this context, it won't be long before the ARM servers take over
that space.

 -Tom



More information about the Hardwarehacking mailing list