- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Fausto Artico
Link Copied
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Hello Fausto,
I believe that bloomfield uses the nehalem cpu architecture and I think that nehalem could get a max 8 SP FLOPS/cycle (per core I assume) and 4 DP flops/cycle/core.
See http://www.realworldtech.com/sandy-bridge/6/ (which discusses nehalem, sandybridge and AMD's bulldozer).
But 'theoretical peak flops' is probably not so useful in real life. I would try to find some published benchmark that does something similar to what you intend to use with your system.
Pat
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
>>>I believe that bloomfield uses the nehalem cpu architecture and I think that nehalem could get a max 8 SP FLOPS/cycle (per core I assume) and 4 DP flops/cycle/core.>>>
Yes IIRC that CPU can achieve 8 SP FLOPS/cycle and 4 DP/cycle. MAX supported ISA extension is SSE 4.2.
http://www.anandtech.com/show/6355/intels-haswell-architecture/8
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
>>>and please explain me how you calculated them.>>>
You can code your own benchmark by using for example SAXPY A = a * A + B like computation.
In pseudocode:
// Allocate memory for float* A and float* B by using malloc() function.
//Initialize both of arrays
//Use double for-loop where outer loop will run 1.0e+6 iterations of shorter inner loop.
Calculate FP operations/sec by using following formula
GFLOPS = 1/MAX CPU FREQ * outer loop count * inner loop count * FLOPS/cycle.
You should multiply GLOPS result by the number of CPU cores.
As a side note peak theoretical FLOP/s is CPU MAX FREQ * Number of Cores * Number of SP FLOPS/cycle.
Peak theoretical SP FLOP/s for Intel Bloomfield CPU will be:
3.36e+9 hz * 4 * 8 SP ~ 107.5 SP GLOPS.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
@Fausto
Here is interesting discussion about the FP benchmark.
Check also following link: https://software.intel.com/en-us/articles/estimating-flops-using-event-based-sampling-ebs
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
SKYLAKE 64 SP FLOPS PER CYCLE?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Future skylake server may support peak simd performance per core doubling that of Haswell. Current skylake client CPU apparently is similar to Haswell in that respect.
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
apple a6 8sp per cycle or 1 dp per cycle
apple a7/8 16 sp or 8 dp per cycle = intel ivy bridge
http://dench.flatlib.jp/opengl/devices?s[]=iphone&s[]=6s#ios_soc section - ios soc
and http://dench.flatlib.jp/opengl/cpufop
Its true?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
@Tim
Do you mean the future Xeon Skylake(Purely arch.) which incorporates 512-bit vector registers?
- Mark as New
- Bookmark
- Subscribe
- Mute
- Subscribe to RSS Feed
- Permalink
- Report Inappropriate Content
Vladimir S. wrote:
apple a6 8sp per cycle or 1 dp per cycle
apple a7/8 16 sp or 8 dp per cycle = intel ivy bridge
http://dench.flatlib.jp/opengl/devices?s[]=iphone&s[]=6s#ios_soc section - ios soc
and http://dench.flatlib.jp/opengl/cpufop
Its true?
Why are you trying to compare Apple's SoC to Intel Haswell CPU?
Afaik Apple's solution is based on ARM architecture and probably incorporates NEON SIMD with its 128-bit vector registers.
- Subscribe to RSS Feed
- Mark Topic as New
- Mark Topic as Read
- Float this Topic for Current User
- Bookmark
- Subscribe
- Printer Friendly Page