Benchmarking results

The latest graph, as of May 4, 2001, is here.

This table summarizes benchmark results for various machines, obtained by computing complex Fourier transforms of varying size. The number n (in the second through fourth column headings) is the number of complex floats passed to the Fourier transform (1 complex float = 8 bytes of data, so, for example, 2^12 = 4096 complex floats = 32768 bytes). Each entry gives the speed in Mflops (1 Mflops = 1 million floating-point operations per second) and the time in milliseconds per transform (ms/fft). The higher the Mflops and the lower the milliseconds, the better. We are most concerned with the performance at n=2^20 (the fourth column).
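
The arithmetic in the paragraph above can be sketched in a few lines. The page does not say how floating-point operations are counted; the sketch below assumes the common nominal count of 5 n log2(n) operations for a complex FFT of length n, which happens to reproduce the tabulated Mflops from the tabulated ms/fft to within rounding. The function names are illustrative only.

```python
import math

BYTES_PER_COMPLEX_FLOAT = 8  # from the text: 1 complex float = 8 bytes


def transform_bytes(n):
    """Size in bytes of an input of n complex floats."""
    return n * BYTES_PER_COMPLEX_FLOAT


def nominal_flops(n):
    """Assumed operation count for one complex FFT of length n (5 n log2 n)."""
    return 5.0 * n * math.log2(n)


def mflops_from_ms(n, ms_per_fft):
    """Convert a time per transform in milliseconds to Mflops."""
    seconds = ms_per_fft * 1e-3
    return nominal_flops(n) / seconds / 1e6


def ms_from_mflops(n, mflops):
    """Convert a speed in Mflops to milliseconds per transform."""
    return nominal_flops(n) / (mflops * 1e6) * 1e3
```

For example, `transform_bytes(2**12)` gives 32768, and `mflops_from_ms(2**12, 1.33)` comes out at about 185, matching the first row of the table.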

For easier comparison, a graph of each machine's fastest overall results can be found here.
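
Some machines appear in the table more than once, with different compilers. One plausible reading of "fastest overall result" is the best n=2^20 figure per machine; the sketch below picks it out for a few rows copied from the table on this page. Keying on model plus CPU is an assumption made here so that the 500MHz and 667MHz XP1000 systems would not be merged.

```python
# (model, CPU, compiler, Mflops at n=2^20), copied from the table on this page.
ROWS = [
    ("DEC Alpha PW500au", "Alpha 21164 EV56 500MHz", "gcc egcs-2.91.66", 127.0),
    ("DEC Alpha PW500au", "Alpha 21164 EV56 500MHz", "ccc 6.2.9.504-2", 87.6),
    ("Compaq Alpha XP1000", "Alpha 21264 EV6 500MHz", "gcc egcs-2.91.66", 141.0),
    ("Compaq Alpha XP1000", "Alpha 21264 EV6 500MHz", "ccc 6.2.9.504-2", 126.0),
    ("Gateway Professional M1000", "Intel PIII (Coppermine) 1GHz",
     "gcc egcs-2.91.66", 280.0),
]


def fastest_per_machine(rows):
    """Return {(model, cpu): (compiler, mflops)} keeping the highest Mflops."""
    best = {}
    for model, cpu, compiler, mflops in rows:
        key = (model, cpu)
        if key not in best or mflops > best[key][1]:
            best[key] = (compiler, mflops)
    return best


if __name__ == "__main__":
    # Print machines from fastest to slowest at n=2^20.
    ranked = sorted(fastest_per_machine(ROWS).items(),
                    key=lambda item: item[1][1], reverse=True)
    for (model, cpu), (compiler, mflops) in ranked:
        print(f"{model} ({cpu}): {mflops} Mflops with {compiler}")
```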

| Manufacturer and Model | n=2^12 | n=2^16 | n=2^20 | CPU type and MHz | Memory Cache | Bus Speed | RAM | Compiler |
|---|---|---|---|---|---|---|---|---|
| DEC Alpha XL300 | 185 Mflops, 1.33 ms/fft | 133 Mflops, 39.4 ms/fft | 77.4 Mflops, 1350 ms/fft | Alpha EV5 21164, 300MHz | 8K data/8K instruction, 96K internal/2M L2 | ? | 256M | ccc |
| DEC Alpha PW500au | 249 Mflops, 0.988 ms/fft | 159 Mflops, 33.1 ms/fft | 127 Mflops, 827 ms/fft | Alpha 21164 EV56, 500MHz | 96K internal, 2M L3 | ? | 128M | gcc version egcs-2.91.66 |
| DEC Alpha PW500au | 328 Mflops, 0.748 ms/fft | 173 Mflops, 30.3 ms/fft | 87.6 Mflops, 1200 ms/fft | Alpha 21164 EV56, 500MHz | 96K internal, 2M L3 | ? | 128M | ccc version 6.2.9.504-2 |
| Compaq Alpha XP1000 | 711 Mflops, 0.346 ms/fft | 192 Mflops, 27.2 ms/fft | 141 Mflops, 742 ms/fft | Alpha 21264 EV6, 500MHz | 64K data/64K instruction, 4M L2 | 83MHz | 256M | gcc version egcs-2.91.66 |
| Compaq Alpha XP1000 | 715 Mflops, 0.344 ms/fft | 189 Mflops, 27.7 ms/fft | 126 Mflops, 831 ms/fft | Alpha 21264 EV6, 500MHz | 64K data/64K instruction, 4M L2 | 83MHz | 256M | ccc version 6.2.9.504-2 |
| Gateway Professional M1000 | 609 Mflops, 0.403 ms/fft | 380 Mflops, 13.8 ms/fft | 280 Mflops, 374 ms/fft | Intel PIII (Coppermine), 1GHz | 256K | 133MHz | 512M | gcc version egcs-2.91.66 |
| Gateway Select SB PC | 1030 Mflops, 0.239 ms/fft | 371 Mflops, 14.1 ms/fft | 254 Mflops, 413 ms/fft | AMD Athlon, 1.2GHz | 128K L1/256K L2 | 133MHz | 512M | gcc version egcs-2.91.66 |
| PCWisconsin PIII | 581 Mflops, 0.423 ms/fft | 376 Mflops, 13.9 ms/fft | 261 Mflops, 402 ms/fft | Intel PIII (Coppermine), 933MHz | ? | 133MHz | 512M | gcc version egcs-2.91.66 |
| Dell Dimension 8100 | 762 Mflops, 0.322 ms/fft | 231 Mflops, 22.7 ms/fft | 208 Mflops, 504 ms/fft | Intel P4, 1.4GHz | 256K | 100MHz | 512M | gcc version egcs-2.95.2 |
| Dell Precision 220 | 634 Mflops, 0.388 ms/fft | 217 Mflops, 24.2 ms/fft | 210 Mflops, 500 ms/fft | Intel PIII, 1GHz | 256K | 133MHz | 512M | gcc version egcs-2.91.66 |
| Compaq Alpha XP1000 | 927 Mflops, 0.265 ms/fft | 213 Mflops, 24.6 ms/fft | 146 Mflops, 720 ms/fft | Alpha 21264 EV67, 667MHz | 64K inst. and data, 4MB L2 | 83MHz | 512MB | DEC cc 3.11 |
| Linux Networx LNXI Athlon 1.2 DDR | 1038 Mflops, 0.237 ms/fft | 193 Mflops, 27.2 ms/fft | 175 Mflops, 599 ms/fft | AMD Athlon, 1.2GHz | 256K L2 | 266MHz | 512MB DDR | gcc version egcs-2.91.66 |
| IBM xSeries 200 | 551 Mflops, 0.446 ms/fft | 229 Mflops, 22.9 ms/fft | 217 Mflops, 483 ms/fft | Intel PIII, 866MHz | 256K | 133MHz | 590MB | gcc version egcs-2.91.66 |
| IBM xSeries 220 | 593 Mflops, 0.414 ms/fft | 259 Mflops, 20.3 ms/fft | 250 Mflops, 420 ms/fft | Intel PIII, 933MHz | 256K | 133MHz | 512MB | gcc version egcs-2.91.66 |
| Intel D815EEA2L PIII | 634 Mflops, 0.388 ms/fft | 377 Mflops, 13.9 ms/fft | 287 Mflops, 365 ms/fft | Intel PIII, 1GHz | 256K | 133MHz | 512MB | gcc version egcs-2.91.66 |
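
A measurement of the kind tabulated above can be sketched as follows. The page does not name the FFT library or the timing method used, so everything here is an assumption: a plain recursive radix-2 FFT in pure Python stands in for the real transform, the best of a few runs is taken as the time, and Mflops is derived from the nominal 5 n log2(n) operation count. Python complex numbers are 16 bytes (double precision) rather than the 8-byte complex floats described on this page, so the resulting numbers are not comparable with the table; only the procedure is illustrated.

```python
import cmath
import math
import time


def fft(x):
    """Recursive radix-2 Cooley-Tukey FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2])
    odd = fft(x[1::2])
    half = n // 2
    out = [0j] * n
    for k in range(half):
        # Twiddle factor times the odd-indexed sub-transform.
        t = cmath.exp(-2j * math.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + half] = even[k] - t
    return out


def benchmark(n, repeats=3):
    """Return (mflops, ms_per_fft) for one length-n transform, best of `repeats`."""
    data = [complex(i % 7, -(i % 3)) for i in range(n)]  # arbitrary test input
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fft(data)
        best = min(best, time.perf_counter() - start)
    ms_per_fft = best * 1e3
    mflops = 5.0 * n * math.log2(n) / best / 1e6  # assumed flop count
    return mflops, ms_per_fft


if __name__ == "__main__":
    for exponent in (8, 10, 12):  # small sizes; pure Python is slow
        mflops, ms = benchmark(2 ** exponent)
        print(f"n=2^{exponent}: {mflops:.3g} Mflops, {ms:.3g} ms/fft")
```

Taking the minimum over several runs is a design choice that reduces the influence of other activity on the machine; an average over many transforms would be an equally reasonable alternative.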


$Id: benchmarking_results.html,v 1.7 2003/01/06 23:04:07 kflasch Exp $
Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.