[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
SSE-enabled level 2
Guys,
I just got Camm's SSE-enabled Level 2 incoporated. I include timings on my
500Mhz PIII (256K on-chip L2) laptop below.
Cheers,
Clint
GH : Greg Henry's Intel BLAS
ATL : ATLAS without Camm's kernels
AT+C : ATLAS with Camm's SSE-enabled GEMV/GER
M=N=500 Trans=N, alpha=1.0, beta=1.0
HEMV GERU HER HER2 GERC
GEMV SYMV TRMV TRSV GER SYR SYR2
===== ===== ===== ===== ===== ===== ===== =====
dGH PIII500 500 107.7 51.2 49.3 91.0 49.7 26.2 50.5
dATL PIII500 500 105.4 138.8 98.3 93.3 43.8 41.6 80.5
dAT+C PIII500 500 131.5 120.0 98.5 94.7 43.8 41.6 78.5
sGH PIII500 500 139.1 64.5 68.5 113.0 51.1 50.1 92.3
sATL PIII500 500 139.6 153.1 128.6 117.4 75.1 73.7 111.6
sAT+C PIII500 500 222.9 159.6 166.9 161.7 91.2 88.4 171.5
zGH PIII500 500 137.9 125.0 78.0 59.3 74.9 62.3 107.4 73.4
zATL PIII500 500 138.1 208.9 142.2 134.8 75.1 73.9 115.0 74.9
zAT+C PIII500 500 224.9 184.4 198.9 185.3 115.7 110.2 140.5 115.7
cGH PIII500 500 194.4 188.6 104.7 97.0 99.1 99.5 169.9 98.4
cATL PIII500 500 230.3 232.2 212.3 197.8 100.8 100.0 145.4 100.6
cAT+C PIII500 500 421.7 417.7 353.1 328.8 217.7 209.9 269.1 218.3