Add more results in new format authored by Jan Eitzinger's avatar Jan Eitzinger
# System
* **Processor:** Intel(R) Xeon(R) CPU E5-2680
* **Base frequency:** 2.7 GHz
* **Number of sockets:** 2
* **Number of memory domains per socket:** 1
* **Number of cores per socket:** 8
* **Number of HWThreads per core:** 2
* **[MachineState](https://github.com/RRZE-HPC/MachineState) output:** NA
# Tool chain
```
+----------+---------------------------------+
| Compiler | icc (ICC) |
|----------|---------------------------------|
| Version | icc (ICC) 19.0.5.281 20190815 |
+----------+---------------------------------+
```
Optimizing flags: ```-fast -xHost -qopt-streaming-stores=always -std=c99 -ffreestanding -qopenmp```
# Results
All results are in ```GB/s```.
Summary results:
```
+--------------------------------------------+
| Single core | 13.73 (SDaxpy) |
| Memory domain | 44.59 (Sum with 7 cores) |
| Socket | 44.59 (Sum with 7 cores) |
| Node | 90.01 (Sum with 7 cores) |
+--------------------------------------------+
```
Results for scaling within a memory domain:
```
#nt Init Sum Copy Update Triad Daxpy STriad SDaxpy
1 5.43 13.11 7.23 9.37 9.39 11.66 11.43 13.73
2 10.82 25.05 14.21 18.31 18.44 22.61 22.29 26.47
3 16.05 35.14 20.75 26.54 26.53 31.99 31.36 35.84
4 21.10 41.78 26.78 32.34 33.16 36.54 36.62 38.37
5 26.08 44.38 31.88 36.75 36.30 38.55 37.19 38.55
6 30.75 44.32 34.89 38.13 36.72 38.64 36.95 38.43
7 34.97 44.59 36.09 39.04 36.64 38.52 36.91 38.46
8 37.77 44.09 36.05 38.78 36.27 38.16 36.56 38.03
```
Results for scaling across memory domains. Shown are the results for the number of memory domains used (nm) with columns number of cores used per memory domain.
Init:
```
#nm 1 2
1 5.43 12.06
2 10.82 23.84
3 16.05 35.32
4 21.10 46.22
5 26.08 56.69
6 30.75 66.13
7 34.97 74.24
8 37.77 79.30
```
Sum:
```
#nm 1 2
1 13.11 28.39
2 25.05 53.47
3 35.14 71.70
4 41.78 76.11
5 44.38 89.85
6 44.32 89.90
7 44.59 90.01
8 44.09 89.47
```
Copy
```
#nm 1 2
1 7.23 15.75
2 14.21 30.68
3 20.75 44.38
4 26.78 56.93
5 31.88 66.67
6 34.89 72.00
7 36.09 73.53
8 36.05 73.88
```
Update
```
#nm 1 2
1 9.37 18.86
2 18.31 36.71
3 26.54 52.39
4 32.34 65.25
5 36.75 73.44
6 38.13 76.38
7 39.04 78.20
8 38.78 77.95
```
Triad
```
#nm 1 2
1 9.39 20.30
2 18.44 39.47
3 26.53 55.88
4 33.16 68.87
5 36.30 74.44
6 36.72 74.72
7 36.64 74.42
8 36.27 73.86
```
# Scaling
Memory bandwidth scaling within one memory domain:
![Main memory bandwidth scaling plot](figures/Intel-SandyBridge-S2-M1-C8-scaling.png)
The following plots illustrate the the performance scaling over multiple memory domains using different number of cores per memory domain.
Memory bandwidth scaling across memory domains for init:
![Memory domain scaling plot](figures/Intel-SandyBridge-S2-M1-C8-domainsInit.png)
Memory bandwidth scaling across memory domains for sum
![Memory domain scaling plot](figures/Intel-SandyBridge-S2-M1-C8-domainsSum.png)
Memory bandwidth scaling across memory domains for copy
![Memory domain scaling plot](figures/Intel-SandyBridge-S2-M1-C8-domainsCopy.png)
Memory bandwidth scaling across memory domains for Triad
![Memory domain scaling plot](figures/Intel-SandyBridge-S2-M1-C8-domainsTriad.png)