6
« on: June 05, 2024, 07:32:37 am »
Let's have a bit of fun shall we? Below is my benchmark results on my Blackbird with 8 cores POWER9. What's your score?
$ lscpu
Architecture: ppc64le
Byte Order: Little Endian
CPU(s): 32
On-line CPU(s) list: 0-31
Model name: POWER9, altivec supported
Model: 2.3 (pvr 004e 1203)
Thread(s) per core: 4
Core(s) per socket: 8
Socket(s): 1
Frequency boost: enabled
CPU(s) scaling MHz: 58%
CPU max MHz: 3800.0000
CPU min MHz: 2166.0000
Caches (sum of all):
L1d: 256 KiB (8 instances)
L1i: 256 KiB (8 instances)
L2: 4 MiB (8 instances)
L3: 80 MiB (8 instances)
NUMA:
NUMA node(s): 1
NUMA node0 CPU(s): 0-31
Vulnerabilities:
Gather data sampling: Not affected
Itlb multihit: Not affected
L1tf: Mitigation; RFI Flush, L1D private per thread
Mds: Not affected
Meltdown: Mitigation; RFI Flush, L1D private per thread
Mmio stale data: Not affected
Reg file data sampling: Not affected
Retbleed: Not affected
Spec rstack overflow: Not affected
Spec store bypass: Mitigation; Kernel entry/exit barrier (eieio)
Spectre v1: Mitigation; __user pointer sanitization, ori31 speculation b
arrier enabled
Spectre v2: Mitigation; Software count cache flush (hardware accelerated
), Software link stack flush
Srbds: Not affected
Tsx async abort: Not affected
# # # # # # # ##### ###### # # #### # #
# # ## # # # # # # # ## # # # # #
# # # # # # ## ##### ##### # # # # ######
# # # # # # ## # # # # # # # # #
# # # ## # # # # # # # ## # # # #
#### # # # # # ##### ###### # # #### # #
Version 5.1.3 Based on the Byte Magazine Unix Benchmark
Multi-CPU version Version 5 revisions by Ian Smith,
Sunnyvale, CA, USA
January 13, 2011 johantheghost at yahoo period com
------------------------------------------------------------------------------
Use directories for:
* File I/O tests (named fs***) = /home/tle/Work/byte-unixbench/UnixBench/tmp
* Results = /home/tle/Work/byte-unixbench/UnixBench/results
------------------------------------------------------------------------------
1 x Dhrystone 2 using register variables 1 2 3 4 5 6 7 8 9 10
1 x Double-Precision Whetstone 1 2 3 4 5 6 7 8 9 10
1 x Execl Throughput 1 2 3
1 x File Copy 1024 bufsize 2000 maxblocks 1 2 3
1 x File Copy 256 bufsize 500 maxblocks 1 2 3
1 x File Copy 4096 bufsize 8000 maxblocks 1 2 3
1 x Pipe Throughput 1 2 3 4 5 6 7 8 9 10
1 x Pipe-based Context Switching 1 2 3 4 5 6 7 8 9 10
1 x Process Creation 1 2 3
1 x System Call Overhead 1 2 3 4 5 6 7 8 9 10
1 x Shell Scripts (1 concurrent) 1 2 3
1 x Shell Scripts (8 concurrent) 1 2 3
32 x Dhrystone 2 using register variables 1 2 3 4 5 6 7 8 9 10
32 x Double-Precision Whetstone 1 2 3 4 5 6 7 8 9 10
32 x Execl Throughput 1 2 3
32 x File Copy 1024 bufsize 2000 maxblocks 1 2 3
32 x File Copy 256 bufsize 500 maxblocks 1 2 3
32 x File Copy 4096 bufsize 8000 maxblocks 1 2 3
32 x Pipe Throughput 1 2 3 4 5 6 7 8 9 10
32 x Pipe-based Context Switching 1 2 3 4 5 6 7 8 9 10
32 x Process Creation 1 2 3
32 x System Call Overhead 1 2 3 4 5 6 7 8 9 10
32 x Shell Scripts (1 concurrent) 1 2 3
32 x Shell Scripts (8 concurrent) 1 2 3
========================================================================
BYTE UNIX Benchmarks (Version 5.1.3)
System: shrimp-paste: GNU/Linux
OS: GNU/Linux -- 6.8.11-300.fc40.ppc64le -- #1 SMP Mon May 27 14:48:15 UTC 2024
Machine: ppc64le (unknown)
Language: en_US.utf8 (charmap="UTF-8", collate="UTF-8")
20:34:42 up 7:16, 2 users, load average: 0.17, 23.10, 33.96; runlevel 2024-06-05
------------------------------------------------------------------------
Benchmark Run: Wed Jun 05 2024 20:34:42 - 21:03:09
32 CPUs in system; running 1 parallel copy of tests
Dhrystone 2 using register variables 43066559.3 lps (10.0 s, 7 samples)
Double-Precision Whetstone 4835.0 MWIPS (10.0 s, 7 samples)
Execl Throughput 3317.1 lps (29.8 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks 241162.4 KBps (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks 61272.0 KBps (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks 846105.6 KBps (30.0 s, 2 samples)
Pipe Throughput 779278.4 lps (10.0 s, 7 samples)
Pipe-based Context Switching 41152.3 lps (10.0 s, 7 samples)
Process Creation 4803.7 lps (30.0 s, 2 samples)
Shell Scripts (1 concurrent) 4640.7 lpm (60.0 s, 2 samples)
Shell Scripts (8 concurrent) 3796.2 lpm (60.0 s, 2 samples)
System Call Overhead 745761.8 lps (10.0 s, 7 samples)
System Benchmarks Index Values BASELINE RESULT INDEX
Dhrystone 2 using register variables 116700.0 43066559.3 3690.4
Double-Precision Whetstone 55.0 4835.0 879.1
Execl Throughput 43.0 3317.1 771.4
File Copy 1024 bufsize 2000 maxblocks 3960.0 241162.4 609.0
File Copy 256 bufsize 500 maxblocks 1655.0 61272.0 370.2
File Copy 4096 bufsize 8000 maxblocks 5800.0 846105.6 1458.8
Pipe Throughput 12440.0 779278.4 626.4
Pipe-based Context Switching 4000.0 41152.3 102.9
Process Creation 126.0 4803.7 381.2
Shell Scripts (1 concurrent) 42.4 4640.7 1094.5
Shell Scripts (8 concurrent) 6.0 3796.2 6327.0
System Call Overhead 15000.0 745761.8 497.2
========
System Benchmarks Index Score 800.9
------------------------------------------------------------------------
Benchmark Run: Wed Jun 05 2024 21:03:09 - 21:32:30
32 CPUs in system; running 32 parallel copies of tests
Dhrystone 2 using register variables 449736205.6 lps (10.0 s, 7 samples)
Double-Precision Whetstone 112382.5 MWIPS (9.8 s, 7 samples)
Execl Throughput 36818.4 lps (29.8 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks 878485.1 KBps (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks 212868.6 KBps (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks 3705686.7 KBps (30.0 s, 2 samples)
Pipe Throughput 10828494.4 lps (10.0 s, 7 samples)
Pipe-based Context Switching 1449711.4 lps (10.0 s, 7 samples)
Process Creation 70064.8 lps (30.0 s, 2 samples)
Shell Scripts (1 concurrent) 67413.9 lpm (60.0 s, 2 samples)
Shell Scripts (8 concurrent) 8397.6 lpm (60.1 s, 2 samples)
System Call Overhead 13942866.4 lps (10.0 s, 7 samples)
System Benchmarks Index Values BASELINE RESULT INDEX
Dhrystone 2 using register variables 116700.0 449736205.6 38537.8
Double-Precision Whetstone 55.0 112382.5 20433.2
Execl Throughput 43.0 36818.4 8562.4
File Copy 1024 bufsize 2000 maxblocks 3960.0 878485.1 2218.4
File Copy 256 bufsize 500 maxblocks 1655.0 212868.6 1286.2
File Copy 4096 bufsize 8000 maxblocks 5800.0 3705686.7 6389.1
Pipe Throughput 12440.0 10828494.4 8704.6
Pipe-based Context Switching 4000.0 1449711.4 3624.3
Process Creation 126.0 70064.8 5560.7
Shell Scripts (1 concurrent) 42.4 67413.9 15899.5
Shell Scripts (8 concurrent) 6.0 8397.6 13996.0
System Call Overhead 15000.0 13942866.4 9295.2
========
System Benchmarks Index Score 7717.0