Raptor Computing Systems Community Forums (BETA)
General OpenPOWER Hardware => General CPU Discussion => Topic started by: tle on June 05, 2024, 07:32:37 am
-
Let's have a bit of fun shall we? Below is my benchmark results on my Blackbird with 8 cores POWER9. What's your score?
$ lscpu
Architecture: ppc64le
Byte Order: Little Endian
CPU(s): 32
On-line CPU(s) list: 0-31
Model name: POWER9, altivec supported
Model: 2.3 (pvr 004e 1203)
Thread(s) per core: 4
Core(s) per socket: 8
Socket(s): 1
Frequency boost: enabled
CPU(s) scaling MHz: 58%
CPU max MHz: 3800.0000
CPU min MHz: 2166.0000
Caches (sum of all):
L1d: 256 KiB (8 instances)
L1i: 256 KiB (8 instances)
L2: 4 MiB (8 instances)
L3: 80 MiB (8 instances)
NUMA:
NUMA node(s): 1
NUMA node0 CPU(s): 0-31
Vulnerabilities:
Gather data sampling: Not affected
Itlb multihit: Not affected
L1tf: Mitigation; RFI Flush, L1D private per thread
Mds: Not affected
Meltdown: Mitigation; RFI Flush, L1D private per thread
Mmio stale data: Not affected
Reg file data sampling: Not affected
Retbleed: Not affected
Spec rstack overflow: Not affected
Spec store bypass: Mitigation; Kernel entry/exit barrier (eieio)
Spectre v1: Mitigation; __user pointer sanitization, ori31 speculation b
arrier enabled
Spectre v2: Mitigation; Software count cache flush (hardware accelerated
), Software link stack flush
Srbds: Not affected
Tsx async abort: Not affected
# # # # # # # ##### ###### # # #### # #
# # ## # # # # # # # ## # # # # #
# # # # # # ## ##### ##### # # # # ######
# # # # # # ## # # # # # # # # #
# # # ## # # # # # # # ## # # # #
#### # # # # # ##### ###### # # #### # #
Version 5.1.3 Based on the Byte Magazine Unix Benchmark
Multi-CPU version Version 5 revisions by Ian Smith,
Sunnyvale, CA, USA
January 13, 2011 johantheghost at yahoo period com
------------------------------------------------------------------------------
Use directories for:
* File I/O tests (named fs***) = /home/tle/Work/byte-unixbench/UnixBench/tmp
* Results = /home/tle/Work/byte-unixbench/UnixBench/results
------------------------------------------------------------------------------
1 x Dhrystone 2 using register variables 1 2 3 4 5 6 7 8 9 10
1 x Double-Precision Whetstone 1 2 3 4 5 6 7 8 9 10
1 x Execl Throughput 1 2 3
1 x File Copy 1024 bufsize 2000 maxblocks 1 2 3
1 x File Copy 256 bufsize 500 maxblocks 1 2 3
1 x File Copy 4096 bufsize 8000 maxblocks 1 2 3
1 x Pipe Throughput 1 2 3 4 5 6 7 8 9 10
1 x Pipe-based Context Switching 1 2 3 4 5 6 7 8 9 10
1 x Process Creation 1 2 3
1 x System Call Overhead 1 2 3 4 5 6 7 8 9 10
1 x Shell Scripts (1 concurrent) 1 2 3
1 x Shell Scripts (8 concurrent) 1 2 3
32 x Dhrystone 2 using register variables 1 2 3 4 5 6 7 8 9 10
32 x Double-Precision Whetstone 1 2 3 4 5 6 7 8 9 10
32 x Execl Throughput 1 2 3
32 x File Copy 1024 bufsize 2000 maxblocks 1 2 3
32 x File Copy 256 bufsize 500 maxblocks 1 2 3
32 x File Copy 4096 bufsize 8000 maxblocks 1 2 3
32 x Pipe Throughput 1 2 3 4 5 6 7 8 9 10
32 x Pipe-based Context Switching 1 2 3 4 5 6 7 8 9 10
32 x Process Creation 1 2 3
32 x System Call Overhead 1 2 3 4 5 6 7 8 9 10
32 x Shell Scripts (1 concurrent) 1 2 3
32 x Shell Scripts (8 concurrent) 1 2 3
========================================================================
BYTE UNIX Benchmarks (Version 5.1.3)
System: shrimp-paste: GNU/Linux
OS: GNU/Linux -- 6.8.11-300.fc40.ppc64le -- #1 SMP Mon May 27 14:48:15 UTC 2024
Machine: ppc64le (unknown)
Language: en_US.utf8 (charmap="UTF-8", collate="UTF-8")
20:34:42 up 7:16, 2 users, load average: 0.17, 23.10, 33.96; runlevel 2024-06-05
------------------------------------------------------------------------
Benchmark Run: Wed Jun 05 2024 20:34:42 - 21:03:09
32 CPUs in system; running 1 parallel copy of tests
Dhrystone 2 using register variables 43066559.3 lps (10.0 s, 7 samples)
Double-Precision Whetstone 4835.0 MWIPS (10.0 s, 7 samples)
Execl Throughput 3317.1 lps (29.8 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks 241162.4 KBps (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks 61272.0 KBps (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks 846105.6 KBps (30.0 s, 2 samples)
Pipe Throughput 779278.4 lps (10.0 s, 7 samples)
Pipe-based Context Switching 41152.3 lps (10.0 s, 7 samples)
Process Creation 4803.7 lps (30.0 s, 2 samples)
Shell Scripts (1 concurrent) 4640.7 lpm (60.0 s, 2 samples)
Shell Scripts (8 concurrent) 3796.2 lpm (60.0 s, 2 samples)
System Call Overhead 745761.8 lps (10.0 s, 7 samples)
System Benchmarks Index Values BASELINE RESULT INDEX
Dhrystone 2 using register variables 116700.0 43066559.3 3690.4
Double-Precision Whetstone 55.0 4835.0 879.1
Execl Throughput 43.0 3317.1 771.4
File Copy 1024 bufsize 2000 maxblocks 3960.0 241162.4 609.0
File Copy 256 bufsize 500 maxblocks 1655.0 61272.0 370.2
File Copy 4096 bufsize 8000 maxblocks 5800.0 846105.6 1458.8
Pipe Throughput 12440.0 779278.4 626.4
Pipe-based Context Switching 4000.0 41152.3 102.9
Process Creation 126.0 4803.7 381.2
Shell Scripts (1 concurrent) 42.4 4640.7 1094.5
Shell Scripts (8 concurrent) 6.0 3796.2 6327.0
System Call Overhead 15000.0 745761.8 497.2
========
System Benchmarks Index Score 800.9
------------------------------------------------------------------------
Benchmark Run: Wed Jun 05 2024 21:03:09 - 21:32:30
32 CPUs in system; running 32 parallel copies of tests
Dhrystone 2 using register variables 449736205.6 lps (10.0 s, 7 samples)
Double-Precision Whetstone 112382.5 MWIPS (9.8 s, 7 samples)
Execl Throughput 36818.4 lps (29.8 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks 878485.1 KBps (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks 212868.6 KBps (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks 3705686.7 KBps (30.0 s, 2 samples)
Pipe Throughput 10828494.4 lps (10.0 s, 7 samples)
Pipe-based Context Switching 1449711.4 lps (10.0 s, 7 samples)
Process Creation 70064.8 lps (30.0 s, 2 samples)
Shell Scripts (1 concurrent) 67413.9 lpm (60.0 s, 2 samples)
Shell Scripts (8 concurrent) 8397.6 lpm (60.1 s, 2 samples)
System Call Overhead 13942866.4 lps (10.0 s, 7 samples)
System Benchmarks Index Values BASELINE RESULT INDEX
Dhrystone 2 using register variables 116700.0 449736205.6 38537.8
Double-Precision Whetstone 55.0 112382.5 20433.2
Execl Throughput 43.0 36818.4 8562.4
File Copy 1024 bufsize 2000 maxblocks 3960.0 878485.1 2218.4
File Copy 256 bufsize 500 maxblocks 1655.0 212868.6 1286.2
File Copy 4096 bufsize 8000 maxblocks 5800.0 3705686.7 6389.1
Pipe Throughput 12440.0 10828494.4 8704.6
Pipe-based Context Switching 4000.0 1449711.4 3624.3
Process Creation 126.0 70064.8 5560.7
Shell Scripts (1 concurrent) 42.4 67413.9 15899.5
Shell Scripts (8 concurrent) 6.0 8397.6 13996.0
System Call Overhead 15000.0 13942866.4 9295.2
========
System Benchmarks Index Score 7717.0
-
My Talos II, Fedora 39:
Architecture: ppc64le
Byte Order: Little Endian
CPU(s): 72
On-line CPU(s) list: 0-71
Model name: POWER9, altivec supported
Model: 2.3 (pvr 004e 1203)
Thread(s) per core: 4
Core(s) per socket: 18
Socket(s): 1
Frequency boost: enabled
CPU(s) scaling MHz: 61%
CPU max MHz: 3800,0000
CPU min MHz: 2166,0000
Caches (sum of all):
L1d: 576 KiB (18 instances)
L1i: 576 KiB (18 instances)
L2: 4,5 MiB (9 instances)
L3: 90 MiB (9 instances)
NUMA:
NUMA node(s): 1
NUMA node0 CPU(s): 0-71
Vulnerabilities:
Gather data sampling: Not affected
Itlb multihit: Not affected
L1tf: Mitigation; RFI Flush, L1D private per thread
Mds: Not affected
Meltdown: Mitigation; RFI Flush, L1D private per thread
Mmio stale data: Not affected
Reg file data sampling: Not affected
Retbleed: Not affected
Spec rstack overflow: Not affected
Spec store bypass: Mitigation; Kernel entry/exit barrier (eieio)
Spectre v1: Mitigation; __user pointer sanitization, ori31 speculation barrier enabled
Spectre v2: Mitigation; Software count cache flush (hardware accelerated), Software link stack flush
Srbds: Not affected
Tsx async abort: Not affected
# # # # # # # ##### ###### # # #### # #
# # ## # # # # # # # ## # # # # #
# # # # # # ## ##### ##### # # # # ######
# # # # # # ## # # # # # # # # #
# # # ## # # # # # # # ## # # # #
#### # # # # # ##### ###### # # #### # #
Version 5.1.3 Based on the Byte Magazine Unix Benchmark
Multi-CPU version Version 5 revisions by Ian Smith,
Sunnyvale, CA, USA
January 13, 2011 johantheghost at yahoo period com
------------------------------------------------------------------------------
Use directories for:
* File I/O tests (named fs***) = /home/dknoto/Oprogramowanie/Unix-Bench/byte-unixbench-master/UnixBench/tmp
* Results = /home/dknoto/Oprogramowanie/Unix-Bench/byte-unixbench-master/UnixBench/results
------------------------------------------------------------------------------
1 x Dhrystone 2 using register variables 1 2 3 4 5 6 7 8 9 10
1 x Double-Precision Whetstone 1 2 3 4 5 6 7 8 9 10
1 x Execl Throughput 1 2 3
1 x File Copy 1024 bufsize 2000 maxblocks 1 2 3
1 x File Copy 256 bufsize 500 maxblocks 1 2 3
1 x File Copy 4096 bufsize 8000 maxblocks 1 2 3
1 x Pipe Throughput 1 2 3 4 5 6 7 8 9 10
1 x Pipe-based Context Switching 1 2 3 4 5 6 7 8 9 10
1 x Process Creation 1 2 3
1 x System Call Overhead 1 2 3 4 5 6 7 8 9 10
1 x Shell Scripts (1 concurrent) 1 2 3
1 x Shell Scripts (8 concurrent) 1 2 3
72 x Dhrystone 2 using register variables 1 2 3 4 5 6 7 8 9 10
72 x Double-Precision Whetstone 1 2 3 4 5 6 7 8 9 10
72 x Execl Throughput 1 2 3
72 x File Copy 1024 bufsize 2000 maxblocks 1 2 3
72 x File Copy 256 bufsize 500 maxblocks 1 2 3
72 x File Copy 4096 bufsize 8000 maxblocks 1 2 3
72 x Pipe Throughput 1 2 3 4 5 6 7 8 9 10
72 x Pipe-based Context Switching 1 2 3 4 5 6 7 8 9 10
72 x Process Creation 1 2 3
72 x System Call Overhead 1 2 3 4 5 6 7 8 9 10
72 x Shell Scripts (1 concurrent) 1 2 3
72 x Shell Scripts (8 concurrent) 1 2 3
========================================================================
BYTE UNIX Benchmarks (Version 5.1.3)
System: talos2: GNU/Linux
OS: GNU/Linux -- 6.8.11-200.fc39.ppc64le -- #1 SMP Sun May 26 19:56:17 UTC 2024
Machine: ppc64le (unknown)
Language: en_US.utf8 (charmap="UTF-8", collate="UTF-8")
00:13:10 up 14:27, 3 users, load average: 0.17, 0.23, 0.09; runlevel 2024-06-06
------------------------------------------------------------------------
Benchmark Run: czw cze 06 2024 00:13:10 - 00:41:21
72 CPUs in system; running 1 parallel copy of tests
Dhrystone 2 using register variables 43482427.2 lps (10.0 s, 7 samples)
Double-Precision Whetstone 4958.8 MWIPS (10.0 s, 7 samples)
Execl Throughput 3584.1 lps (29.9 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks 499495.6 KBps (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks 128933.5 KBps (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks 1587747.4 KBps (30.0 s, 2 samples)
Pipe Throughput 789756.5 lps (10.0 s, 7 samples)
Pipe-based Context Switching 54265.4 lps (10.0 s, 7 samples)
Process Creation 4442.8 lps (30.0 s, 2 samples)
Shell Scripts (1 concurrent) 5002.6 lpm (60.0 s, 2 samples)
Shell Scripts (8 concurrent) 4411.7 lpm (60.0 s, 2 samples)
System Call Overhead 739118.2 lps (10.0 s, 7 samples)
System Benchmarks Index Values BASELINE RESULT INDEX
Dhrystone 2 using register variables 116700.0 43482427.2 3726.0
Double-Precision Whetstone 55.0 4958.8 901.6
Execl Throughput 43.0 3584.1 833.5
File Copy 1024 bufsize 2000 maxblocks 3960.0 499495.6 1261.4
File Copy 256 bufsize 500 maxblocks 1655.0 128933.5 779.1
File Copy 4096 bufsize 8000 maxblocks 5800.0 1587747.4 2737.5
Pipe Throughput 12440.0 789756.5 634.9
Pipe-based Context Switching 4000.0 54265.4 135.7
Process Creation 126.0 4442.8 352.6
Shell Scripts (1 concurrent) 42.4 5002.6 1179.8
Shell Scripts (8 concurrent) 6.0 4411.7 7352.8
System Call Overhead 15000.0 739118.2 492.7
========
System Benchmarks Index Score 998.1
------------------------------------------------------------------------
Benchmark Run: czw cze 06 2024 00:41:21 - 01:09:46
72 CPUs in system; running 72 parallel copies of tests
Dhrystone 2 using register variables 577677974.0 lps (10.0 s, 7 samples)
Double-Precision Whetstone 211750.9 MWIPS (9.9 s, 7 samples)
Execl Throughput 32708.5 lps (29.9 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks 11708757.2 KBps (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks 3254020.0 KBps (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks 22886575.0 KBps (30.0 s, 2 samples)
Pipe Throughput 18331410.8 lps (10.0 s, 7 samples)
Pipe-based Context Switching 2564695.6 lps (10.0 s, 7 samples)
Process Creation 75847.8 lps (30.0 s, 2 samples)
Shell Scripts (1 concurrent) 110553.5 lpm (60.0 s, 2 samples)
Shell Scripts (8 concurrent) 14402.6 lpm (60.1 s, 2 samples)
System Call Overhead 23145656.7 lps (10.0 s, 7 samples)
System Benchmarks Index Values BASELINE RESULT INDEX
Dhrystone 2 using register variables 116700.0 577677974.0 49501.1
Double-Precision Whetstone 55.0 211750.9 38500.2
Execl Throughput 43.0 32708.5 7606.6
File Copy 1024 bufsize 2000 maxblocks 3960.0 11708757.2 29567.6
File Copy 256 bufsize 500 maxblocks 1655.0 3254020.0 19661.8
File Copy 4096 bufsize 8000 maxblocks 5800.0 22886575.0 39459.6
Pipe Throughput 12440.0 18331410.8 14735.9
Pipe-based Context Switching 4000.0 2564695.6 6411.7
Process Creation 126.0 75847.8 6019.7
Shell Scripts (1 concurrent) 42.4 110553.5 26073.9
Shell Scripts (8 concurrent) 6.0 14402.6 24004.3
System Call Overhead 15000.0 23145656.7 15430.4
========
System Benchmarks Index Score 18698.4
-
Talos II Gentoo
Architecture: ppc64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Big Endian
CPU(s): 144
On-line CPU(s) list: 0-143
Model name: POWER9, altivec supported
Model: 2.2 (pvr 004e 1202)
Thread(s) per core: 4
Core(s) per socket: 18
Socket(s): 2
Frequency boost: enabled
CPU(s) scaling MHz: 58%
CPU max MHz: 3800.0000
CPU min MHz: 2154.0000
Caches (sum of all):
L1d: 1.1 MiB (36 instances)
L1i: 1.1 MiB (36 instances)
L2: 10 MiB (20 instances)
L3: 200 MiB (20 instances)
NUMA:
NUMA node(s): 2
NUMA node0 CPU(s): 0-71
NUMA node8 CPU(s): 72-143
Vulnerabilities:
Gather data sampling: Not affected
Itlb multihit: Not affected
L1tf: Mitigation; RFI Flush, L1D private per thread
Mds: Not affected
Meltdown: Mitigation; RFI Flush, L1D private per thread
Mmio stale data: Not affected
Retbleed: Not affected
Spec rstack overflow: Not affected
Spec store bypass: Mitigation; Kernel entry/exit barrier (eieio)
Spectre v1: Mitigation; __user pointer sanitization, ori31 speculation barrier enabled
Spectre v2: Mitigation; Indirect branch serialisation (kernel only)
Srbds: Not affected
Tsx async abort: Not affected
# # # # # # # ##### ###### # # #### # #
# # ## # # # # # # # ## # # # # #
# # # # # # ## ##### ##### # # # # ######
# # # # # # ## # # # # # # # # #
# # # ## # # # # # # # ## # # # #
#### # # # # # ##### ###### # # #### # #
Version 5.1.3 Based on the Byte Magazine Unix Benchmark
Multi-CPU version Version 5 revisions by Ian Smith,
Sunnyvale, CA, USA
January 13, 2011 johantheghost at yahoo period com
------------------------------------------------------------------------------
Use directories for:
* File I/O tests (named fs***) = /ramtmp/byte-unixbench/UnixBench/tmp
* Results = /ramtmp/byte-unixbench/UnixBench/results
------------------------------------------------------------------------------
1 x Dhrystone 2 using register variables 1 2 3 4 5 6 7 8 9 10
1 x Double-Precision Whetstone 1 2 3 4 5 6 7 8 9 10
1 x Execl Throughput 1 2 3
1 x File Copy 1024 bufsize 2000 maxblocks 1 2 3
1 x File Copy 256 bufsize 500 maxblocks 1 2 3
1 x File Copy 4096 bufsize 8000 maxblocks 1 2 3
1 x Pipe Throughput 1 2 3 4 5 6 7 8 9 10
1 x Pipe-based Context Switching 1 2 3 4 5 6 7 8 9 10
1 x Process Creation 1 2 3
1 x System Call Overhead 1 2 3 4 5 6 7 8 9 10
1 x Shell Scripts (1 concurrent) 1 2 3
1 x Shell Scripts (8 concurrent) 1 2 3
144 x Dhrystone 2 using register variables 1 2 3 4 5 6 7 8 9 10
144 x Double-Precision Whetstone 1 2 3 4 5 6 7 8 9 10
144 x Execl Throughput 1 2 3
144 x File Copy 1024 bufsize 2000 maxblocks 1 2 3
144 x File Copy 256 bufsize 500 maxblocks 1 2 3
144 x File Copy 4096 bufsize 8000 maxblocks 1 2 3
144 x Pipe Throughput 1 2 3 4 5 6 7 8 9 10
144 x Pipe-based Context Switching 1 2 3 4 5 6 7 8 9 10
144 x Process Creation 1 2 3
144 x System Call Overhead 1 2 3 4 5 6 7 8 9 10
144 x Shell Scripts (1 concurrent) 1 2 3
144 x Shell Scripts (8 concurrent) 1 2 3
========================================================================
BYTE UNIX Benchmarks (Version 5.1.3)
System: gentoobe: GNU/Linux
OS: GNU/Linux -- 6.5.0gentoobe -- #37 SMP Sun Nov 12 19:59:47 UTC 2023
Machine: ppc64 (PowerNV T2P9D01 REV 1.00)
Language: en_US.utf8 (charmap="UTF-8", collate="UTF-8")
19:55:48 up 31 days, 5:02, 2 users, load average: 0.13, 0.84, 7.00; runlevel 2024-05-06
------------------------------------------------------------------------
Benchmark Run: Thu Jun 06 2024 19:55:48 - 20:23:54
144 CPUs in system; running 1 parallel copy of tests
Dhrystone 2 using register variables 42751196.0 lps (10.0 s, 7 samples)
Double-Precision Whetstone 5171.1 MWIPS (9.8 s, 7 samples)
Execl Throughput 3417.1 lps (30.0 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks 726116.8 KBps (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks 191752.7 KBps (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks 2144675.1 KBps (30.0 s, 2 samples)
Pipe Throughput 1010071.8 lps (10.0 s, 7 samples)
Pipe-based Context Switching 107060.2 lps (10.0 s, 7 samples)
Process Creation 6845.4 lps (30.0 s, 2 samples)
Shell Scripts (1 concurrent) 5133.7 lpm (60.0 s, 2 samples)
Shell Scripts (8 concurrent) 4607.8 lpm (60.0 s, 2 samples)
System Call Overhead 761520.7 lps (10.0 s, 7 samples)
System Benchmarks Index Values BASELINE RESULT INDEX
Dhrystone 2 using register variables 116700.0 42751196.0 3663.3
Double-Precision Whetstone 55.0 5171.1 940.2
Execl Throughput 43.0 3417.1 794.7
File Copy 1024 bufsize 2000 maxblocks 3960.0 726116.8 1833.6
File Copy 256 bufsize 500 maxblocks 1655.0 191752.7 1158.6
File Copy 4096 bufsize 8000 maxblocks 5800.0 2144675.1 3697.7
Pipe Throughput 12440.0 1010071.8 812.0
Pipe-based Context Switching 4000.0 107060.2 267.7
Process Creation 126.0 6845.4 543.3
Shell Scripts (1 concurrent) 42.4 5133.7 1210.8
Shell Scripts (8 concurrent) 6.0 4607.8 7679.6
System Call Overhead 15000.0 761520.7 507.7
========
System Benchmarks Index Score 1229.9
------------------------------------------------------------------------
Benchmark Run: Thu Jun 06 2024 20:23:54 - 20:52:15
144 CPUs in system; running 144 parallel copies of tests
Dhrystone 2 using register variables 1301566752.7 lps (10.0 s, 7 samples)
Double-Precision Whetstone 361620.3 MWIPS (9.2 s, 7 samples)
Execl Throughput 45703.9 lps (29.9 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks 32471136.0 KBps (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks 9562904.3 KBps (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks 36136175.5 KBps (30.0 s, 2 samples)
Pipe Throughput 47642631.4 lps (10.0 s, 7 samples)
Pipe-based Context Switching 5996735.7 lps (10.0 s, 7 samples)
Process Creation 74604.5 lps (30.0 s, 2 samples)
Shell Scripts (1 concurrent) 170218.4 lpm (60.0 s, 2 samples)
Shell Scripts (8 concurrent) 23748.9 lpm (60.2 s, 2 samples)
System Call Overhead 50438100.4 lps (10.0 s, 7 samples)
System Benchmarks Index Values BASELINE RESULT INDEX
Dhrystone 2 using register variables 116700.0 1301566752.7 111531.0
Double-Precision Whetstone 55.0 361620.3 65749.2
Execl Throughput 43.0 45703.9 10628.8
File Copy 1024 bufsize 2000 maxblocks 3960.0 32471136.0 81997.8
File Copy 256 bufsize 500 maxblocks 1655.0 9562904.3 57781.9
File Copy 4096 bufsize 8000 maxblocks 5800.0 36136175.5 62303.8
Pipe Throughput 12440.0 47642631.4 38297.9
Pipe-based Context Switching 4000.0 5996735.7 14991.8
Process Creation 126.0 74604.5 5921.0
Shell Scripts (1 concurrent) 42.4 170218.4 40145.9
Shell Scripts (8 concurrent) 6.0 23748.9 39581.6
System Call Overhead 15000.0 50438100.4 33625.4
========
System Benchmarks Index Score 35625.3
-
My Talos II, Fedora 39:
Architecture: ppc64le
Byte Order: Little Endian
CPU(s): 72
...
Socket(s): 1
Could it be a Talos II Lite?
-
Apparently upgrading to dual NVMe drives in a HighPoint Rocket 1204 really helped versus SATA!
Architecture: ppc64le
Byte Order: Little Endian
CPU(s): 32
On-line CPU(s) list: 0-31
Model name: POWER9, altivec supported
Model: 2.3 (pvr 004e 1203)
Thread(s) per core: 4
Core(s) per socket: 8
Socket(s): 1
Frequency boost: enabled
CPU(s) scaling MHz: 58%
CPU max MHz: 3800.0000
CPU min MHz: 2166.0000
Caches (sum of all):
L1d: 256 KiB (8 instances)
L1i: 256 KiB (8 instances)
L2: 4 MiB (8 instances)
L3: 80 MiB (8 instances)
NUMA:
NUMA node(s): 1
NUMA node0 CPU(s): 0-31
Vulnerabilities:
Gather data sampling: Not affected
Itlb multihit: Not affected
L1tf: Mitigation; RFI Flush, L1D private per thread
Mds: Not affected
Meltdown: Mitigation; RFI Flush, L1D private per thread
Mmio stale data: Not affected
Reg file data sampling: Not affected
Retbleed: Not affected
Spec rstack overflow: Not affected
Spec store bypass: Mitigation; Kernel entry/exit barrier (eieio)
Spectre v1: Mitigation; __user pointer sanitization, ori31 speculation barrier enabled
Spectre v2: Mitigation; Software count cache flush (hardware accelerated), Software li
nk stack flush
Srbds: Not affected
Tsx async abort: Not affected
========================================================================
BYTE UNIX Benchmarks (Version 5.1.3)
System: garlic: GNU/Linux
OS: GNU/Linux -- 6.8.11-300.fc40.ppc64le -- #1 SMP Mon May 27 14:48:15 UTC 2024
Machine: ppc64le (unknown)
Language: en_US.utf8 (charmap="UTF-8", collate="UTF-8")
19:21:24 up 1 day, 5:15, 1 user, load average: 1.02, 0.93, 0.53; runlevel 2024-06-07
------------------------------------------------------------------------
Benchmark Run: Sat Jun 08 2024 19:21:24 - 19:49:33
32 CPUs in system; running 1 parallel copy of tests
Dhrystone 2 using register variables 43041836.3 lps (10.0 s, 7 samples)
Double-Precision Whetstone 4837.3 MWIPS (10.0 s, 7 samples)
Execl Throughput 3534.7 lps (30.0 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks 505013.3 KBps (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks 129743.4 KBps (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks 1617495.7 KBps (30.0 s, 2 samples)
Pipe Throughput 780267.9 lps (10.0 s, 7 samples)
Pipe-based Context Switching 45540.0 lps (10.0 s, 7 samples)
Process Creation 4907.1 lps (30.0 s, 2 samples)
Shell Scripts (1 concurrent) 4964.0 lpm (60.0 s, 2 samples)
Shell Scripts (8 concurrent) 4012.3 lpm (60.0 s, 2 samples)
System Call Overhead 745210.3 lps (10.0 s, 7 samples)
System Benchmarks Index Values BASELINE RESULT INDEX
Dhrystone 2 using register variables 116700.0 43041836.3 3688.2
Double-Precision Whetstone 55.0 4837.3 879.5
Execl Throughput 43.0 3534.7 822.0
File Copy 1024 bufsize 2000 maxblocks 3960.0 505013.3 1275.3
File Copy 256 bufsize 500 maxblocks 1655.0 129743.4 783.9
File Copy 4096 bufsize 8000 maxblocks 5800.0 1617495.7 2788.8
Pipe Throughput 12440.0 780267.9 627.2
Pipe-based Context Switching 4000.0 45540.0 113.9
Process Creation 126.0 4907.1 389.5
Shell Scripts (1 concurrent) 42.4 4964.0 1170.7
Shell Scripts (8 concurrent) 6.0 4012.3 6687.1
System Call Overhead 15000.0 745210.3 496.8
========
System Benchmarks Index Score 982.0
------------------------------------------------------------------------
Benchmark Run: Sat Jun 08 2024 19:49:33 - 20:17:50
32 CPUs in system; running 32 parallel copies of tests
Dhrystone 2 using register variables 458837067.6 lps (10.0 s, 7 samples)
Double-Precision Whetstone 113857.0 MWIPS (9.9 s, 7 samples)
Execl Throughput 37688.4 lps (29.9 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks 7241790.7 KBps (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks 1936219.8 KBps (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks 12073774.9 KBps (30.0 s, 2 samples)
Pipe Throughput 11005546.8 lps (10.0 s, 7 samples)
Pipe-based Context Switching 1753793.6 lps (10.0 s, 7 samples)
Process Creation 68433.7 lps (30.0 s, 2 samples)
Shell Scripts (1 concurrent) 70747.1 lpm (60.0 s, 2 samples)
Shell Scripts (8 concurrent) 9283.9 lpm (60.1 s, 2 samples)
System Call Overhead 13679550.5 lps (10.0 s, 7 samples)
System Benchmarks Index Values BASELINE RESULT INDEX
Dhrystone 2 using register variables 116700.0 458837067.6 39317.7
Double-Precision Whetstone 55.0 113857.0 20701.3
Execl Throughput 43.0 37688.4 8764.8
File Copy 1024 bufsize 2000 maxblocks 3960.0 7241790.7 18287.4
File Copy 256 bufsize 500 maxblocks 1655.0 1936219.8 11699.2
File Copy 4096 bufsize 8000 maxblocks 5800.0 12073774.9 20816.9
Pipe Throughput 12440.0 11005546.8 8846.9
Pipe-based Context Switching 4000.0 1753793.6 4384.5
Process Creation 126.0 68433.7 5431.2
Shell Scripts (1 concurrent) 42.4 70747.1 16685.6
Shell Scripts (8 concurrent) 6.0 9283.9 15473.2
System Call Overhead 15000.0 13679550.5 9119.7
========
System Benchmarks Index Score 12583.4
-
My Talos II, Fedora 39:
Architecture: ppc64le
Byte Order: Little Endian
CPU(s): 72
...
Socket(s): 1
Could it be a Talos II Lite?
No, it's a full Talos II but with one CPU. When I feel that the hardware is already inefficient I will buy a second CPU ;)
So far the only program that is too slow on Talos is Firefox :(
-
Hi, this is very interesting!
Here's my Talos II Lite
Architecture: ppc64le
Byte Order: Little Endian
CPU(s): 32
On-line CPU(s) list: 0-31
Model name: POWER9, altivec supported
Model: 2.3 (pvr 004e 1203)
Thread(s) per core: 4
Core(s) per socket: 8
Socket(s): 1
Frequency boost: enabled
CPU(s) scaling MHz: 100%
CPU max MHz: 3800.0000
CPU min MHz: 2166.0000
L1d cache: 256 KiB (8 instances)
L1i cache: 256 KiB (8 instances)
L2 cache: 4 MiB (8 instances)
L3 cache: 80 MiB (8 instances)
NUMA node(s): 1
NUMA node0 CPU(s): 0-31
Vulnerability Gather data sampling: Not affected
Vulnerability Itlb multihit: Not affected
Vulnerability L1tf: Mitigation; RFI Flush, L1D private per thread
Vulnerability Mds: Not affected
Vulnerability Meltdown: Mitigation; RFI Flush, L1D private per thread
Vulnerability Mmio stale data: Not affected
Vulnerability Reg file data sampling: Not affected
Vulnerability Retbleed: Not affected
Vulnerability Spec rstack overflow: Not affected
Vulnerability Spec store bypass: Mitigation; Kernel entry/exit barrier (eieio)
Vulnerability Spectre v1: Mitigation; __user pointer sanitization, ori31 speculation barrier enabled
Vulnerability Spectre v2: Mitigation; Software count cache flush (hardware accelerated), Software link stack flush
Vulnerability Srbds: Not affected
Vulnerability Tsx async abort: Not affected
BYTE UNIX Benchmarks (Version 5.1.3)
System: kvanneholtnipa: GNU/Linux
OS: GNU/Linux -- 6.9.3-gentoo -- #1 SMP Sun Jun 2 18:02:57 CEST 2024
Machine: ppc64le (PowerNV T2P9S01 REV 1.01)
Language: en_US.utf8 (charmap="UTF-8", collate="UTF-8")
22:03:56 up 7 days, 3:53, 5 users, load average: 0.33, 0.13, 0.20; runlevel 2024-06-02
------------------------------------------------------------------------
Benchmark Run: Sun Jun 09 2024 22:03:56 - 22:32:05
32 CPUs in system; running 1 parallel copy of tests
Dhrystone 2 using register variables 42969120.6 lps (10.0 s, 7 samples)
Double-Precision Whetstone 4930.0 MWIPS (10.0 s, 7 samples)
Execl Throughput 1089.7 lps (30.0 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks 55940.8 KBps (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks 13843.0 KBps (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks 203745.8 KBps (30.0 s, 2 samples)
Pipe Throughput 519706.9 lps (10.0 s, 7 samples)
Pipe-based Context Switching 56092.3 lps (10.0 s, 7 samples)
Process Creation 2319.9 lps (30.0 s, 2 samples)
Shell Scripts (1 concurrent) 2987.0 lpm (60.0 s, 2 samples)
Shell Scripts (8 concurrent) 2006.8 lpm (60.0 s, 2 samples)
System Call Overhead 469894.7 lps (10.0 s, 7 samples)
System Benchmarks Index Values BASELINE RESULT INDEX
Dhrystone 2 using register variables 116700.0 42969120.6 3682.0
Double-Precision Whetstone 55.0 4930.0 896.4
Execl Throughput 43.0 1089.7 253.4
File Copy 1024 bufsize 2000 maxblocks 3960.0 55940.8 141.3
File Copy 256 bufsize 500 maxblocks 1655.0 13843.0 83.6
File Copy 4096 bufsize 8000 maxblocks 5800.0 203745.8 351.3
Pipe Throughput 12440.0 519706.9 417.8
Pipe-based Context Switching 4000.0 56092.3 140.2
Process Creation 126.0 2319.9 184.1
Shell Scripts (1 concurrent) 42.4 2987.0 704.5
Shell Scripts (8 concurrent) 6.0 2006.8 3344.7
System Call Overhead 15000.0 469894.7 313.3
========
System Benchmarks Index Score 417.0
------------------------------------------------------------------------
Benchmark Run: Sun Jun 09 2024 22:32:05 - 23:00:27
32 CPUs in system; running 32 parallel copies of tests
Dhrystone 2 using register variables 424895900.7 lps (10.0 s, 7 samples)
Double-Precision Whetstone 118387.4 MWIPS (9.8 s, 7 samples)
Execl Throughput 11518.8 lps (29.9 s, 2 samples)
File Copy 1024 bufsize 2000 maxblocks 564503.0 KBps (30.0 s, 2 samples)
File Copy 256 bufsize 500 maxblocks 141555.5 KBps (30.0 s, 2 samples)
File Copy 4096 bufsize 8000 maxblocks 2199353.4 KBps (30.0 s, 2 samples)
Pipe Throughput 5601329.9 lps (10.0 s, 7 samples)
Pipe-based Context Switching 978736.8 lps (10.0 s, 7 samples)
Process Creation 26301.2 lps (30.0 s, 2 samples)
Shell Scripts (1 concurrent) 23083.4 lpm (60.0 s, 2 samples)
Shell Scripts (8 concurrent) 3091.5 lpm (60.2 s, 2 samples)
System Call Overhead 6228056.0 lps (10.0 s, 7 samples)
System Benchmarks Index Values BASELINE RESULT INDEX
Dhrystone 2 using register variables 116700.0 424895900.7 36409.2
Double-Precision Whetstone 55.0 118387.4 21525.0
Execl Throughput 43.0 11518.8 2678.8
File Copy 1024 bufsize 2000 maxblocks 3960.0 564503.0 1425.5
File Copy 256 bufsize 500 maxblocks 1655.0 141555.5 855.3
File Copy 4096 bufsize 8000 maxblocks 5800.0 2199353.4 3792.0
Pipe Throughput 12440.0 5601329.9 4502.7
Pipe-based Context Switching 4000.0 978736.8 2446.8
Process Creation 126.0 26301.2 2087.4
Shell Scripts (1 concurrent) 42.4 23083.4 5444.2
Shell Scripts (8 concurrent) 6.0 3091.5 5152.5
System Call Overhead 15000.0 6228056.0 4152.0
========
System Benchmarks Index Score 4148.7
I always felt my disks are very slow, and now I have proof. In single thread, they are about just 10% of the others here. Even with LUKS, they should not be THIS slow, should they?
-
That is a pretty stark difference. What’s your storage configuration?
-
That is a pretty stark difference. What’s your storage configuration?
I have Btrfs on top of LUKS on top of Raw disk, no raid or stripe.
Hardware:
CPU --> PLX Technology PEX 9733 PCIe Gen 3 4x U.2 --> Western Digital Ultrastar DC SW630 NVMe SSD
I also have 2 other NVME-disks of different make and size, all are slow.
-
Hi, this is very interesting!
Here's my Talos II Lite
- - 8< - - - - Cut
Seeing real numbers has finally gotten me motivated to clear out a disk, and test another distro.
Then I can also do some testing without full disk encryplion.
Will have to do some backup rotatoins first, thou :P