mstembera
|
82bb21dc7a
|
Optimize AVX2 path in NNUE evaluation
always selecting AffineTransform specialization for small inputs.
A related patch was tested as
Initially tested as a simplification
STC https://tests.stockfishchess.org/tests/view/6317c3f437f41b13973d6dff
LLR: 2.95 (-2.94,2.94) <-1.75,0.25>
Total: 58072 W: 15619 L: 15425 D: 27028
Ptnml(0-2): 241, 6191, 15992, 6357, 255
Elo gain speedup test
STC https://tests.stockfishchess.org/tests/view/63181c1b37f41b13973d79dc
LLR: 2.94 (-2.94,2.94) <0.00,2.00>
Total: 184496 W: 49922 L: 49401 D: 85173
Ptnml(0-2): 851, 19397, 51208, 19964, 828
and this patch gained in testing
speedup = +0.0071
P(speedup > 0) = 1.0000
on CPU: 16 x AMD Ryzen 9 3950X
closes https://github.com/official-stockfish/Stockfish/pull/4158
No functional change
|
2022-09-11 14:19:57 +02:00 |
|