mirror of https://github.com/sockspls/badfish synced 2025-05-02 01:29:36 +00:00
BadFish/src/nnue
mstembera 82bb21dc7a Optimize AVX2 path in NNUE evaluation
by always selecting the AffineTransform specialization for small inputs.

A related patch was tested as follows.

Initially tested as a simplification
STC https://tests.stockfishchess.org/tests/view/6317c3f437f41b13973d6dff
LLR: 2.95 (-2.94,2.94) <-1.75,0.25>
Total: 58072 W: 15619 L: 15425 D: 27028
Ptnml(0-2): 241, 6191, 15992, 6357, 255

Then tested as an Elo-gaining speedup
STC https://tests.stockfishchess.org/tests/view/63181c1b37f41b13973d79dc
LLR: 2.94 (-2.94,2.94) <0.00,2.00>
Total: 184496 W: 49922 L: 49401 D: 85173
Ptnml(0-2): 851, 19397, 51208, 19964, 828

This patch itself gained speed in testing:

speedup        = +0.0071
P(speedup > 0) =  1.0000
on CPU: 16 x AMD Ryzen 9 3950X
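
The W/L/D totals quoted in the two STC runs can be turned into a rough Elo point estimate. The sketch below uses the standard logistic Elo formula; the function names are hypothetical, and this simple estimate is not necessarily the estimator the test framework itself reports (the LLR figures above come from a sequential test that also uses the pentanomial counts).

```cpp
#include <cassert>
#include <cmath>

// Fraction of points scored by the patched engine (win = 1, draw = 0.5).
// Hypothetical helper, not part of Stockfish or its test framework.
double score_fraction(long wins, long losses, long draws) {
    const long games = wins + losses + draws;
    assert(games > 0);
    return (wins + 0.5 * draws) / static_cast<double>(games);
}

// Logistic Elo difference implied by a score fraction strictly between 0 and 1.
double elo_from_score(double score) {
    assert(score > 0.0 && score < 1.0);
    return -400.0 * std::log10(1.0 / score - 1.0);
}
```

Calling `elo_from_score(score_fraction(15619, 15425, 27028))` for the first run and `elo_from_score(score_fraction(49922, 49401, 85173))` for the second gives a point estimate of about +1 Elo in each case.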

closes https://github.com/official-stockfish/Stockfish/pull/4158

No functional change
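
As an illustration of what "always selecting the specialization for small inputs" can look like at compile time, here is a minimal sketch of choosing between two class template specializations by input dimension. Every name and value in it (`AffineSketch`, `kLargeInputThreshold`, `kAlwaysUseSmallPath`) is hypothetical and chosen for the example; it is not the code from the patch.

```cpp
#include <cassert>
#include <cstddef>
#include <type_traits>

// Hypothetical values, for illustration only.
constexpr std::size_t kLargeInputThreshold = 128;
constexpr bool kAlwaysUseSmallPath = true;  // flip to false to restore size-based choice

// True when the "small input" specialization should be chosen for InDims.
template <std::size_t InDims>
constexpr bool use_small_path =
    kAlwaysUseSmallPath || InDims < kLargeInputThreshold;

// Primary template is only declared; one of the two specializations below matches.
template <std::size_t InDims, std::size_t OutDims, typename Enabled = void>
struct AffineSketch;

// Specialization intended for small inputs.
template <std::size_t InDims, std::size_t OutDims>
struct AffineSketch<InDims, OutDims, std::enable_if_t<use_small_path<InDims>>> {
    static constexpr bool is_small = true;
};

// Specialization intended for large inputs.
template <std::size_t InDims, std::size_t OutDims>
struct AffineSketch<InDims, OutDims, std::enable_if_t<!use_small_path<InDims>>> {
    static constexpr bool is_small = false;
};
```

With `kAlwaysUseSmallPath` set to `true`, `AffineSketch<1024, 16>::is_small` is `true` even though 1024 exceeds the threshold; with it set to `false`, the choice falls back to the size comparison. Because the selection happens entirely at compile time, a change of this kind affects speed but not results.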
2022-09-11 14:19:57 +02:00
features                    Update copyright years                                                        2022-01-06 15:45:45 +01:00
layers                      Optimize AVX2 path in NNUE evaluation                                         2022-09-11 14:19:57 +02:00
evaluate_nnue.cpp           Use NNUE complexity in search, retune related parameters                      2022-06-20 08:30:57 +02:00
evaluate_nnue.h             Update copyright years                                                        2022-01-06 15:45:45 +01:00
nnue_accumulator.h          Update copyright years                                                        2022-01-06 15:45:45 +01:00
nnue_architecture.h         Update NNUE architecture to SFNNv5. Update network to nn-3c0aa92af1da.nnue.   2022-05-14 12:47:22 +02:00
nnue_common.h               Clean up and simplify some nnue code.                                         2022-02-25 08:37:57 +01:00
nnue_feature_transformer.h  Assorted small cleanups                                                       2022-05-29 18:42:48 +02:00