BadFish

mirror of https://github.com/sockspls/badfish synced 2025-04-30 16:53:09 +00:00

Author	SHA1	Message	Date
Stephane Nicolet	05aa34e00e	Update list of top CPU contributors Contributors with >10,000 CPU hours as of November 4, 2018. Thank you! No functional change	2018-11-08 17:09:44 +01:00
SFisGOD	cd732c080b	Pawn and Piece Values Tuned at LTC Failed STC LLR: -2.96 (-2.94,2.94) [0.00,4.00] Total: 27487 W: 5846 L: 5903 D: 15738 http://tests.stockfishchess.org/tests/view/5be1d3190ebc595e0ae2e5b8 Passed 1st LTC LLR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 38503 W: 6270 L: 5999 D: 26234 http://tests.stockfishchess.org/tests/view/5be1f5ef0ebc595e0ae2e750 Passed 2nd LTC LLR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 34016 W: 5584 L: 5326 D: 23106 http://tests.stockfishchess.org/tests/view/5be2a1970ebc595e0ae2f1b4 This pull request lead to an interesting discussion about testing methodology for Stockfish: https://github.com/official-stockfish/Stockfish/pull/1804 Bench: 3647775	2018-11-08 16:34:10 +01:00
Joost VandeVondele	df50ea5dc6	fixup	2018-11-08 16:20:23 +01:00
Joost VandeVondele	9315ba60e6	Extension for king moves changing castling rights passed STC: LLR: 2.95 (-2.94,2.94) [0.00,5.00] Total: 8463 W: 1919 L: 1747 D: 4797 http://tests.stockfishchess.org/tests/view/5be15d510ebc595e0ae2dec6 passed LTC: LLR: 2.95 (-2.94,2.94) [0.00,5.00] Total: 142590 W: 23263 L: 22587 D: 96740 http://tests.stockfishchess.org/tests/view/5be1667b0ebc595e0ae2df2d Bench: 3607243	2018-11-08 16:20:23 +01:00
Fabian Fichter	a6fe035977	Simplify mobility danger Check sign only after adding mobility danger term. STC LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 9090 W: 2001 L: 1856 D: 5233 http://tests.stockfishchess.org/tests/view/5bdc5ee10ebc595e0ae27bc2 LTC LLR: 2.94 (-2.94,2.94) [-3.00,1.00] Total: 123466 W: 19766 L: 19805 D: 83895 http://tests.stockfishchess.org/tests/view/5bdc678e0ebc595e0ae27cf3 bench: 3630207	2018-11-04 21:30:35 +01:00
Stéphane Nicolet	8bb7a73708	Rook tweaks in evaluation Some small changes in evaluation to try to convince Stockfish to centralize her rooks more in middle game and avoid trapping them in the corners. Joint work by SFisGOD and snicolet. STC: LLR: 2.96 (-2.94,2.94) [0.00,4.00] Total: 99826 W: 21895 L: 21341 D: 56590 http://tests.stockfishchess.org/tests/view/5bdc3e280ebc595e0ae277df LTC: LLR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 21467 W: 3541 L: 3322 D: 14604 http://tests.stockfishchess.org/tests/view/5bdc9ff30ebc595e0ae28119 Bench: 3631608	2018-11-02 22:08:26 +01:00
Joost VandeVondele	3f1eb85a1c	Fix issues from using adjustedDepth too broadly The recently committed Fail-High patch (`081af90805`) had a number of changes beyond adjusting the depth of search on fail high, with some undesirable side effects. 1) Decreasing depth on PV output, confusing GUIs and players alike as described in issue #1787. The depth printed is anyway a convention, let's consider adjustedDepth an implementation detail, and continue to print rootDepth. Depth, nodes, time and move quality all increase as we compute more. (fixing this output has no effect on play). 2) Fixes go depth output (now based on rootDepth again, no effect on play), also reported in issue #1787 3) The depth lastBestDepth is used to compute how long a move is stable, a new move found during fail-high is incorrectly considered stable if based on adjustedDepth instead of rootDepth (this changes time management). Reverting this passed STC and LTC: STC LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 82982 W: 17810 L: 17808 D: 47364 http://tests.stockfishchess.org/tests/view/5bd391a80ebc595e0ae1e993 LTC LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 109083 W: 17602 L: 17619 D: 73862 http://tests.stockfishchess.org/tests/view/5bd40c820ebc595e0ae1f1fb 4) In the thread voting scheme, the rank of the fail-high thread is now artificially low, incorrectly since the quality of the move is much better than what adjustedDepth suggests (e.g. if it takes 10 iterations to find VALUE_KNOWN_WIN, it has very low depth). Further evidence comes from a test that showed that the move of highest depth is not better than that of the last PV (which is potentially of much lower adjustedDepth). I.e. this test http://tests.stockfishchess.org/tests/view/5bd37a120ebc595e0ae1e7c3 failed SPRT[0, 5]: LLR: -2.95 (-2.94,2.94) [0.00,5.00] Total: 10609 W: 2266 L: 2345 D: 5998 In a running 5+0.05 th 8 test (more than 10000 games) a positive Elo estimate is shown (strong enough for a [-3,1], possibly not [0,4]): http://tests.stockfishchess.org/tests/view/5bd421be0ebc595e0ae1f315 LLR: -0.13 (-2.94,2.94) [0.00,4.00] Total: 13644 W: 2573 L: 2532 D: 8539 Elo 1.04 [-2.52,4.61] / LOS 71% Thus, restore old behavior as a bugfix, keeping the core of the fail-high patch idea as resolving scheme. This is non-functional for bench, but changes searches via time management and in the threaded case. Bench: 3556672	2018-11-01 16:00:56 +01:00
SFisGOD	4a0db9ea3c	Combo Combo of two parameter tweaks and tuned values for Queen and ThreatByKing. STC LLR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 20180 W: 4439 L: 4198 D: 11543 http://tests.stockfishchess.org/tests/view/5bd7b8250ebc595e0ae22e97 LTC LLR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 86312 W: 14106 L: 13685 D: 58521 http://tests.stockfishchess.org/tests/view/5bd803560ebc595e0ae23213 This combo consists of the following: Queen Value (tuned values) Iter: 72056, A: 5000, alpha 0.602000, gamma 0.101000, clipping old, rounding deterministic param: QueenValueMg, best: 2528.91, start: 2528.00 param: QueenValueEg, best: 2687.12, start: 2698.00 ThreatByKing (tuned values) Green STC (50.8k games) http://tests.stockfishchess.org/tests/view/5bd1d5a00ebc595e0ae1cbec LTC (I stopped this test at 71.2k games. It's likely yellow.) http://tests.stockfishchess.org/tests/view/5bd263e70ebc595e0ae1d77e WeakUnopposedPawn (tweak) by xoto (https://github.com/xoto10) Green STC (102.8k games) http://tests.stockfishchess.org/tests/view/5bd306bb0ebc595e0ae1e146 Yellow LTC (90.8k games) http://tests.stockfishchess.org/tests/view/5bd3ea660ebc595e0ae1f16b aspiTune1 (tweak) by vondele (https://github.com/vondele) Green STC (125.9k games) http://tests.stockfishchess.org/tests/view/5bd2ae100ebc595e0ae1dab0 Yellow LTC (107.9k games) http://tests.stockfishchess.org/tests/view/5bd3eb700ebc595e0ae1f16f Thank you @31m059 (Mark Tenzer) for helping me! Also, thank you very much for recognizing my efforts. I genuinely appreciate it. Bench: 3556672	2018-11-01 15:39:19 +01:00
Vizvezdenec	7a61368971	Tweak of knight PSQT and mobility bonuses STC LLR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 16906 W: 3745 L: 3516 D: 9645 http://tests.stockfishchess.org/tests/view/5bd306a40ebc595e0ae1e144 LTC LLR: 2.96 (-2.94,2.94) [0.00,4.00] Total: 62779 W: 10249 L: 9901 D: 42629 http://tests.stockfishchess.org/tests/view/5bd3188f0ebc595e0ae1e296 Bench 3166402	2018-10-27 09:23:11 +02:00
Guenther Demetz	081af90805	On main thread: reduce depth after fail high This helps resolving consecutive FH's during aspiration more efficiently STC: http://tests.stockfishchess.org/tests/view/5bc857920ebc592439f85765 LLR: 2.95 (-2.94,2.94) [0.00,5.00] Total: 4992 W: 1134 L: 980 D: 2878 Elo +10.72 LTC: http://tests.stockfishchess.org/tests/view/5bc868050ebc592439f857ef LLR: 2.95 (-2.94,2.94) [0.00,5.00] Total: 8123 W: 1363 L: 1210 D: 5550 Elo +6.54 No-Regression test with 8 threads, tc=15+0.15: http://tests.stockfishchess.org/tests/view/5bc874ca0ebc592439f85938 LLR: 2.94 (-2.94,2.94) [-3.00,1.00] Total: 24740 W: 3977 L: 3863 D: 16900 Elo +1.60 This was a cooperation between me and Michael Stembera: -me recognizing SF having problems with resolving FH's efficiently at high depths, thus starting some tests based on consecutive FH's. -mstembera picking up the idea with first success at STC & LTC (so full credits to him!) -me suggesting how to resolve the issues pinpointed by S.G on PR #1768 and finally restricting the logic to the main thread so that it don't regresses at multi-thread. bench: 3314347	2018-10-25 23:08:06 +02:00
Peter Zsifkovits	bc3b148d57	NUMA for 9 threads or more Enable numa machinery only for STRICTLY MORE than 8 threads. Reason for this change is that nowadays SMP tests are always done with 8 threads. That is a problem for multi-socket Windows machines running on fishtest. No functional change	2018-10-25 23:03:25 +02:00
Günther Demetz	9fff272209	Revert Pull Request #1771 , see issue #1785 (#1786 ) no functional change bench: 4274207	2018-10-23 18:04:30 +02:00
mstembera	542a2b39ed	Small simplification in castling rights There is no need for a special struct with a static member to generate castling rights. No functional change.	2018-10-21 08:15:04 +02:00
ElbertoOne	738a6dfd4c	Simplify check extensions Remove the !moveCountPruning condition for check extensions, which seems not necessary. STC: LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 22238 W: 4835 L: 4715 D: 12688 http://tests.stockfishchess.org/tests/view/5bb3241a0ebc592439f6d2ac LTC: LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 36593 W: 5898 L: 5802 D: 24893 http://tests.stockfishchess.org/tests/view/5bb34c220ebc592439f6d5dc Bench: 4274207	2018-10-14 20:40:57 +02:00
Joost VandeVondele	97d2cc9a9c	Randomize draw eval The patch adds a small random component (+-1) to VALUE_DRAW for the evaluation of draw positions (mostly 3folds). This random component is not static, but potentially different for each visit of the node (hence derived from the node counter). The effect is that in positions with many 3fold draw lines, different lines are followed at each iteration. This keeps the search much more dynamic, as opposed to being locked to one particular 3fold. An example of a position where master suffers from 3fold-blindness and this patch solves quickly is the famous TCEC game 53: FEN: 3r2k1/pr6/1p3q1p/5R2/3P3p/8/5RP1/3Q2K1 b - - 0 51 master doesn't see that this is a lost position (draw eval up to depth 50) as Qf6-e6 d4-d5 (found by patch at depth 23) leads to a loss. The 3fold-blindness is more important at longer TC, the patch was yellow STC and LTC, but passed VLTC: STC LLR: -2.95 (-2.94,2.94) [0.00,5.00] Total: 46328 W: 10048 L: 9953 D: 26327 http://tests.stockfishchess.org/tests/view/5b9c0ca20ebc592cf275f7c7 LTC LLR: -2.95 (-2.94,2.94) [0.00,5.00] Total: 54663 W: 8938 L: 8846 D: 36879 http://tests.stockfishchess.org/tests/view/5b9ca1610ebc592cf27601d3 VLTC LLR: 2.95 (-2.94,2.94) [0.00,5.00] Total: 31789 W: 4512 L: 4284 D: 22993 http://tests.stockfishchess.org/tests/view/5b9d1a670ebc592cf276076d Credit to @crossbr for pointing to this problem repeatedly, and giving the hint that many draw lines are typical in those situations. Bench: 4756639	2018-10-14 20:33:52 +02:00
Guenther Demetz	cb0111d3db	Correctly track down pv even in fail-high case Currently we update (track up) the pv even in the fail high case. However most times in such cases the pv in the ply below remains unset because there we have value == alpha and so finally we see truncated pv's (=just one move) in fail high cases. Of course tracking down these pv's (+sending them to the gui) comes at a certian cost, but no-regression tests passed: STC: LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 16300 W: 3556 L: 3424 D: 9320 http://tests.stockfishchess.org/tests/view/5b9b73500ebc592cf275ea92 LTC: LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 202411 W: 32734 L: 32897 D: 136780 http://tests.stockfishchess.org/tests/view/5b9baed10ebc592cf275ef6d N.B.: Digging also into qsearch was tried in another version but seemed not to pass the tests. This means that we don't always will get a pv until the very tips. No functional change	2018-10-14 20:19:46 +02:00
Miguel Lahoz	0370077c37	Simplify evaluation of blockers_for_king Currently, we have two evaluation terms which account for pinned pieces. One is for all pinned pieces in kingDanger computation and another for just pinned pawns in ThreatByRank. We can increase the relevant bonus for kingDanger calculation and do away with the ThreatByRank, which seems to just add more complexity. STC: LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 113353 W: 24299 L: 24356 D: 64698 http://tests.stockfishchess.org/tests/view/5ba348c20ebc592cf2766e61 LTC: LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 96458 W: 15514 L: 15511 D: 65433 http://tests.stockfishchess.org/tests/view/5ba398830ebc592cf2767563 At 100k games, I thought it struggles a bit, but some related [0,4] tests attempting individual tweaks seem to fail: I tried directly tweaking ThreatByRank: http://tests.stockfishchess.org/tests/view/5ba3c6300ebc592cf276791c http://tests.stockfishchess.org/tests/view/5ba3c6190ebc592cf2767917 @Vizveznedec was also recently trying to tweak the same coeffecients for kingDanger calculation: http://tests.stockfishchess.org/tests/view/5ba2c7320ebc592cf27664b2 http://tests.stockfishchess.org/tests/view/5ba2c8220ebc592cf27664b8 http://tests.stockfishchess.org/tests/view/5ba2c7880ebc592cf27664b4 http://tests.stockfishchess.org/tests/view/5ba2c7ce0ebc592cf27664b6 Bench: 4648095	2018-10-14 20:15:16 +02:00
Joost VandeVondele	d615f15fce	small ttCapture simplification. ttCapture can be assigned to only once outside of the main loop. The patch seems functional at higher depths (seems possible in the case of non-legal TTmoves that are captures). passed STC LLR: 2.94 (-2.94,2.94) [-3.00,1.00] Total: 23189 W: 5098 L: 4980 D: 13111 http://tests.stockfishchess.org/tests/view/5bb3822c0ebc592439f6d966 passed LTC LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 10336 W: 1665 L: 1529 D: 7142 http://tests.stockfishchess.org/tests/view/5bb39a190ebc592439f6db8a unchanged bench: 4312846	2018-10-14 20:10:47 +02:00
31m059	489357d7b2	Combo This PR is a combination of two unrelated [0, 4] patches that appeared promising but not quite strong enough to pass on their own. The combination initially failed STC with a positive score after a long run, and the subsequent speculative LTC test passed. * tweak_threatOnQueen4 : Increase the middlegame components of ThreatByMinor[QUEEN] and ThreatByRook[QUEEN] by 15 each. Bryan's (@crossbr) analysis of CCC Bonus Game 10 inspired several tests on penalizing a queen with limited safe mobility. While attempting to implement this idea, I noticed that when I did not include the queen's current square in the calculations, the Elo gains seemed to vanish--and only then did I have the idea to revisit ThreatByMinor[QUEEN] and ThreatByRook[QUEEN], adding a corresponding value to each. Without Bryan's work, this test would never have been submitted. I would also like to recognize the efforts and contributions of @SFisGOD, who also vigorously worked on this idea. * Use pure static eval for null move pruning : This idea was directly re-purposed from a promising test by Jerry Donald Watson (@jerrydonaldwatson) in August. It was also independently developed and tested by Stefan Geschwentner (@locutus2) previously. Thank you all! STC (failed yellow): LLR: -2.96 (-2.94,2.94) [0.00,4.00] Total: 83913 W: 17986 L: 17825 D: 48102 http://tests.stockfishchess.org/tests/view/5bbc59300ebc592439f76aa5 LTC: LLR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 137198 W: 22351 L: 21772 D: 93075 http://tests.stockfishchess.org/tests/view/5bbce35f0ebc592439f77639 Bench: 4312846	2018-10-14 20:02:31 +02:00
Eduardo Caceres	8141bdd179	Fix two typos in comments Note by snicolet: I use this non-functional change patch as a pretext to correct the wrong bench number I introduced in the message of the previous commit. Bench: 4059356	2018-09-27 21:39:36 +02:00
Joost VandeVondele	bbf9daa175	Remove essentially unused code this was added recently as part of a larger commit, but only changes eval of positions at MAX_PLY depth a little. Can be safely removed: passed STC: LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 7424 W: 1640 L: 1492 D: 4292 http://tests.stockfishchess.org/html/live_elo.html?5ba3bcbe0ebc592cf27677ff passed LTC: LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 73554 W: 12028 L: 11990 D: 49536 http://tests.stockfishchess.org/html/live_elo.html?5ba397ee0ebc592cf2767556 unchanged Bench: 4248710	2018-09-27 21:28:38 +02:00
protonspring	13d06edb84	Two simplifications in passed pawns evaluation These two simplifications appear to be affecting and/or offsetting each other. Neither can be removed independently, but in combination they pass -3,1. STC LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 36391 W: 7888 L: 7795 D: 20708 http://tests.stockfishchess.org/tests/view/5b9bce410ebc592cf275f1b2 LTC LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 19513 W: 3237 L: 3114 D: 13162 http://tests.stockfishchess.org/tests/view/5b9c0edf0ebc592cf275f80e Closes https://github.com/official-stockfish/Stockfish/pull/1769 bench 4059356	2018-09-27 21:18:18 +02:00
Rocky640	49b1591505	Pawn PSQT Tuned Tested against master "Tweak opposite color bishops endgame scaling" using values from a 100K SPSA with ck=10 Passed STC http://tests.stockfishchess.org/tests/view/5ba7fe7a0ebc592cf276b971 LLR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 27717 W: 6052 L: 5782 D: 15883 Passed LTC http://tests.stockfishchess.org/tests/view/5ba815790ebc592cf276bb6b LLR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 17486 W: 2919 L: 2712 D: 11855 bench: 4441247	2018-09-27 20:58:40 +02:00
Joost VandeVondele	33b2f6398c	Remove unneeded branch Storing unconditionally the current generation and bound is equivalent to master. Part of the condition was added as a speed optimization in #429. Here the branch is fully eliminated. passed STC single-threaded: LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 73515 W: 16378 L: 16359 D: 40778 http://tests.stockfishchess.org/tests/view/5b2fc38c0ebc5902b2e57fd5 passed STC multi-threaded: LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 63725 W: 12916 L: 12874 D: 37935 http://tests.stockfishchess.org/tests/view/5b307b8f0ebc5902b2e5895f The multithreaded test was run after a plausible suggestion by @mstembera that the effect of this could be larger with many cores. The result seems to indicate this doesn't really matter on the 8core architecture abundantly available on fishtest. No functional change	2018-09-27 20:48:11 +02:00
Vizvezdenec	0fa957cf66	Tweak opposite colord bishops endgame scaling. Make scale factor dependant on asymmetry of pawn structure. STC http://tests.stockfishchess.org/tests/view/5b92a2a80ebc592cf2753dd4 LLR: 2.96 (-2.94,2.94) [0.00,5.00] Total: 31490 W: 6870 L: 6587 D: 18033 LTC http://tests.stockfishchess.org/tests/view/5b92f8170ebc592cf2754438 LLR: 2.95 (-2.94,2.94) [0.00,5.00] Total: 54928 W: 8988 L: 8653 D: 37287 This patch shows that SF can use some more complicated endgame heuristics to evaluate endgames better from the distance. Closes https://github.com/official-stockfish/Stockfish/pull/1767 Bench: 4248710	2018-09-10 12:22:44 +02:00
ElbertoOne	4bef7aa5cd	Parameter tweaks in PSQT and NMP This patch is a combinaison of two parameters tweaks patches which have failed as strong yellows at LTC recently, by Alain Savard (Rocky640) and Fabian Fichter (ianfab): http://tests.stockfishchess.org/tests/view/5b8a71e60ebc592cf2749b1d http://tests.stockfishchess.org/tests/view/5b81ce3b0ebc5902bdbb6585 Passed STC: LLR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 57200 W: 12392 L: 12008 D: 32800 http://tests.stockfishchess.org/tests/view/5b8d0a5a0ebc592cf274c48f And LTC: LLR: 2.96 (-2.94,2.94) [0.00,4.00] Total: 37215 W: 6233 L: 5962 D: 25020 http://tests.stockfishchess.org/tests/view/5b8d56090ebc592cf274cb53 Closes https://github.com/official-stockfish/Stockfish/pull/1764 Bench: 4136116 --------------- How to continue from there? The null move reduction formula in line 769 of search.cpp is quite convoluted and full of mysterious magic constants at the moment, it would certainly be nice to simplify it and/or gain more Elo from it: ``` Depth R = ( (823 + 67 * depth / ONE_PLY) / 256 + std::min(int(eval - beta) / 200, 3)) * ONE_PLY; ```	2018-09-04 10:43:02 +02:00
Stéphane Nicolet	767c4ad1fc	Update list of authors And also fix some spaces and formatting oddities in the code. No functional change	2018-09-03 22:11:30 +02:00
Stéphane Nicolet	2bfaf45455	Re-introduce "keep pawns on both flanks" Re-introduce the "keep pawns on both flanks" idea. STC yellow: LLR: -2.95 (-2.94,2.94) [0.00,5.00] Total: 93279 W: 20175 L: 19853 D: 53251 http://tests.stockfishchess.org/tests/view/5b8a00370ebc592cf274916a LTC: LLR: 2.96 (-2.94,2.94) [0.00,5.00] Total: 11440 W: 1960 L: 1792 D: 7688 http://tests.stockfishchess.org/tests/view/5b8a329f0ebc592cf2749615 Closes https://github.com/official-stockfish/Stockfish/pull/1761 Bench: 4609645	2018-09-01 11:30:38 +02:00
Rocky640	f923dc0fe5	Long Diagonal Tweaks a) Reduce PSQT values along the long diagonals on non-central squares and increase the LongDiagonal bonus accordingly. The effect is to penalise bishops on the long diagonal which can not "see" the 2 central squares. The "good" bishops still have more or less the same bonus as current master. b) For a bishop on a central square, because of the "\| s" term in the code, the LongDiagonalBonus was always given. So while being there, remove the "\| s" and compensate the central Bishop PSQT accordingly. Passed STC LLR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 44498 W: 9658 L: 9323 D: 25517 http://tests.stockfishchess.org/tests/view/5b8992770ebc592cf2748942 Passed LTC LLR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 63092 W: 10324 L: 9975 D: 42793 http://tests.stockfishchess.org/tests/view/5b89a17a0ebc592cf2748b59 Closes https://github.com/official-stockfish/Stockfish/pull/1760 bench: 4693901	2018-09-01 04:33:17 +02:00
protonspring	e846a9306d	Remove PawnsOnBothFlanks It looks like PawnsOnBothFlanks can be removed from initiative(). A barrage of tests seem to confirm that the adjustment to -110 does not gain elo to offset any potential loss by removing PawnsOnBothFlanks. STC LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 22014 W: 4760 L: 4639 D: 12615 http://tests.stockfishchess.org/tests/view/5b7f50cc0ebc5902bdbb3a3e LTC LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 40561 W: 6667 L: 6577 D: 27317 http://tests.stockfishchess.org/tests/view/5b801f9f0ebc5902bdbb4467 The barrage of 0,4 tests on the -136 value are in my ps_tunetests branch. http://tests.stockfishchess.org/tests/user/protonspring Closes https://github.com/official-stockfish/Stockfish/pull/1751 Bench: 4413173 ------------- How to continue from there? The fact that endgames with all the pawns on only one flank are drawish is a well-known chess idea, so it seems quite strange that this can be removed so easily without losing Elo. In the past there had been attempts to improve on PawnsOnBothFlanks with similar concepts (for instance using the pawn span value), but the tests were at best neutral. Maybe Stockfish is now mature enough that these refined ideas would work to replace PawnsOnBothFlanks?	2018-08-29 02:49:10 +02:00
MJZ1977	10bb2e6cdb	Fix bug with "excludedMove" for probcut Bugfix: "excludedMove" has to be skipped in the probcut loop too. If it is not skipped, the probcut can exit quickly with a wrong return value corresponding to the excluded move. See the following forum thread for a discussion: https://groups.google.com/forum/?fromgroups=#!topic/fishcooking/GGithf_VwSU STC : LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 17130 W: 3747 L: 3617 D: 9766 http://tests.stockfishchess.org/tests/view/5b8460c40ebc5902bdbb999a LTC : LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 12387 W: 2064 L: 1930 D: 8393 http://tests.stockfishchess.org/tests/view/5b8466f90ebc5902bdbb9a21 To go further : it can be perhaps useful to tune the singular extension search parameters. Closes https://github.com/official-stockfish/Stockfish/pull/1754 Bench: 4308541	2018-08-29 02:28:09 +02:00
Steinar H. Gunderson	166bf90e41	Shrink the hash table of tablebases back to 4096 entries There is no need to make this as large as 65536 just for the sake of the single 7-man tablebase that happens to have the key 0xf9247fff. Idea for the fix by Ronald de Man, who suggested simply to allow more buckets past the end. We also implement Robin Hood hashing for the hash table, which takes the worst -case search for full 7-man tablebases down from 68 to 11 probes (Also takes the average probe length from 2.06 to 2.05). For a table with 8K entries, the corresponding numbers would be worst-case from 9 to 4, with average from 1.30 to 1.29. https://github.com/official-stockfish/Stockfish/pull/1747 No functional change	2018-08-29 02:00:20 +02:00
Ondrej Mosnacek	4aa091cf44	Refactor pure static eval code This commit tries to make the new pure static eval code more readable by splitting up the nested assignments into separate lines and making a few more cosmetic tweaks. No functional change.	2018-08-29 01:24:45 +02:00
protonspring	8a4821923a	make DistanceRing more consistent This is a non-functional change. By pre-incrementing minKingPawnDistance instead of post-incrementing, we can remove this -1. This also makes DistanceRing more consistent with the rest of stockfish since it now holds an actual "distance" instead of a less natural distance-1. In current master, PseudoAttacks[KING][ksq] == DistanceRingBB[ksq][0] With this patch, it will be PseudoAttacks[KING][ksq] == DistanceRingBB[ksq][1] ie squares at distance 1 from the king. This is more natural use of distance. The current array size DistanceRingBB[SQUARE_NB][8] is still OK with the new definition, because maximum distance between two squares on a chess board is seven (for example Kh1 and a8). No functional change.	2018-08-29 01:07:38 +02:00
Vizvezdenec	6307fd08e6	Tweak stat bonus formula Tweak stat bonus formula on top of latest elo gain by @snicolet STC http://tests.stockfishchess.org/tests/view/5b830a810ebc5902bdbb7e9c LLR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 27797 W: 6113 L: 5842 D: 15842 LTC http://tests.stockfishchess.org/tests/view/5b831f2c0ebc5902bdbb8038 LLR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 13655 W: 2294 L: 2099 D: 9262 I think that more elo can be found in tweaks of this parameters so I plan to further try some "hand-tuning", including increasing/decreasing ratio of two constants and making bonus assimetric to 0. Thx to @AndyGrant for helping with github and @jerrydonaldwatson for original idea. Closes https://github.com/official-stockfish/Stockfish/pull/1748 Bench: 4172767	2018-08-29 00:53:31 +02:00
VoyagerOne	3ac3b68540	Don't modify Eval with search stats at ttHits STC: LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 28344 W: 6148 L: 6040 D: 16156 http://tests.stockfishchess.org/tests/view/5b7d6b4e0ebc5902bdbb1914 LTC: LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 41084 W: 6769 L: 6680 D: 27635 http://tests.stockfishchess.org/tests/view/5b7d7f5b0ebc5902bdbb1b85 Bench: 4457440	2018-08-29 00:41:53 +02:00
Stefan Geschwentner	28543cddc6	Store only unchanged static evaluations in TT A recent commit introduced a decrease of the static evaluation of an inner node dependent on the previous stat score, which finally was also stored in the transposition table. Now only the unchanged static evaluation are stored there. Remark: For the case that a static evaluation can be retrieved from the transposition table the value is now used unchanged. Another test which also applies the modification in this case failed: http://tests.stockfishchess.org/tests/view/5b7af6df0ebc5902bdbae2f6 STC: LLR: 2.95 (-2.94,2.94) [0.00,5.00] Total: 6707 W: 1547 L: 1383 D: 3777 http://tests.stockfishchess.org/tests/view/5b7a92df0ebc5902bdbadcf3 LTC: LLR: 2.95 (-2.94,2.94) [0.00,5.00] Total: 36203 W: 6046 L: 5781 D: 24376 http://tests.stockfishchess.org/tests/view/5b7abaa10ebc5902bdbadfa9 Closes https://github.com/official-stockfish/Stockfish/pull/1742 Bench: 4457440	2018-08-20 21:52:29 +02:00
Stéphane Nicolet	f3b8a69919	Use an affine formula to mix stats and eval Follow-up for the previous patch: we use an affine formula to mix stats and evaluation in search. The idea is to give a bonus if the previous move of the opponent was historically bad, and a malus if the previous move of the opponent was historically good. More precisely, if x is the stat score of the previous move by the opponent, we implement the following formulas to tweak the evaluation at an internal node of the tree for our pruning decisions at this node: if x = 0, use v' = eval(P) if x > 0, use v' = eval(P) - 5 - x/1024 if x < 0, use v' = eval(P) + 5 - x/1024 For reference, the previous master had this simpler rule: if x > 0, use v' = eval(P) - 10 if x <= 0, use v' = eval(P) STC: LLR: 2.95 (-2.94,2.94) [0.00,5.00] Total: 29322 W: 6359 L: 6088 D: 16875 http://tests.stockfishchess.org/tests/view/5b76a5980ebc5902bdba957f LTC: LLR: 2.96 (-2.94,2.94) [0.00,5.00] Total: 30893 W: 5154 L: 4910 D: 20829 http://tests.stockfishchess.org/tests/view/5b76ca6d0ebc5902bdba9914 Closes https://github.com/official-stockfish/Stockfish/pull/1740 Bench: 4592766	2018-08-18 01:23:36 +02:00
VoyagerOne	96c3a1f2ec	Mix search stats with evaluation Mix search stats with evaluation: if the opponent's move has a good historyStat, then decrease the evaluation of the internal node a bit for the pruning decisions during search. STC; LLR: 2.96 (-2.94,2.94) [0.00,5.00] Total: 72083 W: 15683 L: 15203 D: 41197 http://tests.stockfishchess.org/tests/view/5b74c3ea0ebc5902bdba7d41 LTC: LLR: 2.95 (-2.94,2.94) [0.00,5.00] Total: 29104 W: 4867 L: 4630 D: 19607 http://tests.stockfishchess.org/tests/view/5b7565000ebc5902bdba851b Closes https://github.com/official-stockfish/Stockfish/pull/1738 Bench: 4514101 ----------- How to continue from there? • the use of the previous stat score can probably be simplified in lines 587 and 716 • we could try to use a continuous bonus based on the previous stat score, instead of just a fixed offset of -10 when the opponent previous move was good. ---------- Comments by Stefan Geschwentner: Interesting idea. Because only the eval in search is tweak this should only influence the eval and static eval used at inner nodes, and not on the return search value (which comes in the end from quiescence search), except through saving in TT followed by a TT cutoff. So essentialy this effects diverse pruning/reduction parts -- eval and static eval are lowered for good opponent moves: • tt cutoff (ttValue) • improving (static eval) • more razoring (eval) • less futility pruning (eval) • less null move pruning (eval + static eval) (but with little more depth) • more probcut (static eval) • more move futility pruning (static eval)	2018-08-17 11:40:29 +02:00
protonspring	d0f09de2d2	Simplify king file dependancy in evaluate_shelter() Remove the special value we used for the file of the king in the evaluate_shelter() function, and compensate by tweaking some of the ShelterStrength[] array values. STC LLR: 2.94 (-2.94,2.94) [-3.00,1.00] Total: 17069 W: 3782 L: 3652 D: 9635 http://tests.stockfishchess.org/tests/view/5b75eb0d0ebc5902bdba8f3d LTC LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 42639 W: 6973 L: 6887 D: 28779 http://tests.stockfishchess.org/tests/view/5b75fd7f0ebc5902bdba906b Closes https://github.com/official-stockfish/Stockfish/pull/1739 Bench: 4639508	2018-08-17 10:21:20 +02:00
Stéphane Nicolet	881cab2525	Double weight of capture history We double in this patch the weight of the capture history table in the local scoring of captures for move ordering. The capture history table is indexed by the triplet (capturing piece, capture square, captured piece) and gets information like "it seems to have been historically good in that part of the search tree to capture a pawn with a rook on g3, even if it seems to lose material", and affect the normaly pure « Most Valuable Victim » ordering of captures. Finished yellow at STC after 228842 games (posting a +1.36 Elo gain): LLR: -2.95 (-2.94,2.94) [0.00,4.00] Total: 228842 W: 50894 L: 50152 D: 127796 http://tests.stockfishchess.org/tests/view/5b714bb00ebc5902bdba332d Passed LTC: LLR: 2.96 (-2.94,2.94) [0.00,4.00] Total: 43251 W: 7425 L: 7131 D: 28695 http://tests.stockfishchess.org/tests/view/5b71c7d40ebc5902bdba3e51 Thanks to user Vizvezdenec for running the LTC test. Closes https://github.com/official-stockfish/Stockfish/pull/1736 Bench: 4272361	2018-08-14 10:12:31 +02:00
Alain SAVARD	4d22d3e52d	Remove pawncount array in imbalance This is a natural follow up to last commit where values on the QuadraticOurs diagonal and some piece value deltas were changed. @Stefano80 tried to simplify the newly introduced pawncount array using QuadraticOurs[1][1] =52 and a -30 adjustment on pawn values His STC [-3,1] was green http://tests.stockfishchess.org/tests/view/5b707f5b0ebc5902bdba2745 but not his LTC[-3,1] http://tests.stockfishchess.org/tests/view/5b7095700ebc5902bdba2a49 So I started a 80000 30+0.3 SPSA on the QuadraticOurs diagonal and on the piece values using @Stefano80 start values. SPSA gave the new values QuadraticOurs[1][1] =38 and a -33 on pawn values (the other changes on QuadraticOurs were kept, but were not ignificant according to this test http://tests.stockfishchess.org/tests/view/5b710ccb0ebc5902bdba2f27) STC http://tests.stockfishchess.org/tests/view/5b710b220ebc5902bdba2f19 LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 50902 W: 11214 L: 11150 D: 28538 LTC http://tests.stockfishchess.org/tests/view/5b7124ef0ebc5902bdba3106 LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 34271 W: 5852 L: 5753 D: 22666 Closes https://github.com/official-stockfish/Stockfish/pull/1735 bench: 4738555	2018-08-14 08:36:27 +02:00
GuardianRM	41cc4eb953	Non-linear bonus for pawn count This patch introduces a non-linear bonus for pawns, along with some (linear) corrections for the other pieces types. The original values were obtained by a massive non-linear tuning of both pawns and other pieces by GuardianRM, while Alain Savard and Chris Cain later simplified the patch by observing that, apart from the pawn case, the tuned corrections were in fact almost affine and could be incorporated in our current code base via the piece values in types.h (offset) and the diagonal of the quadratic matrix (slope). See discussion in PR#1725 : https://github.com/official-stockfish/Stockfish/pull/1725 STC: LLR: 2.97 (-2.94,2.94) [0.00,5.00] Total: 42948 W: 9662 L: 9317 D: 23969 http://tests.stockfishchess.org/tests/view/5b6ff6e60ebc5902bdba1d87 LTC: LLR: 2.97 (-2.94,2.94) [0.00,5.00] Total: 19683 W: 3409 L: 3206 D: 13068 http://tests.stockfishchess.org/tests/view/5b702dbd0ebc5902bdba216b How to continue from there? - Maybe the non-linearity for the pawn value could be somewhat tempered again and a simpler linear correction for pawns would work? Closes https://github.com/official-stockfish/Stockfish/pull/1734 Bench: 4681496	2018-08-12 18:40:11 +02:00
Stefano Cardanobile	b5581b7779	Combo of several promising parameter tweaks Combo of several tuning patches which finished yellow at LTC. [STC](http://tests.stockfishchess.org/tests/view/5b6ead340ebc5902bdba14ce) LR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 10668 W: 2445 L: 2239 D: 5984 Elo: 6.25 [1.76,10.69] (95%) [LTC](http://tests.stockfishchess.org/tests/view/5b6eb50e0ebc5902bdba151f) LLR: 2.96 (-2.94,2.94) [0.00,4.00] Total: 23761 W: 4155 L: 3923 D: 15683 Elo: 3.02 [0.29,5.67] (95%) Original patches: - [Piece values](http://tests.stockfishchess.org/tests/view/5b6d2cc00ebc5902bdba02d5) by Stefano Cardanobile - [Stat bonus](http://tests.stockfishchess.org/tests/view/5b6adbc90ebc5902bdb9da73) by Stefan Geschwentner - [Rook on pawn](http://tests.stockfishchess.org/tests/view/5b62a95b0ebc5902bdb961c0) by Mark Tenzer - [Hanging bonus](http://tests.stockfishchess.org/tests/view/5b5d2fa00ebc5902bdb90855) by Ivan Ilvec - [ss tweak](http://tests.stockfishchess.org/tests/view/5b58b7240ebc5902bdb89025) by miguel-l Bench: 4694813	2018-08-12 10:09:30 +02:00
Jerry Donald Watson	348cd5ed74	Simple razoring: depth 1 only, no distinction between PV / NonPV We simplify the razoring logic by applying it to all nodes at depth 1 only. An added advantage is that only one razor margin is needed now, and we treat PV and Non-PV nodes in the same manner. How to continue? - There may be some conditions in which depth 2 razoring is beneficial. - We can see whether the razor margin can be tuned, perhaps even with a different value for PV nodes. - Perhaps we can unify the treatment of PV and Non-PV nodes in other parts of the search as well. STC: LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 5474 W: 1281 L: 1127 D: 3066 http://tests.stockfishchess.org/tests/view/5b6de3b20ebc5902bdba0d1e LTC: LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 62670 W: 10749 L: 10697 D: 41224 http://tests.stockfishchess.org/tests/view/5b6dee340ebc5902bdba0eb0 In addition, we ran a fixed LTC test against a similar patch which also passed SPRT [-3, 1]: ELO: 0.23 +-2.1 (95%) LOS: 58.6% Total: 36412 W: 6168 L: 6144 D: 24100 http://tests.stockfishchess.org/tests/view/5b6e83940ebc5902bdba1485 We are opting for this patch as the more logical and simple of the two, and it appears to be no less strong. Thanks in particular to @DU-jdto for input into this patch. Bench: 4476945	2018-08-12 09:54:16 +02:00
Miguel Lahoz	f1088c9822	Remove Condition For Passed Pawns Currently, we do not consider pawns passed if there is another pawn of the same color in front of them. It appears that this condition is not necessary. The idea is that the doubled pawns are likely to be weak and one of them will be likely captured anyway. On the other hand, if we do somehow manage to promote a pawn, then the pawn behind it becomes passed as well. In any case, the end result is we end up with an extra potentially passed pawn. The current evaluation for passed pawns already handles this case by also scaling down this effect. STC: LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 28291 W: 6287 L: 6178 D: 15826 http://tests.stockfishchess.org/tests/view/5b6c4b960ebc5902bdb9f256 LTC: LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 30717 W: 5256 L: 5151 D: 20310 http://tests.stockfishchess.org/tests/view/5b6c82980ebc5902bdb9f863 Bench: 4938285	2018-08-10 06:16:29 +02:00
Stefan Geschwentner	198418ee67	LMR simplification Unify the "quiet" and "non-quiet" reduction rules for use at any kind of moves. The idea behind it was that both rules reduce at similiar cases in master: one directly for late previous moves and the other indirectly by using a bad stat score which is used for most move sorting and so approximates the late move condition. For captures/promotions the old rule was triggered in 25% but the new rule only for 3% of all cases (so now more reductions are done, whereas for quiet moves reductions keep the same level). STC: LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 162327 W: 35976 L: 36134 D: 90217 http://tests.stockfishchess.org/tests/view/5b6a9a430ebc5902bdb9d5c1 LTC: LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 29570 W: 5083 L: 4976 D: 19511 http://tests.stockfishchess.org/tests/view/5b6bc5d00ebc5902bdb9e9d6 Bench: 4526980	2018-08-09 14:45:35 +02:00
Stefano Cardanobile	bd4d2b0576	First check threshold in space evaluation Currently, we first calculate some bitboards at the top of Evaluation::space() and then check whether we actually need them. Invert the ordering. Of course this does not make a difference in current master because the constexpr bitboard calculations are in fact done at compile time by any decent compiler, but I find my version a bit healthier since it will always meet or exceed current implementation even if we eventually change the spaceMask to something not contsexpr. No functional change.	2018-08-08 17:58:41 +02:00
FauziAkram	c569cf263d	King Psqt Tuning After a session of tuning for King Psqt I got some new values, which was later tweaked manually by me Fauzi, to result in an Elo-gain patch which seems to scale pretty well: STC: LLR: -2.96 (-2.94,2.94) [0.00,4.00] Total: 100653 W: 22550 L: 22314 D: 55789 [Yellow patch] LTC: LLR: 2.96 (-2.94,2.94) [0.00,4.00] Total: 147079 W: 25584 L: 24947 D: 96548 [Green Patch] Bench: 4669050	2018-08-08 17:49:16 +02:00
Stefano Cardanobile	d96c1c32a2	Introduce voting system for best move selection Introduce voting system for best move selction in multi-threads mode. Joint work with Stefan Geschwentner, based on ideas introduced by Michael Stembera. Moves are upvoted by every thread using the margin to the minimum score across threads and the completed depth. First thread voting for the winner move is selected as best thread. Passed STC, LTC. A further LTC test with only 4 threads failed with positive score. A LTC with 31 threads was stopped with LLR 0.77 after 25k games to avoid use of excessive resources (equivalent to 1.5M STC games). Similar ideas were proposed by Michael Stembera 2 years ago #507, #508. This implementation seems simpler and more understandable, the results slightly more promising. Further possible work: 1) Tweak of the formula using for assigning votes. 2) Use a different baseline for the score dependent part: maximum score or winning probability could make more sense. 3) Assign votes in `Thread::Search` as iterations are completed and use voting results to stop search. 4) Select best thread as the threads voting for best move with the highest completed depth or, alternatively, vote on PV moves. Link to SPRT tests [stopped LTC, 31 threads 20+0.02](http://tests.stockfishchess.org/tests/view/5b61dc090ebc5902bdb95192) LLR: 0.77 (-2.94,2.94) [0.00,5.00] Total: 25602 W: 3977 L: 3850 D: 17775 Elo: 1.70 [-0.68,4.07] (95%) [passed LTC, 8 threads 20+0.02](http://tests.stockfishchess.org/tests/view/5b5df5180ebc5902bdb9162d) LLR: 2.96 (-2.94,2.94) [0.00,5.00] Total: 44478 W: 7602 L: 7300 D: 29576 Elo: 1.92 [-0.29,3.94] (95%) [failed LTC, 4 threads 20+0.02](http://tests.stockfishchess.org/tests/view/5b5f39ef0ebc5902bdb92792) LLR: -2.94 (-2.94,2.94) [0.00,5.00] Total: 29922 W: 5286 L: 5285 D: 19351 Elo: 0.48 [-1.98,3.10] (95%) [passed STC, 4 threads 5+0.05](http://tests.stockfishchess.org/tests/view/5b5dbf0f0ebc5902bdb9131c) LLR: 2.97 (-2.94,2.94) [0.00,5.00] Total: 9108 W: 2033 L: 1858 D: 5217 Elo: 6.11 [1.26,10.89] (95%) No functional change (in simple threat mode)	2018-08-08 17:34:12 +02:00

... 4 5 6 7 8 ...

4697 commits