This patch corrects both MultiPV behaviour and "go searchmoves" behaviour
for tablebases.
We change the logic of table base probing at root positions from filtering
to ranking. The ranking code is much more straightforward than the current
filtering code (this is a simplification), and also more versatile.
If the root is a TB position, each root move is probed and assigned a TB score
and a TB rank. The TB score is the Value to be displayed to the user for that
move (unless the search finds a mate score), while the TB rank determines which
moves should appear higher in a multi-pv search. In game play, the engine will
always pick a move with the highest rank.
Ranks run from -1000 to +1000:
901 to 1000 : TB win
900 : normally a TB win, in rare cases this could be a draw
1 to 899 : cursed TB wins
0 : draw
-1 to -899 : blessed TB losses
-900 : normally a TB loss, in rare cases this could be a draw
-901 to -1000 : TB loss
Normally all winning moves get rank 1000 (to let the search pick the best
among them). The exception is if there has been a first repetition. In that
case, moves are ranked strictly by DTZ so that the engine will play a move
that lowers DTZ (and therefore cannot repeat the position a second time).
Losing moves get rank -1000 unless they have relatively high DTZ, meaning
they have some drawing chances. Those get ranks towards -901 (when they
cross -900 the draw is certain).
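For illustration only (this is not the actual probing code), a tiny helper mapping a rank from the scheme above to its band could look like this:
```cpp
// Illustrative only: classify a TB rank according to the bands described above.
enum class TBBand { Win, LikelyWin, CursedWin, Draw, BlessedLoss, LikelyLoss, Loss };

TBBand band_of(int rank) {
    if (rank >   900) return TBBand::Win;          //  901 .. 1000
    if (rank ==  900) return TBBand::LikelyWin;    //  normally a win, rarely a draw
    if (rank >     0) return TBBand::CursedWin;    //    1 ..  899
    if (rank ==    0) return TBBand::Draw;
    if (rank >  -900) return TBBand::BlessedLoss;  // -899 ..   -1
    if (rank == -900) return TBBand::LikelyLoss;   //  normally a loss, rarely a draw
    return TBBand::Loss;                           // -1000 .. -901
}
```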
Closes https://github.com/official-stockfish/Stockfish/pull/1467
No functional change (without tablebases).
Use a recursive std::array with variadic template
parameters to get rid of the last redundancy.
The first template parameter T is the base type of
the array, the W parameter is the weight applied to
the bonuses when we update values with the << operator,
the D parameter limits the range of updates (range is
[-W * D, W * D]), and the last parameters (Size and
Sizes) encode the dimensions of the array.
This allows greater flexibility because we can now tweak
the range [-W * D, W * D] for each table.
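A minimal sketch of such a recursive variadic array (simplified, with hypothetical dimensions and constants; a named update() stands in for the operator<< mentioned above):
```cpp
#include <array>
#include <cstdlib>

// Sketch only: T is the entry type, W the update weight, D the saturation bound,
// and Size/Sizes... the dimensions of the N-dimensional array.
template<typename T, int W, int D, int Size, int... Sizes>
struct Stats : public std::array<Stats<T, W, D, Sizes...>, Size> {};

// Base case: the innermost dimension holds the actual values.
template<typename T, int W, int D, int Size>
struct Stats<T, W, D, Size> : public std::array<T, Size> {
    // Typical saturating update: with |bonus| <= D, entries stay in [-W * D, W * D].
    void update(T& entry, int bonus) {
        entry += bonus * W - entry * std::abs(bonus) / D;
    }
};

// Example instantiation (hypothetical sizes): a [16][64] history table of ints.
using ExampleHistory = Stats<int, 32, 324, 16, 64>;
```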
The patch removes more lines than it adds and streamlines
the Stats soup in movepick.h.
Closes PR#1422 and PR#1421
No functional change.
1. Avoid recursive calls of the verification search.
For the interested side to move, recursion makes no sense.
For the other side it could make sense in case of mutual zugzwang,
but I was not able to figure out any concrete problematic position.
Allows the removal of 2 local variables.
2. avoid further reduction by removing R += ONE_PLY;
Benchmark with zugzwang-suite (see #1338), max 45 secs per position:
Patch solves 33 out of 37
Master solves 31 out of 37
STC:
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 76188 W: 13866 L: 13840 D: 48482
http://tests.stockfishchess.org/tests/view/5a5612ed0ebc590297da516c
LTC:
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 40479 W: 5247 L: 5152 D: 30080
http://tests.stockfishchess.org/tests/view/5a56f7d30ebc590299e4550e
bench: 5340015
The heuristic to avoid thread binding if fewer than 8 threads are requested resulted in the first 7 threads not being bound.
The branch was verified to yield a roughly 13% speedup by @CoffeeOne on the appropriate hardware and OS, and an earlier version of this patch tested well on his machine:
http://tests.stockfishchess.org/tests/view/5a3693480ebc590ccbb8be5a
ELO: 9.24 +-4.6 (95%) LOS: 100.0%
Total: 5000 W: 634 L: 501 D: 3865
To make sure all threads (including mainThread) are bound as soon as the total number exceeds 7, recreate all threads on a change of thread number.
To do this, Threads::init, Threads::exit and Threads::set are unified into a single Threads::set() function that goes through the needed steps.
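A much-simplified sketch of that approach, with hypothetical types (the real code also waits for threads to finish and performs the actual binding):
```cpp
#include <cstddef>
#include <vector>

// Sketch only: on every change of the thread count, tear down all existing
// threads and create the new set, so binding decisions are taken with the
// final count known (including the main thread).
struct Thread {
    explicit Thread(size_t id) : idx(id) { /* bind to a NUMA node/group here */ }
    size_t idx;
};

struct ThreadPool {
    std::vector<Thread*> threads;

    void set(size_t requested) {
        while (!threads.empty()) {             // delete the old threads first
            delete threads.back();
            threads.pop_back();
        }
        for (size_t i = 0; i < requested; ++i)
            threads.push_back(new Thread(i));  // recreate, binding each one
    }
};
```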
The code includes several suggestions from @joergoster.
Fixes issue #1312
No functional change
After increasing the number of threads, the histories were not cleared,
resulting in uninitialized memory usage.
This patch fixes this by clearing the threads' histories in the Thread
constructor, as is the idiomatic way.
This fixes issue 1227
No functional change.
We use Position::set() to set the root position across
threads. But there are some StateInfo fields (previous,
pliesFromNull, capturedPiece) that cannot be deduced
from a FEN string, so set() clears them; to avoid losing
that info we need to back up and later restore setupStates->back().
Note that setupStates is shared by threads but is accessed
in read-only mode.
This fixes a regression introduced by df6cb446ea
Tested with 3 threads at STC:
LLR: 2.95 (-2.94,2.94) [-4.00,0.00]
Total: 14436 W: 2304 L: 2196 D: 9936
Bench: 5608839
Simplify away low-level sync stuff (mutexes
and friends) and avoid using them directly
in many functions.
Also some renaming and better comments while
there.
No functional change.
In this rare case (e.g. go infinite on a stalemate),
just spin till ponderhit/stop comes.
Thread::wait() is a remnant of the old YBWC
code; today, with lazy SMP, threads don't need to
wait when outside of their idle loop.
No functional change.
But this time correctly set Threads.ponder
We avoid using 'limits' for passing the pondering
flag because we don't want to have two ponder
variables in search scope: Search::Limits.ponder
and Threads.ponder. This would also be confusing
because limits.ponder is set at the beginning of
the search and never changes, whereas Threads.ponder
can change value asynchronously during the search.
No functional change.
This reverts commit 5410424e3d.
After the commit pondering is broken, so revert for now. I will
resubmit with a proper fix.
The issue is mine; Joost's original code is correct.
No functional change.
Limits::ponder was used as a signal between uci and search threads,
but is not an atomic variable, leading to the following race as
flagged by a sanitized binary.
Expect input:
```
spawn ./stockfish
send "uci\n"
expect "uciok"
send "setoption name Ponder value true\n"
send "go wtime 4000 btime 4000\n"
expect "bestmove"
send "position startpos e2e4 d7d5\n"
send "go wtime 4000 btime 4000 ponder\n"
sleep 0.01
send "ponderhit\n"
expect "bestmove"
send "quit\n"
expect eof
```
Race:
```
WARNING: ThreadSanitizer: data race (pid=7191)
Read of size 4 at 0x0000005c2260 by thread T1:
Previous write of size 4 at 0x0000005c2260 by main thread:
Location is global 'Search::Limits' of size 88 at 0x0000005c2220 (stockfish+0x0000005c2260)
```
The reason for the race is that ponder is not just set in the UCI go()
assignment but is also signaled by an async ponderhit in uci.cpp:
else if (token == "ponderhit")
Search::Limits.ponder = 0; // Switch to normal search
The fix is to add an atomic bool to the threads structure to
signal the ponder status, letting Search::Limits reflect just
what was passed to 'go'.
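A minimal sketch of that arrangement (details simplified, names follow the description above):
```cpp
#include <atomic>

// Sketch only: the asynchronous ponder signal lives in the thread pool as an
// atomic flag, while Search::Limits keeps only what 'go' passed.
struct ThreadPool {
    std::atomic_bool ponder{false};  // written by the UCI thread, read by search threads
};

ThreadPool Threads;

// In the UCI loop (sketch):
//   else if (token == "ponderhit")
//       Threads.ponder = false;     // switch to normal search
```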
No functional change.
Instead of having Signals in the search namespace,
make the stop variables part of the Threads structure.
This moves more of the shared (atomic) variables towards
the thread-related structures, making their role more clear.
No functional change
Closes #1149
The main change of the patch is that now the time check
is done only by the main thread. In the past, before lazy
SMP, we needed all the threads to check for available
time because the main thread could have been blocked on
a split point; now this is no longer the case and the main
thread can do the job alone, greatly simplifying the logic.
Verified for regression testing on STC with 7 threads:
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 11895 W: 1741 L: 1608 D: 8546
No functional change.
Closes #1152
The nodes, tbHits, rootDepth and lastInfoTime variables are read by multiple threads but are not declared atomic, leading to data races as found by -fsanitize=thread. This patch fixes the issue. It is based on top of the CI-threading branch (PR #1129) and should fix the corresponding CI error messages.
The patch passed an STC check for no regression:
http://tests.stockfishchess.org/tests/view/5925d5590ebc59035df34b9f
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 169597 W: 29938 L: 30066 D: 109593
Whereas rootDepth and lastInfoTime are not performance critical, nodes and tbHits are. Indeed, an earlier version using relaxed atomic updates on the latter two variables failed STC testing (http://tests.stockfishchess.org/tests/view/592001700ebc59035df34924), which can be shown to be due to x86-32 (http://tests.stockfishchess.org/tests/view/592330ac0ebc59035df34a89), as that architecture has no instruction to atomically update a 64-bit variable. The proposed solution thus uses a variable in Position that is accessed only by one thread, which is copied every few thousand nodes to the shared variable in Thread.
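A sketch of that scheme (names and flush threshold are illustrative, not the exact Stockfish code):
```cpp
#include <atomic>
#include <cstdint>

// Sketch only: the per-node counter is owned by a single thread and flushed to
// the shared atomic only every few thousand nodes, avoiding an atomic 64-bit
// update on every node (costly or unavailable on x86-32).
struct Thread {
    std::atomic<uint64_t> nodes{0};  // shared: read by other threads for "info nodes"
};

struct Position {
    Thread*  thisThread = nullptr;
    uint64_t localNodes = 0;         // touched only by the owning thread

    void count_node() {
        if (++localNodes >= 4096) {  // illustrative flush threshold
            thisThread->nodes.fetch_add(localNodes, std::memory_order_relaxed);
            localNodes = 0;
        }
    }
};
```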
No functional change.
Closes #1130, closes #1129
If not explicitly initialized in a class constructor,
all data members are default-initialized when
the corresponding struct/class is instantiated.
For arrays and built-in types (int, char, etc.)
default-initialization is a no-op and we need to
explicitly zero them.
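A minimal illustration of the pitfall (hypothetical struct):
```cpp
// Default-initialization leaves built-in members and arrays indeterminate.
struct Histories {
    int table[64];
    Histories() {}                  // table[] holds garbage after construction
};

struct HistoriesFixed {
    int table[64];
    HistoriesFixed() : table{} {}   // explicitly value-initializes (zeroes) the array
};
```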
No functional change.
Under Windows it is not possible for a process to run on more than one
logical processor group. This usually means being limited to at most 64
cores. To overcome this, a special platform-specific API has to be
called to set the group affinity for each thread. Original code from Texel
by Peter Österlund.
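A hedged sketch of how such a binding could look with the Windows processor-group API (this is not the exact Stockfish code; the group choice here is a naive round-robin):
```cpp
#include <cstddef>
#ifdef _WIN32
#include <windows.h>

// Sketch only: pick a processor group from the thread index and bind the
// calling thread to it, so machines with more than 64 logical CPUs use all
// groups. Requires Windows 7 or later.
void bind_this_thread(size_t idx) {
    WORD groupCount = GetActiveProcessorGroupCount();
    GROUP_AFFINITY affinity = {};
    affinity.Group = WORD(idx % groupCount);               // naive group choice
    DWORD cpus = GetActiveProcessorCount(affinity.Group);  // CPUs in that group
    affinity.Mask = cpus >= sizeof(KAFFINITY) * 8 ? ~KAFFINITY(0)
                                                  : (KAFFINITY(1) << cpus) - 1;
    SetThreadGroupAffinity(GetCurrentThread(), &affinity, nullptr);
}
#endif
```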
Tested by Jean-Paul Vael on a Xeon E7-8890 v4 with 88 threads, confirming
that the speed-up between 44 and 88 threads is about 30%, as expected.
No functional change.
Restore original behaviour to reset
the counter before a new move search.
Also fixed some warnings and added const
qualifier to a couple of functions, as
suggested by m_stembera.
Thanks to Werner Bergmans for reporting
the regression.
No functional change.
Use a per-thread counter to reduce contention
with many cores and endgame positions.
Measured around 1% speed-up on 12 cores and 8%
on 28 cores with 6-men tablebases, searching on:
7R/1p3k2/2p2P2/3nR1P1/8/3b1P2/7K/r7 b - - 3 38
Also retire the unused set_nodes_searched() and fix
a couple of return types and naming conventions.
No functional change.
Both Tablebases::filter_root_moves() and
extract_ponder_from_tt() were unable to handle
a mate/stalemate position.
Spotted and reported by Dann Corbit.
Added some mate/stalemate positions to bench so as to
catch this regression early in the future.
No functional change.
Currently root moves are copied to all the threads
but are DTZ-filtered only in the main thread at the
beginning of the search.
This patch moves the TB filtering before the
copy of the root moves, fixing issue #679
(https://github.com/official-stockfish/Stockfish/issues/679).
No bench change.
In this position we should have a draw by repetition:
position fen rnbqkbnr/2pppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w KQkq - 0 1 moves g1f3 g8f6 f3g1
go infinite
But the latest patch broke it.
Actually we had two(!) very subtle bugs. The first is that Position::set()
clears the passed state, and in particular the 'previous' member, so
that on passing setupStates, the 'previous' pointer was reset.
The second bug is even more subtle: setupStates was based on std::vector
as container, but when a vector grows, std::vector copies all its contents
to a new location, invalidating all references to its entries. Because
all StateInfo records are linked by the 'previous' pointer, this made pointers
go stale upon adding more elements to setupStates. So revert to using a
std::deque, which ensures references are preserved when pushing back new
elements.
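A small self-contained illustration of why std::deque is safe here while std::vector is not (StateInfo is reduced to its 'previous' link):
```cpp
#include <deque>

// Sketch only: deque::push_back never invalidates references or pointers to
// existing elements, whereas a growing vector may reallocate and invalidate them.
struct StateInfo { StateInfo* previous = nullptr; };

int main() {
    std::deque<StateInfo> states(1);                          // root state
    for (int i = 0; i < 1000; ++i) {
        states.emplace_back();
        states.back().previous = &states[states.size() - 2];  // stays valid with a deque
    }
    // With std::vector the same 'previous' pointers could dangle after a reallocation.
}
```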
No functional change.
And passed in do_move(), this ensures maximum efficiency and
speed and at the same time an unlimited number of moves.
The drawback is that to handle Position init we need to
reserve a StateInfo inside Position itself and use it at
init time and when copying from another Position.
After lazy SMP we no longer need this gimmick and we can
get rid of this special case and always pass an external
StateInfo to the Position object.
Also rewrote and simplified the Position constructors.
Verified it does not regress with a 3-thread SMP test:
ELO: -0.00 +-12.7 (95%) LOS: 50.0%
Total: 1000 W: 173 L: 173 D: 654
No functional change.
Simplify time management code by removing hard stops for unchanging first root moves.
Search is now stopped earlier at the end of an iteration if it did not have fail-lows at root.
This simplification also fixes a pondering bug. The Ponder flag was true by default
and cutechess-cli doesn't change it to false even though no pondering is possible.
Fix the issue by setting the default value of the 'Ponder' flag to false.
10+0.1:
ELO: 3.51 +-3.0 (95%) LOS: 99.0%
Total: 20000 W: 3898 L: 3696 D: 12406
40+0.4:
ELO: 1.39 +-2.7 (95%) LOS: 84.7%
Total: 20000 W: 3104 L: 3024 D: 13872
60+0.06:
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 37231 W: 5333 L: 5236 D: 26662
Stopped run at 100+1:
LLR: 1.09 (-2.94,2.94) [-3.00,1.00]
Total: 37253 W: 4862 L: 4856 D: 27535
Resolves #523, fixes #510
Instead of creating a running std::thread and
returning, wait in the Thread constructor until the native
thread of execution goes to sleep in idle_loop().
In this way we can simplify how the search is started,
because when the main thread is idle we are sure that
all the other threads are idle too, in any case, even
at thread creation and startup.
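A much-simplified sketch of a constructor that waits until its native thread is asleep in idle_loop() (assuming a plain condition variable; the real class carries more state):
```cpp
#include <condition_variable>
#include <mutex>
#include <thread>

// Sketch only: the constructor blocks until the newly spawned native thread has
// entered idle_loop() and gone to sleep, so a freshly constructed Thread is idle.
class Thread {
    std::mutex mutex;
    std::condition_variable cv;
    bool searching = true;                        // busy until idle_loop() starts
    bool exit      = false;
    std::thread nativeThread;                     // initialized last (declared last)

    void idle_loop() {
        while (true) {
            std::unique_lock<std::mutex> lk(mutex);
            searching = false;
            cv.notify_one();                      // unblock the constructor (or a waiter)
            cv.wait(lk, [&] { return searching || exit; });
            if (exit)
                return;
            // ... the actual search would run here ...
        }
    }

public:
    Thread() : nativeThread(&Thread::idle_loop, this) {
        std::unique_lock<std::mutex> lk(mutex);
        cv.wait(lk, [&] { return !searching; });  // wait until the thread is asleep
    }

    ~Thread() {
        {
            std::lock_guard<std::mutex> lk(mutex);
            exit = searching = true;              // wake the thread and ask it to quit
        }
        cv.notify_one();
        nativeThread.join();
    }
};
```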
After lazy SMP went in, we can simplify and rewrite
a lot of logic that is no longer needed. This is
hopefully the final big cleanup.
Tested for no regression at 5+0.1 with 3 threads:
LLR: 2.95 (-2.94,2.94) [-5.00,0.00]
Total: 17411 W: 3248 L: 3198 D: 10965
No functional change.
Now that we no longer have TimerThread, there is
no need for this long class hierarchy.
Also assorted reformatting while there.
To verify no regression, passed at STC with 7 threads:
LLR: 2.97 (-2.94,2.94) [-5.00,0.00]
Total: 30990 W: 4945 L: 4942 D: 21103
No functional change.
Unfortunately std::condition_variable::wait_for()
is not accurate in the general case and the timer thread
can wake up even tens or hundreds of
milliseconds after the time has elapsed. CPU load, process
priorities, the number of concurrent threads, even from
other processes, all have an effect on it.
Even the official documentation says: "This function may
block for longer than timeout_duration due to scheduling
or resource contention delays."
So retire the timer and use a polling scheme based on a
local thread counter that counts search() calls, plus
a small trick to keep the polling frequency constant,
independently of the number of threads.
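A rough sketch of that polling idea (the reset value 4096 and the names are illustrative):
```cpp
#include <cstddef>

// Sketch only: each thread counts its search() calls and polls the clock when
// its counter runs out; dividing the reset value by the number of threads keeps
// the overall polling frequency roughly constant regardless of thread count.
struct Thread { int callsCnt = 0; };

void check_time() { /* compare elapsed time against the allocated limits */ }

void poll(Thread& th, size_t threadCount) {
    if (--th.callsCnt > 0)
        return;
    th.callsCnt = int(4096 / threadCount);  // more threads -> each one polls less often
    check_time();
}
```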
Tested for no regression at very fast TC 2+0.05 th 7:
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 32969 W: 6720 L: 6620 D: 19629
TC 2+0.05 th 1:
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 7765 W: 1917 L: 1765 D: 4083
And at STC TC, both single thread
LLR: 2.96 (-2.94,2.94) [-3.00,1.00]
Total: 15587 W: 3036 L: 2905 D: 9646
And with 7 threads
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 8149 W: 1367 L: 1227 D: 5555
bench: 8639247
The only interesting change is moving
stack[MAX_PLY+4] back to its original position
in id_loop (now renamed Thread::search).
No functional change.
Rely on well defined behaviour for message passing, instead of volatile. Three
versions have been tested, to make sure this wouldn't cause a slowdown on any
platform.
v1: Sequentially consistent atomics
No measurable regression, despite the extra memory barriers on x86, even with 15
threads and extreme time pressure, both acting as a magnifying glass:
threads=15, tc=2+0.02
ELO: 2.59 +-3.4 (95%) LOS: 93.3%
Total: 18132 W: 4113 L: 3978 D: 10041
threads=7, tc=2+0.02
ELO: -1.64 +-3.6 (95%) LOS: 18.8%
Total: 16914 W: 4053 L: 4133 D: 8728
v2: Acquire/Release semantics
This version generates no extra barriers for x86 (on the hot path). As expected,
no regression either, under the same conditions:
threads=15, tc=2+0.02
ELO: 2.85 +-3.3 (95%) LOS: 95.4%
Total: 19661 W: 4640 L: 4479 D: 10542
threads=7, tc=2+0.02
ELO: 0.23 +-3.5 (95%) LOS: 55.1%
Total: 18108 W: 4326 L: 4314 D: 9468
As suggested by Joona, another test at LTC:
threads=15, tc=20+0.05
ELO: 0.64 +-2.6 (95%) LOS: 68.3%
Total: 20000 W: 3053 L: 3016 D: 13931
v3: Final version: SeqCst/Relaxed
threads=15, tc=10+0.1
ELO: 0.87 +-3.9 (95%) LOS: 67.1%
Total: 9541 W: 1478 L: 1454 D: 6609
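For illustration, the final SeqCst/Relaxed combination could look roughly like this on a stop flag (the split between rare writer and hot-path readers is an assumption for the sketch):
```cpp
#include <atomic>

// Sketch only: the rare writer uses the default sequentially consistent store,
// while hot-path readers use a relaxed load, which costs nothing extra on x86.
std::atomic_bool stopSignal{false};

void request_stop() {
    stopSignal = true;                                      // seq_cst store
}

bool should_stop() {
    return stopSignal.load(std::memory_order_relaxed);      // relaxed read on the hot path
}
```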
Resolves #474
Start all threads searching on the root position and
use only the shared TT table as the synching scheme.
It seems this scheme scales better than YBWC for
a high number of threads.
Verified for no regression at STC 3 threads
LLR: -2.95 (-2.94,2.94) [-3.00,1.00]
Total: 40232 W: 6908 L: 7130 D: 26194
Verified for no regression at LTC 3 threads
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 28186 W: 3908 L: 3798 D: 20480
Verified for no regression at STC 7 threads
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 3607 W: 674 L: 526 D: 2407
Verified for no regression at LTC 7 threads
LLR: 2.95 (-2.94,2.94) [-3.00,1.00]
Total: 4235 W: 671 L: 528 D: 3036
Tested with fixed games at LTC with 20 threads
ELO: 44.75 +-7.6 (95%) LOS: 100.0%
Total: 2069 W: 407 L: 142 D: 1520
Tested with fixed games at XLTC (120secs) with 20 threads
ELO: 28.01 +-6.7 (95%) LOS: 100.0%
Total: 2275 W: 349 L: 166 D: 1760
Original patch by mbootsector, with additional work
from Ivan Ivec (log formula), Joerg Oster (id loop
simplification) and Marco Costalba (assorted formatting
and rework).
Bench: 8116244
When changing 'search' and 'splitPointsSize' we have to
use thread locks, not split point ones, because can_join()
is called under the former.
Verified successfully with 24-hour torture tests on a 20-core
machine by Louis Zulli: it does not hang.
Verified for no regression with STC, 7 threads:
LLR: 2.94 (-2.94,2.94) [-3.00,1.00]
Total: 52804 W: 8159 L: 8087 D: 36558
No functional change.
Louis Zulli reported that Stockfish suffers from very occasional hangs on his 20-core machine.
Careful SMP debugging revealed that this was caused by "a ghost split point slave", where a thread
was marked as a split point slave but wasn't actually working on it.
The only logical explanation for this was double booking, where due to an SMP race, the same thread
is booked for two different split points simultaneously.
Due to the very intermittent nature of the problem, we can't say exactly how this happens.
The current handling of Thread-specific variables is risky though. Volatile variables are in some
cases changed without the spinlock being held. In this case the standard doesn't give us any
guarantees about how the updated values are propagated to other threads.
We resolve the situation by enforcing very strict locking rules:
- Values for key thread variables (splitPointsSize, activeSplitPoint, searching)
can only be changed when the thread specific spinlock is held.
- Structural changes for splitPoints[] are only allowed when the thread specific spinlock is held.
- Thread booking decisions (per split point) can only be done when the thread specific spinlock is held.
With these changes hangs no longer occurred during 2 days of torture testing on Zulli's machine.
We probably have a slight performance penalty in SMP mode due to the extra locking.
STC (7 threads):
ELO: -1.00 +-2.2 (95%) LOS: 18.4%
Total: 30000 W: 4538 L: 4624 D: 20838
However, stability is worth more than 1-2 ELO points in this case.
No functional change
Resolves#422
And reformat the time manager code a bit.
Note that we now set the search start time in think() and
no longer in ThreadPool::start_thinking(); the added delay
is less than 1 msec, so below the timer resolution (5 msec), and
should not affect the time losses ratio.
No functional change.
Currently if we call it more than once, we crash.
This is not a real problem, because this function is
indeed called just once. Nevertheless, with this small fix,
which gets rid of a hidden 'static' variable, we cleanly
resolve the issue.
While there, also fix ThreadPool::exit to leave a
consistent state. Now all the init() functions but
UCI::init() are reentrant and can be called multiple
times.
No functional change.
To sync the UI with the main thread, a single
condition variable is enough because here we have a single
producer / single consumer design pattern.
Two condition variables are strictly needed only for
the many producers / many consumers case.
Note that this is possible because we no longer send
idle threads to sleep while searching, so that now
only the UI can wake up the main thread and we can use the
same ConditionVariable for both threads.
The natural consequence is to retire wait_for_think_finished()
and move all the logic under the MainThread class, yielding the
rename of the function to join().
No functional change.