BadFish

mirror of https://github.com/sockspls/badfish synced 2025-04-30 16:53:09 +00:00

Author	SHA1	Message	Date
Joost VandeVondele	8a9d269855	Merge remote-tracking branch 'upstream/master' into clusterMergeMaster7	2019-10-20 09:20:34 +02:00
xoto10	12d58adc68	Remove uithread With the current questions and issues around threading, I had a look at https://github.com/official-stockfish/Stockfish/issues/2299. It seems there was a problem with data races when requesting eval via UCI while a search was already running. To fix this an extra thread uithread was created, presumably to avoid an overlap with Threads.main() that was causing problems. Making this eval request seems to be outside the scope of UCI, and @vondele also reports that the data race is not even fixed reliably by this change. I suggest we simplify the threading here by removing this uithread and adding a comment signaling that user should not request eval when a search is already running. Closes https://github.com/official-stockfish/Stockfish/pull/2310 No functional change.	2019-10-20 00:27:17 +02:00
Stéphane Nicolet	8726beba59	Restore development version (revert previous commit) Revert the previous patch now that the binary for the super-final of TCEC season 16 has been sent. Maybe the feature of showing the name of compiler will be added to the master branch in the future. But we may use a cleaner way to code it, see some ideas using the Makefile approach at the end of pull request #2327 : https://github.com/official-stockfish/Stockfish/pull/2327 Bench: 3618154	2019-09-26 23:27:48 +02:00
Stéphane Nicolet	0436f01d05	Temporary patch to show the compiler for TCEC submission This patch shows a description of the compiler used to compile Stockfish, when starting from the console. Usage: ``` ./stockfish compiler ``` Example of output: ``` Stockfish 240919 64 POPCNT by T. Romstad, M. Costalba, J. Kiiski, G. Linscott Compiled by clang++ 9.0.0 on Apple __VERSION__ macro expands to: 4.2.1 Compatible Apple LLVM 9.0.0 (clang-900.0.38) ``` No functional change	2019-09-25 22:28:51 +02:00
Joost VandeVondele	0fd0e4e849	Merge branch 'master' into clusterMergeMaster6	2019-07-01 16:36:58 +02:00
Jörg Oster	c9d73d1aa5	Try to get a more precise bench time (#2211 ) Initialization of larger hash sizes can take some time. Don't include this time in the bench by resetting the timer after Search::clear(). Also move 'ucinewgame' command down in the list, so that it is processed after the configuration of Threads and Hash size. No functional change.	2019-06-30 15:16:20 +02:00
Joost VandeVondele	982880bd70	Merge remote-tracking branch 'upstream/master' into clusterMergeMaster4	2019-02-17 13:18:08 +01:00
Joost VandeVondele	58d3ee6175	Simplify pondering time management (#1899 ) stopOnPonderhit is used to stop search quickly on a ponderhit. It is set by mainThread as part of its time management. However, master employs it as a signal between mainThread and the UCI thread. This is not necessary, it is sufficient for the UCI thread to signal that pondering finished, and mainThread should do its usual time-keeping job, and in this case stop immediately. This patch implements this, removing stopOnPonderHit as an atomic variable from the ThreadPool, and moving it as a normal variable to mainThread, reducing its scope. In MainThread::check_time() the search is stopped immediately if ponder switches to false, and the variable stopOnPonderHit is set. Furthermore, ponder has been moved to mainThread, as the variable is only used to exchange signals between the UCI thread and mainThread. The version has been tested locally (as fishtest doesn't support ponder): Score of ponderSimp vs master: 2616 - 2528 - 8630 [0.503] 13774 Elo difference: 2.22 +/- 3.54 which indicates no regression. No functional change.	2019-01-20 19:14:24 +01:00
Joost VandeVondele	87f0fa55a0	[cluster] keep track of node counts cluster-wide. This generalizes exchange of signals between the ranks using a non-blocking all-reduce. It is now used for the stop signal and the node count, but should be easily generalizable (TB hits, and ponder still missing). It avoids having long-lived outstanding non-blocking collectives (removes an early posted Ibarrier). A bit too short a test, but not worse than before: Score of new-r4-1t vs old-r4-1t: 459 - 401 - 1505 [0.512] 2365 Elo difference: 8.52 +/- 8.43	2018-12-29 15:34:57 +01:00
Joost VandeVondele	e526c5aa52	[cluster] Make bench compatible Fix one TODO. Takes care of output from bench. Sum nodes over ranks.	2018-12-29 15:34:56 +01:00
Joost VandeVondele	54a0a228f6	[cluster] Some formatting cleanup standarize whitespace a bit. Also adds two TODOs for follow up work. No functional change.	2018-12-29 15:34:56 +01:00
Omri Mor	29c166a072	MPI/Cluster implementation for Stockfish Based on Peter Österlund's "Lazy Cluster" algorithm, but with some simplifications. To compile, point COMPCXX to the MPI C++ compiler wrapper (mpicxx).	2018-12-29 15:34:55 +01:00
Stéphane Nicolet	cf5d683408	Stockfish 10-beta Preparation commit for the upcoming Stockfish 10 version, giving a chance to catch last minute feature bugs and evaluation regression during the one-week code freeze period. Also changing the copyright dates to include 2019. No functional change	2018-11-19 11:18:21 +01:00
DU-jdto	ed26d71354	Fix a few minor code style inconsistencies No functional change.	2018-03-21 01:35:05 +01:00
Ronald de Man	860223c5e6	Fix gcc PGO build on Windows This fixes the issue #1375 of the PGO builds failing under Windows: https://github.com/official-stockfish/Stockfish/issues/1375 Solution found during this discussion in the fishcooking forum: https://groups.google.com/forum/?fromgroups=#!topic/fishcooking/RjIPgeFFLPQ Closes #1408. No functional change.	2018-02-12 23:14:33 +01:00
Joost VandeVondele	9afa1d7330	New Year 2018 Adjust copyright headers. No functional change.	2018-01-01 13:18:10 +01:00
Marco Costalba	fa5b0936ee	Wide bench coverage Add tests for: - Positions with move list - Chess960 positions Now bench covers almost all cases, only few endgames are still out of reach (verified with lcov) It is a non functionality patch, but bench changed because we added new test positions. bench: 5479946	2017-08-19 13:58:12 +02:00
Marco Costalba	45e254a0a0	Restore perft Rewrite perft to be placed naturally inside new bench code. In particular we don't have special custom code to run perft anymore but perft is just a new parameter of 'go' command. So user API is now changed, old style command: $perft 5 becomes $go perft 4 No functional change.	2017-08-18 09:04:38 -07:00
Marco Costalba	444d99b6d2	Rewrite benchmark First step in improving bench to handle arbitrary UCI commands so to test many more code paths. This first patch just set the new code structure. No functional change.	2017-08-18 09:04:38 -07:00
Marco Costalba	bdeda52efd	Retire States global variable And other small touches in uci.cpp No functional change.	2017-08-12 23:54:48 -07:00
Joost VandeVondele	2783203428	Simplify finished search in ponder/infinite mode. In this rare case (e.g. go infinite on a stalemate), just spin till ponderhit/stop comes. The Thread::wait() is a renmant of the old YBWC code, today with lazy SMP, threads don't need to wait when outside of their idle loop. No functional change.	2017-08-10 22:42:52 -07:00
Marco Costalba	66c5eaebd8	Re-apply the fix for Limits::ponder race But this time correctly set Threads.ponder We avoid using 'limits' for passing pondering flag because we don't want to have 2 ponder variables in search scope: Search::Limits.ponder and Threads.ponder. This would be confusing also because limits.ponder is set at the beginning of the search and never changes, instead Threads.ponder can change value asynchronously during search. No functional change.	2017-08-10 12:47:31 -07:00
Marco Costalba	44236f4ed9	Revert "Fix a race on Limits::ponder" This reverts commit `5410424e3d`. After the commit pondering is broken, so revert for now. I will resubmit with a proper fix. The issue is mine, Joost original code is correct. No functional change.	2017-08-10 10:59:38 -07:00
Joost VandeVondele	5410424e3d	Fix a race on Limits::ponder Limits::ponder was used as a signal between uci and search threads, but is not an atomic variable, leading to the following race as flagged by a sanitized binary. Expect input: ``` spawn ./stockfish send "uci\n" expect "uciok" send "setoption name Ponder value true\n" send "go wtime 4000 btime 4000\n" expect "bestmove" send "position startpos e2e4 d7d5\n" send "go wtime 4000 btime 4000 ponder\n" sleep 0.01 send "ponderhit\n" expect "bestmove" send "quit\n" expect eof ``` Race: ``` WARNING: ThreadSanitizer: data race (pid=7191) Read of size 4 at 0x0000005c2260 by thread T1: Previous write of size 4 at 0x0000005c2260 by main thread: Location is global 'Search::Limits' of size 88 at 0x0000005c2220 (stockfish+0x0000005c2260) ``` The reason of teh race is that ponder is not just set in UCI go() assignment but also is signaled by an async ponderhit in uci.cpp: else if (token == "ponderhit") Search::Limits.ponder = 0; // Switch to normal search The fix is to add an atomic bool to the threads structure to signal the ponder status, letting Search::Limits to reflect just what was passed to 'go'. No functional change.	2017-08-10 10:46:46 -07:00
Marco Costalba	750dfa0521	Fix some races and clarify the code Better split code that should be run at startup from code run at ucinewgame. Also fix several races when 'bench', 'perft' and 'ucinewgame' are sent just after 'bestomve' from the engine threads are still running. Also use a specific UI thread instead of main thread when setting up the Position object used by UI uci loop. This fixes a race when sending 'eval' command while searching. We accept a race on 'setoption' to allow the GUI to change an option while engine is searching withouth stalling the pipe. Note that changing an option while searchingg is anyhow not mandated by UCI protocol. No functional change.	2017-08-10 10:19:56 -07:00
Joost VandeVondele	36a93d90f7	Move stop signal to Threads Instead of having Signals in the search namespace, make the stop variables part of the Threads structure. This moves more of the shared (atomic) variables towards the thread-related structures, making their role more clear. No functional change Closes #1149	2017-07-13 16:08:37 -07:00
Joost VandeVondele	7edd1f7ccd	Execute an implied ucinewgame at startup execute an implied ucinewgame upon entering the UCI::loop, to make sure that searches starting with and without an (optional) ucinewgame command yield the same search. This is needed now that seach::clear() initializes tables to non-zero default values. No functional change Closes #1101 Closes #1104	2017-05-15 18:54:13 -07:00
joergoster	8b15961349	Fix multiPV issue #502 In general, this patch handles the cases where we don't have a valid score for each PV line in a multiPV search. This can happen if the search has been stopped in an unfortunate moment while still in the aspiration loop. The patch consists of two parts. Part 1: The new PVIdx was already part of the k-best pv's in the last iteration, and we therefore have a valid pv and score to output from the last iteration. This is taken care of with: bool updated = (i <= PVIdx && rootMoves[i].score != -VALUE_INFINITE); Case 2: The new PVIdx was NOT part of the k-best pv's in the last iteration, and we have no valid pv and score to output. Not from the current nor from the previous iteration. To avoid this, we are now also considering the previous score when sorting, so that the PV lines with no actual but with a valid previous score are pushed up again, and the previous score can be displayed. bool operator<(const RootMove& m) const { return m.score != score ? m.score < score : m.previousScore < previousScore; } // Descending sort I also added an assertion in UCI::value() to possibly catch similar issues earlier. No functional change. Closes #502 Closes #1074	2017-05-03 19:46:40 -07:00
Joost VandeVondele	d8f683760c	Adjust copyright headers to 2017 (#965 ) No functional change.	2017-01-11 08:46:29 +01:00
Aram Tumanian	e6c2899020	Make a version of Position::do_move() without the givesCheck parameter In 10 of 12 calls total to Position::do_move()the givesCheck argument is simply gives_check(m). So it's reasonable to make an overload without this parameter, which wraps the existing version. No functional change.	2016-11-12 09:55:12 +01:00
Marco Costalba	c0bb041539	Rewrite syzygy in C++ Rewrite the code in SF style, simplify and document it. Code is now much clear and bug free (no mem-leaks and other small issues) and is also smaller (more than 600 lines of code removed). All the code has been rewritten but root_probe() and root_probe_wdl() that are completely misplaced and should be retired altogheter. For now just leave them in the original version. Code is fully and deeply tested for equivalency both in functionality and in speed with hundreds of games and test positions and is guaranteed to be 100% equivalent to the original. Tested with tb_dbg branch for functional equivalency on more than 12M positions. stockfish.exe bench 128 1 16 syzygy.epd Position: 2016/2016 Total 12121156 Hits 0 hit rate (%) 0 Total time (ms) : 4417851 Nodes searched : 1100151204 Nodes/second : 249024 Tested with 5,000 games match against master, 1 Thread, 128 MB Hash each, tc 40+0.4, which is almost equivalent to LTC in Fishtest on this machine. 3-, 4- and 5-men syzygy bases on SSD, 12-moves opening book to emphasize mid- and endgame. Score of SF-SyzygyC++ vs SF-Master: 633 - 617 - 3750 [0.502] 5000 ELO difference: 1 No functional change.	2016-11-05 07:55:08 +01:00
Stéphane Nicolet	805afcbf3d	Move CheckInfo under StateInfo This greately simplifies usage because hides to the search the implementation specific CheckInfo. This is based on the work done by Marco in pull request #716, implementing on top of it the ideas in the discussion: caching the calls to slider_blockers() in the CheckInfo structure, and simplifying the slider_blockers() function by removing its first parameter. Compared to master, bench is identical but the number of calls to slider_blockers() during bench goes down from 22461515 to 18853422, hopefully being a little bit faster overall. archlinux, gcc-6 make profile-build ARCH=x86-64-bmi2 50 runs each bench: base = 2356320 +/- 981 test = 2403811 +/- 981 diff = 47490 +/- 1828 speedup = 0.0202 P(speedup > 0) = 1.0000 perft 6: base = 175498484 +/- 429925 test = 183997959 +/- 429925 diff = 8499474 +/- 469401 speedup = 0.0484 P(speedup > 0) = 1.0000 perft 7 (but only 10 runs): base = 185403228 +/- 468705 test = 188777591 +/- 468705 diff = 3374363 +/- 476687 speedup = 0.0182 P(speedup > 0) = 1.0000 $ ./pyshbench ../Stockfish/master ../Stockfish/test 20 run base test diff ... base = 2501728 +/- 182034 test = 2532997 +/- 182034 diff = 31268 +/- 5116 speedup = 0.0125 P(speedup > 0) = 1.0000 No functional change.	2016-08-27 09:53:26 +02:00
Marco Costalba	ca14345ba2	Filter root moves filter before copy to threads Currently root moves are copied to all teh threads but are DTZ filtered only in main thread at the beginning of teh search. This patch moves the TB filtering before the copy of root moves fixing issue #679 https://github.com/official-stockfish/Stockfish/issues/679 No bench change.	2016-06-11 09:24:40 +02:00
Marco Costalba	94e41274bb	Fix incorrect draw detection In this position we should have draw for repetition: position fen rnbqkbnr/2pppppp/8/8/8/8/PPPPPPPP/RNBQKBNR w KQkq - 0 1 moves g1f3 g8f6 f3g1 go infinite But latest patch broke it. Actually we had two(!) very subtle bugs, the first is that Position::set() clears the passed state and in particular 'previous' member, so that on passing setupStates, 'previous' pointer was reset. Second bug is even more subtle: SetupStates was based on std::vector as container, but when vector grows, std::vector copies all its contents to a new location invalidating all references to its entries. Because all StateInfo records are linked by 'previous' pointer, this made pointers go stale upon adding more element to setupStates. So revert to use a std::deque that ensures references are preserved when pushing back new elements. No functional change.	2016-04-18 00:13:16 +02:00
Marco Costalba	7eaea3848c	StateInfo is usually allocated on the stack by search() And passed in do_move(), this ensures maximum efficiency and speed and at the same time unlimited move numbers. The draw back is that to handle Position init we need to reserve a StateInfo inside Position itself and use at init time and when copying from another Position. After lazy SMP we don't need anymore this gimmick and we can get rid of this special case and always pass an external StateInfo to Position object. Also rewritten and simplified Position constructors. Verified it does not regress with a 3 threads SMP test: ELO: -0.00 +-12.7 (95%) LOS: 50.0% Total: 1000 W: 173 L: 173 D: 654 No functional change.	2016-04-17 08:29:33 +02:00
ppigazzini	d4af15f682	Update AUTHORS and copyright notice No functional change Resolves #555	2016-01-02 09:43:51 +00:00
Marco Costalba	9742fb10fd	Update Copyright year No functional change. Resolves #554	2016-01-01 10:17:36 +00:00
Marco Costalba	93195555ed	Rewrite how threads are spawned Instead of creating a running std::thread and returning, wait in Thread c'tor that the native thread of execution goes to sleep in idle_loop(). In this way we can simplify how search is started, because when main thread is idle we are sure also all other threads will be idle, in any case, even at thread creation and startup. After lazy smp went in, we can simpify and rewrite a lot of logic that is now no more needed. This is hopefully the final big cleanup. Tested for no regression at 5+0.1 with 3 threads: LLR: 2.95 (-2.94,2.94) [-5.00,0.00] Total: 17411 W: 3248 L: 3198 D: 10965 No functional change.	2015-11-21 07:48:50 +01:00
Marco Costalba	86f04dbcc0	Assorted trivia in search.cpp The only interesting change is the moving of stack[MAX_PLY+4] back to its original position in id_loop (now renamed Thread::search). No functional change.	2015-10-31 19:26:35 +01:00
mbootsector	ecc5ff6693	Lazy SMP Start all threads searching on root position and use only the shared TT table as synching scheme. It seems this scheme scales better than YBWC for high number of threads. Verified for nor regression at STC 3 threads LLR: -2.95 (-2.94,2.94) [-3.00,1.00] Total: 40232 W: 6908 L: 7130 D: 26194 Verified for nor regression at LTC 3 threads LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 28186 W: 3908 L: 3798 D: 20480 Verified for nor regression at STC 7 threads LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 3607 W: 674 L: 526 D: 2407 Verified for nor regression at LTC 7 threads LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 4235 W: 671 L: 528 D: 3036 Tested with fixed games at LTC with 20 threads ELO: 44.75 +-7.6 (95%) LOS: 100.0% Total: 2069 W: 407 L: 142 D: 1520 Tested with fixed games at XLTC (120secs) with 20 threads ELO: 28.01 +-6.7 (95%) LOS: 100.0% Total: 2275 W: 349 L: 166 D: 1760 Original patch of mbootsector, with additional work from Ivan Ivec (log formula), Joerg Oster (id loop simplification) and Marco Costalba (assorted formatting and rework). Bench: 8116244	2015-10-20 06:58:08 +02:00
Marco Costalba	ca38358574	Run PVS-STUDIO analyzer Fix issues after a run of PVS-STUDIO analyzer. Mainly false positives but warnings are anyhow useful to point out not very readable code. Noteworthy is the memset() one, where PVS prefers ss-2 instead of stack. This is because memeset() could be optimized away by the compiler when using 'stack', due to stack being a local variable no more used after memset. This should normally not happen, but when it happens it leads to very sublte and difficult to find bug, so better to be safe than sorry. No functional change.	2015-10-05 09:13:33 +02:00
lucasart	2e86d1febc	Restore deterministic search state Introduce helper function Search::reset() which clears all kind of search memory, in order to restore a deterministic search state. Generalize TT.clear() into Search::reset() for the following use cases: - bench: needed to guarantee deterministic bench (ie. if you call bench from interactive command line twice in a row you get the same value). - Clear Hash: restore clean search state, which is the purpose of this button. - ucinewgame: ditto. No functional change. Resolves #346	2015-05-07 14:20:32 -07:00
Marco Costalba	926f215061	Add support for playing in 'nodes as time' mode When running more games in parallel, or simply when running a game with a background process, due to how OS scheduling works, there is no guarantee that the CPU resources allocated evenly between the two players. This introduces noise in the result that leads to unreliable result and in the worst cases can even invalidate the result. For instance in SF test framework we avoid running from clouds virtual machines because are a known source of very unstable CPU speed. To overcome this issue, without requiring changes to the GUI, the idea is to use searched nodes instead of time, and to convert time to available nodes upfront, at the beginning of the game. When nodestime UCI option is set at a given nodes per milliseconds (npmsec), at the beginning of the game (and only once), the engine reads the available time to think, sent by the GUI with 'go wtime x' UCI command. Then it translates time in available nodes (nodes = npmsec * x), then feeds available nodes instead of time to the time management logic and starts the search. During the search the engine checks the searched nodes against the available ones in such a way that all the time management logic still fully applies, and the game mimics a real one played on real time. When the search finishes, before returning best move, the total available nodes are updated, subtracting the real searched nodes. After the first move, the time information sent by the GUI is ignored, and the engine fully relies on the updated total available nodes to feed time management. To avoid time losses, the speed of the engine (npms) must be set to a value lower than real speed so that if the real TC is for instance 30 secs, and npms is half of the real speed, the game will last on average 15 secs, so much less than the TC limit, providing for a safety 'time buffer'. There are 2 main limitations with this mode. 1. Engine speed should be the same for both players, and this limits the approach to mainly parameter tuning patches. 2. Because npms is fixed while, in real engines, the speed increases toward endgame, this introduces an artifact that is equivalent to an altered time management. Namely it is like the time management gives less available time than what should be in standard case. May be the second limitation could be mitigated in a future with a smarter 'dynamic npms' approach. Tests shows that the standard deviation of the results with 'nodestime' is lower than in standard TC, as is expected because now all the introduced noise due the random speed variability of the engines during the game is fully removed. Original NIT idea by Michael Hoffman that shows how to play in NIT mode without requiring changes to the GUI. This implementation goes a bit further, the key difference is that we read TC from GUI only once upfront instead of re-reading after every move as in Michael's implementation. No functional change.	2015-04-03 04:40:55 +02:00
Marco Costalba	be77406a55	Get rid of nativeThread No functional change.	2015-03-23 09:02:52 +01:00
Marco Costalba	26dabb1e6b	Use only one ConditionVariable to sync UI To sync UI with main thread it is enough a single condition variable because here we have a single producer / single consumer design pattern. Two condition variables are strictly needed just for many producers / many consumers case. Note that this is possible because now we don't send to sleep idle threads anymore while searching, so that now only UI can wake up the main thread and we can use the same ConditionVariable for both threads. The natural consequence is to retire wait_for_think_finished() and move all the logic under MainThread class, yielding the rename of teh function to join() No functional change.	2015-03-21 07:55:33 +01:00
Marco Costalba	686b45e121	Retire one do_move() overload After Lucas patch it is almost useless. No functional change.	2015-02-15 12:23:03 +01:00
Marco Costalba	65f46794af	Implicit conversion from ExtMove to Move Verified with perft there is no speed regression, and code is simpler. It is also conceptually correct becuase an extended move is just a move that happens to have also a score. No functional change.	2015-01-31 19:22:07 +01:00
Marco Costalba	fe4fda474f	Fun with initializer lists in UCI::square No functional change.	2015-01-18 11:04:51 +01:00
Marco Costalba	f53aea45e3	Add syzygy support bench: 8080602	2015-01-18 08:27:46 +01:00
Marco Costalba	3c07603dac	Import C++11 branch Import C++11 branch from: https://github.com/mcostalba/Stockfish/tree/c++11 The version imported is teh last one as of today: `6670e93e50` Branch is fully equivalent with master but syzygy tablebases that are missing (but will be added with next commit). bench: 8080602	2015-01-18 08:00:50 +01:00

1 2 3 4 5

236 commits