BadFish

mirror of https://github.com/sockspls/badfish synced 2025-07-15 13:29:14 +00:00

Author	SHA1	Message	Date
Marco Costalba	6fa83f5188	Merge branch 'eval_cache' Use an eval cache instead of TT to store node position evaluations. It is already an improvment and, because it frees two TT entry slots, paves the way to extend TT to store both upper and lower bounds. After 4855 games, single thread, 15"+0.05 Mod vs Orig 1165 -920 - 2770 ELO +18 bench: 5149248	2012-12-04 08:05:15 +01:00
Marco Costalba	ce248e7920	Increase MAX_THREADS to 64 And document why this is an hard limit. It seems for some (lucky) people 32 threads are not enough. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-12-04 07:57:15 +01:00
Marco Costalba	4e5d834e8e	Add eval cache infrastructure With this patch series we want to introduce a per-thread evaluation cache to store node evaluation and do not rely anymore on the TT table for this. This patch just introduces the infrastructure. No functional change.	2012-12-01 14:01:20 +01:00
Marco Costalba	1b6b711c44	Further rearrange search() No functional change.	2012-10-01 10:44:04 +02:00
Marco Costalba	ed0fb0b05f	Add support for node limited search Handle also the SMP case. This has been quite tricky, not trivial to enforce the node limit in SMP case becuase with "helpful master" concept we can have recursive split points and we cannot lock them all at once so there is the risk of counting the same nodes more than once. Anyhow this patch should be race free and counted nodes are correct. No functional change.	2012-09-30 10:19:22 +02:00
Marco Costalba	8dcb4bc3cc	Terminate threads before to exit main() It is very difficult and risky to assure that a running thread doesn't access a global variable. This is currently true, but could change in the future and we don't want to rely on code that works 'by accident'. The threads are still running when ThreadPool destructor is called (after main() returns) and this could lead to crashes if a thread accesses a global that has been already freed. The solution is to use an exit() function and call it while we are still in main(), ensuring global variables are still alive at threads termination time. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-08-29 19:11:44 +02:00
Marco Costalba	3df2c01b57	Correctly handle handover of setup states Before the search we setup the starting position doing all the moves (sent by GUI) from start position to the position just before to start searching. To do this we use a set of StateInfo records used by each do_move() call. These records shall be kept valid during all the search because repetition draw detection uses them to back track all the earlier positions keys. The problem is that, while searching, the GUI could send another 'position' command, this calls set_position() that clears the states! Of course a crash follows shortly. Before searching all the relevant parameters are copied in start_searching() just for this reason: to fully detach data accessed during the search from the UCI protocol handling. So the natural solution would be to copy also the setup states. Unfortunatly this approach does not work because StateInfo contains a pointer to the previous record, so naively copying and then freeing the original memory leads to a crash. That's why we use two std::auto_ptr (one belonging to UCI and another to Search) to safely transfer ownership of the StateInfo records to the search, after we have setup the root position. As a nice side-effect all the possible memory leaks are magically sorted out for us by std::auto_ptr semantic. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-08-27 19:17:02 +02:00
Marco Costalba	b6883c872d	Introduce struct Mutex and ConditionVariable To mimics C++11 std::mutex and std::condition_variable, also rename locks and condition variables to be more uniform across the classes. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-08-24 12:30:36 +01:00
Marco Costalba	7a2825053e	Use size_t as operator[] argument type This better mimics std::vector::operator[] and fixes a warning with MSVC 64bit. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-08-22 11:44:43 +01:00
Marco Costalba	dba1bc354a	Simplify idle_loop() signature We can detect the split point master also from within idle_loop, so we can call the function without parameters and remove an overloaded member hack in Thread class. Note that we don't need to take a lock around curSplitPoint when entering idle_loop() because if we are the master then curSplitPoint cannot change under our feet (because is_searching is set and so we cannot be reallocated), if we are a slave we enter idle_loop() only upon Thread creation and in that case is always splitPointsCnt == 0. This is true even in the very rare case that curSplitPoint != NULL, if we have been already allocated even before entering idle_loop(). No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-08-19 23:01:28 +01:00
Marco Costalba	4b19430103	Prefer size_t over int for array sizes Align to standard library conventions. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-08-19 11:01:46 +01:00
Marco Costalba	960a689769	Rename ThreadsManager to ThreadPool It is a more standard naming convention. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-06-24 09:45:37 +01:00
Marco Costalba	be3b8f3ae9	Retire "Active reparenting" After 6K games at 60" + 0.1 on QUAD with 4 threads this implementation fails to show a measurable increase, result is well within error bar. Perhaps with 8 or more threads resut is better but we don't have the hardware to test. So retire for now and in case re-add in the future if it proves good on big machines. The only good news is that we don't have a regression and implementation is stable and bug-free, so could be reused somewhere in the future. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-04-22 17:52:31 +01:00
Marco Costalba	ce159b16b9	Fix endless reaparenting loop The check for detecting when a split point has all the slaves still running is done with: slavesMask == allSlavesMask When a thread reparents, slavesMask is increased, then, if the same thread finishes, because there are no more moves, slavesMask returns to original value and the above condition returns to be true. So that the just finished thread immediately reparents again with the same split point, then starts and then immediately exits in a tight loop that ends only when a second slave finishes, so that slavesMask decrements and the condition becomes false. This gives a spurious and anomaly high number of faked reparents. With this patch, that rewrites the logic to avoid this pitfall, the reparenting success rate drops to a more realistical 5-10% for 4 threads case. As a side effect note that now there is no more the limit of maxThreadsPerSplitPoint when reparenting. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-04-17 18:51:49 +01:00
Marco Costalba	44432f67d7	Active Reparenting In Young Brothers Wait Concept (YBWC) available slaves are booked by the split point master, then start to search below the assigned split point and, once finished, return in idle state waiting to be booked by another master. This patch introduces "Active Reparenting" so that when a slave finishes its job on the assigned split point, instead of passively waiting to be booked, searches a suitable active split point and reprents itselfs to that split point. Then immediately starts to search below the split point in exactly the same way of the others split point's slaves. This reduces to zero the time waiting in idle loop and should increase scalability especially whit many (8 or more) cores. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-04-10 18:22:58 +01:00
Marco Costalba	c2fc80e5d1	Revert thread_local stuff Unfortunatly accessing thread local variable is much slower than object data (see previous patch log msg), so we have to revert to old code to avoid speed regression. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-04-06 18:47:55 +01:00
Marco Costalba	b1f57e92ce	Use thread_local compiler specifics Much faster then pthread_getspecific() but still a speed regression against the original code. Following are the nps on a bench: Position 454165 454838 455433 tls 441046 442767 442767 ms (Win) 450521 447510 451105 ms (pthread) 422115 422115 424276 Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-04-06 18:03:15 +01:00
Marco Costalba	e1919384a2	Don't store Thread info in Position But use the newly introduced local storage for this. A good code semplification and also the correct way to go. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-04-06 14:36:45 +01:00
Marco Costalba	699f700162	Introduce thread local storage Use thread local storage to store a pointer to the thread we are running on. This will allow to remove thread info from Position class. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-04-06 14:36:39 +01:00
Marco Costalba	673bc5526f	Use a Thread instead of an array index No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-04-04 12:12:08 +01:00
Marco Costalba	7eb6a488ad	Use a std::vector to store searchMoves A std::set (that is a rb_tree) seems really overkill to store at most a handful of moves and nothing in the common case. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-04-01 12:45:43 +01:00
Marco Costalba	304deb5e83	Rename Materials and Pawns hash stuff No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-03-31 11:59:23 +01:00
Marco Costalba	3d0d0237c5	Simplify start_searching() signature Retire the "sync" behaviour that now is up to the caller to honour. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-03-26 18:59:01 +01:00
Marco Costalba	3aa471f2a9	Introduce and use wait_for_search_finished() Helper function that allows us to simplify the code. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-03-26 18:22:41 +01:00
Marco Costalba	32d3a07c67	Move ThreadsManager::exit() to d'tor And add final touches to this long patch series. All the series has been verified against regression with 20K games at fast TC. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-03-26 08:18:17 +01:00
Marco Costalba	c483ffc773	Try to mimic std::thread API No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-03-25 12:43:19 +01:00
Marco Costalba	41561c9bb8	Use std::vector<Thread*> to store threads We store pointers instead of Thread objects because Thread is not copy-constructible nor copy-assignable and default ones are not suitable. So we cannot store directly in a std::vector. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-03-25 10:23:52 +01:00
Marco Costalba	553655eb07	Refactor Thread class Associate platform OS thread to the Thread class instead of creating it from ThreadsManager. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-03-25 10:23:51 +01:00
Marco Costalba	f01b53c374	Refactor ThreadsManager::set_size() functionality Split the data allocation, now done (mostly once) in read_uci_options(), from the wake up and sleeping of the slave threads upon entering/exiting the search. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-03-25 10:23:49 +01:00
Marco Costalba	b356e0fae3	Rename lock.h to platform.h And move some more platform specific code there. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-03-24 10:05:17 +01:00
Marco Costalba	2608b9249d	Retire ss->bestMove And introduce SPlitPoint bestMove to pass back the best move after a split point. This allow to define as const the search stack passed to split. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-02-21 20:31:22 +01:00
Marco Costalba	6088ac2108	Small renaming in Thread struct Should be a bit more clear the meaning of the single variables. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-02-18 10:57:00 +01:00
Marco Costalba	b1cf1acb93	Move wait_for_stop_or_ponderhit() under Thread This method belongs to Thread, not to ThreadsManager. Reshuffle stuff in thread.cpp while there. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-02-03 17:33:09 +01:00
Marco Costalba	c94cfebb7e	Reduce lock contention in idle_loop Release split point lock before to wake up master thread. This seems to increase speed in case "sleeping threads" are used: After 7792 games with 4 threads at very fast TC (2"+0.05) Mod vs Orig 1722 - 1627 - 4443 ELO +4 (+- 5.1) No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-02-03 13:43:58 +01:00
Marco Costalba	51e8efdab5	Fix subtle race with slave allocation When allocating a slave we set both is_searching and splitPoint under lock protection. Unfortunatly the order in which the variables are set is not defined. This article was very clarifying: http://software.intel.com/en-us/blogs/2007/11/30/volatile-almost-useless-for-multi-threaded-programming/ So when in idle loop we test for is_searching and then access splitPoint, it could happen that splitPoint is still not updated leading to a possible crash. Fix the race lock protecting splitPoint access. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-01-31 20:19:25 +01:00
Marco Costalba	7fb6fd2f55	Reformat threads code Apart from some renaming the biggest change is the retire of split_point_finished() replaced by slavesMask flags. As a side effect we now take also split point lock when allocation available threads. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-01-27 19:24:22 +01:00
Marco Costalba	3d937e1e90	Simplify locking usage pass references (Windows style) instead of pointers (Posix style) as function arguments. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-01-23 20:30:19 +01:00
Justin Blanchard	2a21543c88	Remove unused #include lines	2012-01-19 00:48:53 +08:00
Marco Costalba	103b368ab7	Move struct RootMove to Search namespace And directly pass RootMoves instead of SearchMoves to main thread. A class declaration is better suited in a header and slims a bit the fatty search.cpp No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-01-14 14:22:34 +01:00
Marco Costalba	a29dd88f75	Use a set to store SearchMoves We just need to verify if a legal move is among the SearchMoves, so we don't need a vector for this. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-01-14 13:06:01 +01:00
Marco Costalba	8307da0de7	Update copyright year to 2012 And refresh Readme.txt while there. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2011-12-30 13:52:16 +01:00
Marco Costalba	9cb187762a	Wait for main thread to finish before to exit Currently after a 'quit' command UI thread raises stop signal, exits from uci_loop() and calls Threads.exit() while the search threads are still active. In Threads.exit() main thread is asked to terminate, but if it is parked in idle_loop() it will exit and free its resources (in particular the shared Movepicker object) while sibling slaves are still active and this leads to a crash. The fix is to let the UI thread always wait for main thread to finish the search before to return from uci_loop(). Found by Valgrind when running with 8 threads. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2011-12-29 10:33:06 +01:00
Marco Costalba	5b8ca1eee7	Move SearchStack under Search namespace No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2011-12-04 12:03:59 +01:00
Marco Costalba	0f7cbaca75	Tidy up comments in thread.cpp No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2011-12-03 09:58:46 +01:00
Marco Costalba	bb3427ca85	Detach search arguments from UI thread Detach from the UI thread the input arguments used by the search threads so that the UI thread is able to receive and process any command sent by the GUI while other threads keep searching. With this patch there is no more need to block the UI thread after a "stop", so it is a more reliable and robust solution than the previous patch. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2011-11-27 17:46:18 +01:00
Marco Costalba	6809b57cfc	After a "stop" do not read new input until search finishes Unfortunatly xboard sends immediately the new position to search after sending "stop" when we have a ponder miss. Becuase main thread position is not copied but is referenced directly from root position and the latter is modified by the "position.." UCI command we end up with the working position that changes under our feet while the search is still recovering after the "stop" and this causes a crash. This happens only with the (broken) xboard, native UCI does not have this problem. Reported by otello1984 No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2011-11-27 12:19:33 +01:00
Marco Costalba	ed04c010eb	Rewrite async I/O Use the starting thread to wait for GUI input and instead use the other threads to search. The consequence is that now think() is alwasy started on a differnt thread than the caller that returns immediately waiting for input. This reformat greatly simplifies the code and is more in line with the common way to implement this feature. As a side effect now we don't need anymore Makefile tricks with sleep() to allow profile builds. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2011-11-26 11:51:12 +01:00
Marco Costalba	43204d9ac2	Reformat all_slaves_finished() Rename and move under ThreadsManager class. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2011-11-06 13:45:47 +01:00
Marco Costalba	d58176bfea	Use a timer to avoid polling The timer will be fired asynchronously to handle time management flags, while other threads are searching. This implementation uses a thread waiting on a timed condition variable instead of real timers. This approach allow to reduce platform dependant code to a minimum and also is the most portable given that timers libraries are very different among platforms and also the best ones are not compatible with olds Windows. Also retire the now unused polling code. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2011-11-05 18:19:38 +01:00
Marco Costalba	2617aa415e	Rewrite how commands from GUI are read Instead of polling for input use a dedicated listener thread to read commands from the GUI independently from other threads. To do this properly we have to delegate to the listener all the reading from the GUI: while searching but also while waiting for a command, like in std::getline(). So we have two possible behaviours: in-sync mode, in which the thread mimics std::getline() and the caller blocks until something is read from GUI, and async mode where the listener continuously reads and processes GUI commands while other threads are searching. No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2011-11-05 08:35:17 +01:00

1 2 3

118 commits