BadFish

mirror of https://github.com/sockspls/badfish synced 2025-04-30 16:53:09 +00:00

Author	SHA1	Message	Date
Marco Costalba	578b21bbee	Split PSQT init from Position init Easier for tuning psq tables: TUNE(myParameters, PSQT::init); Also move PSQT code into a new *.cpp file, and retire the old and hacky psqtab.h that had to be included only once to work correctly, which is not idiomatic for a header file. Give wide visibility to psq tables (previously visible only in position.cpp); this will ease the use of psq tables outside Position, for instance in move ordering. Finally, trivial code style fixes of the latest patches. Original patch by Lucas Braesch. No functional change.	2015-05-03 20:07:15 +02:00
VoyagerOne	b7063ef65b	Change extra ply LMR condition to: cmh <= 0 && hist < 0 Extra ply LMR condition is now cmh <= 0 && h < 0 Instead of cmh + h < 0 STC: LLR: 2.96 (-2.94,2.94) [-1.50,4.50] Total: 55210 W: 10812 L: 10557 D: 33841 LTC: LLR: 2.95 (-2.94,2.94) [0.00,4.00] Total: 13212 W: 2239 L: 2045 D: 8928 Bench: 8420865 Resolves #339	2015-04-26 20:12:25 +01:00
lucasart	6c040c821a	Retire FORCE_INLINE No speed regression on my machine (i7-3770k, gcc 4.9.1, linux 3.16): stat test master diff mean 2,482,415 2,474,987 7,906 stdev 4,603 5,644 2,497 speedup 0.32% P(speedup>0) 100.0% Fishtest 9+0.03: ELO: 0.26 +-1.8 (95%) LOS: 61.2% Total: 60000 W: 12437 L: 12392 D: 35171 No functional change. Resolves #334	2015-04-15 21:21:45 +01:00
VoyagerOne	20e92895af	Removed extra condition (history < 0) in LMR to help sync up with move ordering. LMR condition is now cmh+history<0 Instead of history<0 OR cmh+history<0 STC: LLR: 2.96 (-2.94,2.94) [-3.00, 1.00] Total: 26446 W: 5092 L: 4980 D: 16374 LTC: LLR: 2.96 (-2.94,2.94) [-3.00, 1.00] Total: 14129 W: 2340 L: 2209 D: 9580 Bench: 7815183 Resolves #331	2015-04-12 20:05:59 +01:00
Marco Costalba	fb03188fc7	Assorted cleanup of last patches No functional change.	2015-04-11 23:24:43 +02:00
Stefan Geschwentner	27efc5ac99	Update stats at pv nodes If a quiet best move is found at a pv node then always update stats. STC: LLR: 2.96 (-2.94,2.94) [-1.50,4.50] Total: 41485 W: 8047 L: 7830 D: 25608 LTC: LLR: 2.96 (-2.94,2.94) [0.00,6.00] Total: 14351 W: 2420 L: 2250 D: 9681 Bench: 6985247 Resolves #330	2015-04-10 20:34:55 +01:00
Stefan Geschwentner	ef4d89c9bd	update stats also in check Update stats also if in check (drop condition). STC: LLR: 3.22 (-2.94,2.94) [-3.00,1.00] Total: 87472 W: 16929 L: 16913 D: 53630 LTC: LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 39971 W: 6436 L: 6345 D: 27190 Bench: 7086031 Resolves #327	2015-04-09 20:41:08 +01:00
lucasart	aaf17326e2	Prune evasions when we can castle A minor simplification. STC: LLR: 2.95 (-2.94,2.94) [-3.50,0.50] Total: 67877 W: 12882 L: 12904 D: 42091 STC: LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 20677 W: 4023 L: 3901 D: 12753 LTC: LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 12221 W: 2022 L: 1888 D: 8311 Bench: 7911336 Resolves #326	2015-04-09 20:34:06 +01:00
Marco Costalba	926f215061	Add support for playing in 'nodes as time' mode When running more games in parallel, or simply when running a game with a background process, due to how OS scheduling works, there is no guarantee that the CPU resources are allocated evenly between the two players. This introduces noise in the result that leads to unreliable results and in the worst cases can even invalidate the result. For instance in SF test framework we avoid running from cloud virtual machines because they are a known source of very unstable CPU speed. To overcome this issue, without requiring changes to the GUI, the idea is to use searched nodes instead of time, and to convert time to available nodes upfront, at the beginning of the game. When nodestime UCI option is set at a given nodes per milliseconds (npmsec), at the beginning of the game (and only once), the engine reads the available time to think, sent by the GUI with 'go wtime x' UCI command. Then it translates time in available nodes (nodes = npmsec * x), then feeds available nodes instead of time to the time management logic and starts the search. During the search the engine checks the searched nodes against the available ones in such a way that all the time management logic still fully applies, and the game mimics a real one played on real time. When the search finishes, before returning best move, the total available nodes are updated, subtracting the real searched nodes. After the first move, the time information sent by the GUI is ignored, and the engine fully relies on the updated total available nodes to feed time management. To avoid time losses, the speed of the engine (npms) must be set to a value lower than real speed so that if the real TC is for instance 30 secs, and npms is half of the real speed, the game will last on average 15 secs, so much less than the TC limit, providing for a safety 'time buffer'. There are 2 main limitations with this mode. 1. Engine speed should be the same for both players, and this limits the approach to mainly parameter tuning patches. 2. Because npms is fixed while, in real engines, the speed increases toward endgame, this introduces an artifact that is equivalent to an altered time management. Namely it is like the time management gives less available time than what should be in standard case. Maybe the second limitation could be mitigated in the future with a smarter 'dynamic npms' approach. Tests show that the standard deviation of the results with 'nodestime' is lower than in standard TC, as is expected because now all the introduced noise due to the random speed variability of the engines during the game is fully removed. Original NIT idea by Michael Hoffman that shows how to play in NIT mode without requiring changes to the GUI. This implementation goes a bit further, the key difference is that we read TC from GUI only once upfront instead of re-reading after every move as in Michael's implementation. No functional change.	2015-04-03 04:40:55 +02:00
Marco Costalba	df722521ba	Rename of TimeMgr and friends More natural naming IMO. No functional change.	2015-04-03 04:19:29 +02:00
Marco Costalba	5d1b92e8f9	Introduce elapsed_time() And reformat a bit time manager code. Note that now we set starting search time in think() and no longer in ThreadPool::start_thinking(), the added delay is less than 1 msec, so below timer resolution (5msec) and should not affect time losses ratio. No functional change.	2015-04-03 04:19:26 +02:00
mstembera	6661a31541	Simplification to use only one counter move. STC http://tests.stockfishchess.org/tests/view/5518dca30ebc5902160ec5d0 LLR: 2.95 (-2.94,2.94) [-3.50,0.50] Total: 18868 W: 3638 L: 3530 D: 11700 LTC http://tests.stockfishchess.org/tests/view/5518f7ed0ebc5902160ec5d4 LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 69767 W: 11019 L: 10973 D: 47775 Extracted from http://tests.stockfishchess.org/tests/view/5511028a0ebc5902160ec40b Original patch by hxim. All credit goes to him. Bench: 7664249 Resolves #320	2015-04-03 01:16:15 +08:00
Marco Costalba	6c42575208	Assorted code style of latest commits No functional change.	2015-03-29 10:16:10 +02:00
mbootsector	1d5eaba573	Retire follow-up move heuristic STC: http://tests.stockfishchess.org/tests/view/5501d0f30ebc5902160ec0fd LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 34891 W: 6904 L: 6808 D: 21179 LTC: http://tests.stockfishchess.org/tests/view/550328540ebc5902160ec133 LLR: 3.10 (-2.94,2.94) [-3.00,1.00] Total: 182653 W: 29866 L: 29993 D: 122794 Bench: 8396161 Resolves #310	2015-03-28 22:12:06 +00:00
VoyagerOne	ac8e6ff000	Use CounterMoveHistory when calculating LMR for cut nodes If the sum of CounterMoveHistory heuristic and History heuristic is below zero, then reduce an extra ply in cut nodes LTC: LLR: 2.96 (-2.94,2.94) [0.00,6.00] Total: 6479 W: 1099 L: 967 D: 4413 Bench: 7773299 Resolves #315	2015-03-28 21:15:49 +00:00
Marco Costalba	3a6753328c	Clean up previous patch No functional change.	2015-03-25 07:05:27 +01:00
VoyagerOne	e51965aa57	Introduce a new counter move history penalty Extra penalty for TT move in previous ply when it gets refuted STC: LLR: 2.94 (-2.94,2.94) [-1.50,4.50] Total: 31303 W: 6216 L: 6025 D: 19062 LTC: LLR: 2.97 (-2.94,2.94) [0.00,6.00] Total: 6950 W: 1189 L: 1054 D: 4707 Bench: 8191926 Resolves #309	2015-03-24 23:04:08 +00:00
lucasart	35b6079852	Fix comment We always probe, but we do not prune at PV nodes. No functional change. Resolves #300	2015-03-20 22:40:03 +00:00
Marco Costalba	54889618c2	Reformat FastMove Align to SF coding style. Verified no regression: LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 55938 W: 10893 L: 10835 D: 34210 No functional change.	2015-03-18 08:12:59 +01:00
Marco Costalba	9a6cfee73b	Simplify nosleep logic Avoid redundant 'while' conditions. It is enough to check them in the outer loop. Quick tested for no regression 10K games at 4 threads ELO: -1.32 +-3.9 (95%) LOS: 25.6% Total: 10000 W: 1653 L: 1691 D: 6656 No functional change.	2015-03-18 08:01:50 +01:00
Marco Costalba	2e8552db76	Fix a bogus use of mutex Spinlock must be used instead. Tested for no regression at 15+0.05 th 4: LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 25928 W: 4303 L: 4190 D: 17435 No functional change. Resolves #297	2015-03-17 08:19:29 +00:00
Marco Costalba	a4b2eeea75	Re-arrange history update code Unify the quiet moves loop for both cases, the compiler optimizes away the if (is_ok((ss-1)->currentMove)) inside loop, so that the result is same speed as original. No functional change.	2015-03-16 15:14:09 +01:00
Marco Costalba	13d4df95cd	Use acquire() and release() for spinlocks It is more idiomatic than lock() and unlock(). No functional change.	2015-03-16 08:14:08 +01:00
Joona Kiiski	f04f50b368	Do not sleep, but yield During the search, do not block on condition variable, but instead use std::this_thread::yield(). Clear gain with 16 threads. Again results vary highly depending on hardware, but on average it's a clear gain. ELO: 12.17 +-4.3 (95%) LOS: 100.0% Total: 7998 W: 1407 L: 1127 D: 5464 There is no functional change in single thread mode Resolves #294	2015-03-15 19:45:30 +00:00
Joona Kiiski	d71f707040	Introduce yielding spin locks Idea and original implementation by Stephane Nicolet 7 threads 15+0.05 ELO: 3.54 +-2.9 (95%) LOS: 99.2% Total: 17971 W: 2976 L: 2793 D: 12202 There is no functional change in single thread mode	2015-03-14 19:14:52 +00:00
mstembera	062ca91db5	New easy move implementation Spend much less time in positions where one move is much better than all other alternatives. We carry forward pv stability information from the previous search to identify such positions. It's based on my old InstaMove idea but with two significant improvements. 1) Much better instability detection inside the search itself. 2) When it's time to make a FastMove we no longer make it instantly but still spend at least 10% of normal time verifying it. Credit to Gull for the inspiration. BIG thanks to Gary because this would not work without accurate PV! 20K ELO: 8.22 +-3.0 (95%) LOS: 100.0% Total: 20000 W: 4203 L: 3730 D: 12067 STC LLR: 2.96 (-2.94,2.94) [-1.50,4.50] Total: 23266 W: 4662 L: 4492 D: 14112 LTC LLR: 2.95 (-2.94,2.94) [0.00,6.00] Total: 12470 W: 2091 L: 1931 D: 8448 Resolves #283	2015-03-12 19:49:30 +00:00
Stefan Geschwentner	13c11f4048	Introduce Counter Move History tables Introduce a counter move history table which additionally is indexed by the last move's piece and target square. For quiet move ordering use now the sum of standard and counter move history table. STC: LLR: 2.96 (-2.94,2.94) [-1.50,4.50] Total: 4747 W: 1005 L: 885 D: 2857 LTC: LLR: 2.95 (-2.94,2.94) [0.00,6.00] Total: 5726 W: 1001 L: 872 D: 3853 Because of reported low NPS on multi core test STC (7 threads): ELO: 7.26 +-3.3 (95%) LOS: 100.0% Total: 14937 W: 2710 L: 2398 D: 9829 Bench: 7725341 Resolves #282	2015-03-12 07:29:57 +00:00
Joona Kiiski	81c7975dcd	Use thread specific mutexes instead of a global one. This is necessary to improve the scalability with high number of cores. There is no functional change in a single thread mode. Resolves #281	2015-03-11 21:59:34 +00:00
Marco Costalba	4b59347194	Retire spinlocks Use Mutex instead. This is in preparation for merging with master branch, where we still don't have spinlocks. Eventually spinlocks will be re-added in some future patch, once c++11 has been merged. No functional change.	2015-03-11 21:20:47 +01:00
Marco Costalba	8725494966	Add thread_win32.h header Workaround slow std::thread implementation in mingw and gcc for Windows with our own old low level thread functions. No functional change.	2015-03-10 12:42:40 +01:00
Marco Costalba	63a5fc2366	Rename available_to() Change this API to be more natural and simple. Inspired by a patch by Joona. No functional change.	2015-03-01 12:33:05 +01:00
Marco Costalba	0b36ba74fc	Don't assume the type of Time::point But instead use the proper definition. Also rewrite chrono functions while there. No functional change.	2015-02-24 14:08:14 +01:00
Marco Costalba	38112060dc	Use spinlock instead of mutex for Threads and SplitPoint It is reported to be definitely faster with increasing number of threads, we go from a +3.5% with 4 threads to a +15% with 16 threads. The only drawback is that now when testing with more threads than physical available cores, the speed slows down to a crawl. This is expected and was similar to what we had setting the old sleepingThreads to false. No functional change.	2015-02-23 13:47:07 +01:00
Marco Costalba	098f645d26	Sync with master bench: 8253813	2015-02-23 13:36:15 +01:00
Marco Costalba	e2226cbb20	Use only 'level' as late join metric It seems other metrics are useless; this allows us to simplify the code and to prune useless stuff. STC 20K games 4 threads ELO: -0.76 +-2.8 (95%) LOS: 29.9% Total: 20000 W: 3477 L: 3521 D: 13002 STC 10K games 16 threads ELO: 1.36 +-3.9 (95%) LOS: 75.0% Total: 10000 W: 1690 L: 1651 D: 6659 bench: 8253813	2015-02-22 12:59:34 +01:00
Marco Costalba	5fd5453e59	Further refine SMP code Backported from C++11 branch: https://github.com/official-stockfish/Stockfish/commit/7ff965eebfbc17d2b https://github.com/official-stockfish/Stockfish/commit/e74c2df907d5336d3d2b Fully verified it is equivalent to master (see log msg of individual commits for details). No functional change.	2015-02-21 11:33:03 +01:00
Marco Costalba	e74c2df907	Use sp->master instead of bestThread Verified with: dbg_hit_on(th != sp->master); It is 100% equivalent on more than 200K hits. No functional change.	2015-02-21 10:40:59 +01:00
Marco Costalba	7ff965eebf	Improve comments in SMP code No functional change.	2015-02-20 12:38:54 +01:00
Marco Costalba	a6f873cd8d	Use range-based-for in late join No functional change.	2015-02-20 10:50:47 +01:00
Marco Costalba	40548c9153	Sync with master bench: 7911944	2015-02-20 10:37:29 +01:00
Marco Costalba	667f350737	Clarify we don't late join with only 2 threads Thanks to Gary for pointing this out. No functional change.	2015-02-19 23:12:59 +01:00
Marco Costalba	950c8436ed	Use size_t consistently across thread code No functional change.	2015-02-19 10:43:28 +01:00
Marco Costalba	8d47caa16e	Retire redundant sp->slavesCount field It should be used slavesMask.count() instead. Verified 100% equivalent when sp->allSlavesSearching: dbg_hit_on(sp->allSlavesSearching, sp->slavesCount != sp->slavesMask.count()); No functional change.	2015-02-19 10:36:15 +01:00
Marco Costalba	b9d4e6f7fd	Fix a warning under MSVC Assignment of size_t to int. No functional change.	2015-02-19 10:18:24 +01:00
Marco Costalba	193a7ae35b	Add a couple of asserts to late join Document and clarify that we cannot rejoin on ourselves and that we never late join if we are master and all slaves have finished, indeed in this case we exit idle_loop. No functional change.	2015-02-19 10:08:29 +01:00
Marco Costalba	4f906a2589	Remove useless condition in late join In case of Threads.size() == 2 we have that sp->allSlavesSearching is always false (because we have finished our search), bestSp is always NULL and we never late join, so there is no need to special case here. Tested with dbg_hit_on(sp && sp->allSlavesSearching) and verified it never fires. No functional change.	2015-02-19 09:53:39 +01:00
Marco Costalba	dccaa145d2	Compute SplitPoint::spLevel on the fly And retire a redundant field. This is important also from a concept point of view because we want to keep SMP structures as simple as possible with the only strictly necessary data. Verified with dbg_hit_on(sp->spLevel != level) that the values are 100% the same out of more than 50K samples. No functional change.	2015-02-18 21:50:35 +01:00
Joona Kiiski	d65f75c153	Improve smp performance for high number of threads Balance threads between split points. There are huge differences between different machines and autopurging makes it very difficult to measure the improvement in fishtest, but the following was recorded for 16 threads at 15+0.05: For Bravone (1000 games): 0 ELO For Glinscott (1000 games): +20 ELO For bKingUs (1000 games): +50 ELO For fastGM (1500 games): +50 ELO The change was a regression for no one, and a big improvement for some, so it should be fine to commit it. Also for 8 threads at 15+0.05 we measured a statistically significant improvement: ELO: 6.19 +-3.9 (95%) LOS: 99.9% Total: 10325 W: 1824 L: 1640 D: 6861 Finally it was verified that there was no (significant) regression for 4 threads: ELO: 0.09 +-2.8 (95%) LOS: 52.4% Total: 19908 W: 3422 L: 3417 D: 13069 2 threads: ELO: 0.38 +-3.0 (95%) LOS: 60.0% Total: 19044 W: 3480 L: 3459 D: 12105 1 thread: ELO: -1.27 +-2.1 (95%) LOS: 12.3% Total: 40000 W: 7829 L: 7975 D: 24196 Resolves #258	2015-02-16 20:36:13 +00:00
lucasart	f8f5dcbb68	Compute checkers from scratch This micro-optimization only complicates the code and provides no benefit. Removing it is even a speedup on my machine (i7-3770k, linux, gcc 4.9.1): stat test master diff mean 2,403,118 2,390,904 12,214 stdev 12,043 10,620 3,677 speedup 0.51% P(speedup>0) 100.0% No functional change.	2015-02-16 09:34:26 +08:00
Marco Costalba	686b45e121	Retire one do_move() overload After Lucas patch it is almost useless. No functional change.	2015-02-15 12:23:03 +01:00
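
Note on commit 926f215061 ('nodes as time' mode): the bookkeeping described in that message can be sketched roughly as below. This is only an illustration of the described idea, not the code from the commit. The struct NodesTimeBudget and its members start_search() and finish_search() are hypothetical names chosen for this sketch; only the formula nodes = npmsec * x and the "read the GUI time once, then subtract searched nodes" rule come from the commit message.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical helper, NOT Stockfish's actual time management API.
// It tracks a node budget that stands in for the clock.
struct NodesTimeBudget {
    std::int64_t npmsec;              // assumed engine speed in nodes per millisecond
    std::int64_t availableNodes = 0;  // remaining budget, valid once initialised
    bool initialised = false;

    explicit NodesTimeBudget(std::int64_t nodesPerMsec) : npmsec(nodesPerMsec) {
        assert(npmsec > 0);
    }

    // Called when a search starts. guiTimeMsec is the 'x' of 'go wtime x'.
    // The GUI time is converted to nodes only on the first call; later
    // values sent by the GUI are ignored.
    std::int64_t start_search(std::int64_t guiTimeMsec) {
        if (!initialised) {
            availableNodes = npmsec * guiTimeMsec;  // nodes = npmsec * x
            initialised = true;
        }
        return availableNodes;  // fed to time management in place of time
    }

    // Called before returning the best move: subtract the nodes really searched.
    void finish_search(std::int64_t searchedNodes) {
        availableNodes -= searchedNodes;
    }
};
```

For example, with npmsec = 500 and 'go wtime 30000' the initial budget is 15,000,000 nodes. After a search that visits 1,000,000 nodes, the next search starts with 14,000,000 nodes regardless of the wtime the GUI sends.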


1596 commits