BadFish

mirror of https://github.com/sockspls/badfish synced 2025-07-14 21:19:13 +00:00

Author	SHA1	Message	Date
lucasart	328098d027	Fix TT comment and static_assert() Comment is based on a misunderstanding of what unaligned memory access is. Here is an article that explains it very clearly: https://www.kernel.org/doc/Documentation/unaligned-memory-access.txt No matter how we define TTEntry or TTCluster, there will never be any unaligned memory access. This is because the complier knows the alignment rules, and does the necessary adjustments to make sure unaligned memory access does not occur. The issue being adressed here has nothing to do with unaligned memory access. It is about cache performance. In order to achieve best cache performance: - we prefetch the cacheline as soon as possible. - we ensure that TT clusters do not spread across two cachelines. If they did, we would need to prefetch 2 cachelines, which could hurt cache performance. Therefore the true conditions to achieve this are: 1/ start adress of TT is cache line aligned. void TranspositionTable::resize() enforces this. 2/ TT cluster size should divide the cache line size. Currently, we pack 2 clusters per cache lines. It used to be 1 before "TT sardines". Does not matter what the ratio is, all we want is to fit an integer number of clusters per cache line. No functional change. Resolves #506	2015-11-20 23:23:53 -08:00
Marco Costalba	dc3508d157	Fix a comment in TTEntry::save Comment was slightly incorrect. No functional change.	2015-10-05 09:16:16 +02:00
mstembera	01fab4d432	Remove unnecessary generation check in TT save Checking for generation is unnecessary because if the key matches then the entry was probed and refreshed earlier. STC 2MB LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 57391 W: 10671 L: 10613 D: 36107 http://tests.stockfishchess.org/tests/view/55ef59fa0ebc5976a2d6da5d LTC 8MB LLR: 2.95 (-2.94,2.94) [-3.00,1.00] Total: 60732 W: 9260 L: 9199 D: 42273 http://tests.stockfishchess.org/tests/view/55ef8fe60ebc5976a2d6da6b STC 16MB LLR: 2.95 (-2.94,2.94) [-4.00,0.00] Total: 23443 W: 4369 L: 4293 D: 14781 http://tests.stockfishchess.org/tests/view/55ef8fe60ebc5976a2d6da6b No functional change Resolves #427	2015-09-17 17:13:45 -07:00
Marco Costalba	ee0371f86e	Cleanup work in misc.cpp Also some code style tidy up of latest patches. Also renamed checkSq -> checkSquares because it is a bitboard and not a square. No functional change.	2015-05-10 09:42:26 +02:00
mstembera	eaeb63f1d0	Smart TT save Don't overwrite more valuable data with less valuable data STC 2MB LLR: 2.96 (-2.94,2.94) [-1.50,4.50] Total: 21132 W: 4108 L: 3946 D: 13078 http://tests.stockfishchess.org/tests/view/5547d59f0ebc5940ca5d6883 LTC 8MB LLR: 2.97 (-2.94,2.94) [0.00,6.00] Total: 13381 W: 2149 L: 1987 D: 9245 http://tests.stockfishchess.org/tests/view/5549b5a80ebc5940ca5d68b9 STC 16MB regression w/ zero effective hash pressure LLR: 2.96 (-2.94,2.94) [-5.00,0.00] Total: 18944 W: 3607 L: 3564 D: 11773 http://tests.stockfishchess.org/tests/view/554b0fda0ebc5940ca5d68ea Bench: 8787152 Resolves #347	2015-05-09 17:43:57 +01:00
Marco Costalba	60c121f3b1	Sync with master bench: 7374604	2015-01-31 13:05:51 +01:00
Jean-Francois Romang	a3b4e9e23c	Ressurrect hashfull patch This is an old patch from Jean-Francois Romang to send UCI hashfull info to the GUI: https://github.com/mcostalba/Stockfish/pull/60/files It was wrongly judged as a slowdown, but it takes much less than 1 ms to run, indeed on my core i5 2.6Ghz it takes about 2 microsecs to run! Regression test is good: STC LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 7352 W: 1548 L: 1401 D: 4403 LTC LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 61432 W: 10307 L: 10251 D: 40874 I have set the name of the author to the original one. No functional change.	2015-01-30 18:07:20 +01:00
Marco Costalba	96e36a7897	Explicitly defaulted and deleted members Better than a bit obscure implicit ones. No functional change.	2015-01-21 13:18:19 +01:00
Marco Costalba	05cb58f4fc	Fix some missing rename from previous patch No functional change.	2015-01-17 22:15:15 +01:00
Marco Costalba	595fc342cf	Fix a possible overflow in TT resize On platforms where size_t is 32 bit, we can have an overflow in this expression: (mbSize * 1024 * 1024) Fix it setting max hash size of 2GB on platforms where size_t is 32 bit. A small rename while there: now struct Cluster is definied inside class TranspositionTable so we should drop the redundant TT prefix. No functional change.	2015-01-17 21:42:47 +01:00
Marco Costalba	4eb2d8ce09	Assorted headers cleanup Mostly comments fixing and other small things. No functional change.	2015-01-11 22:56:35 +01:00
Marco Costalba	42b48b08e8	Update copyright year No functional change.	2015-01-10 11:46:28 +01:00
Marco Costalba	413b243809	Coding style in TT code In particular seems more natural to return bool and TTEntry on the same line, actually we should pass and return them as a pair, but due to limitations of C++ and not wanting to use std::pair this can be an acceptable compromise. No functional change. Resolves #157	2014-12-14 23:49:00 +00:00
mstembera	14cf27e6f6	Avoid searching TT twice for the same key/position during probe() and store(). Just keep the pointer and remove code from tt.cpp STC LLR: 2.96 (-2.94,2.94) [-1.50,4.50] Total: 13620 W: 2810 L: 2665 D: 8145 LTC LLR: 2.97 (-2.94,2.94) [0.00,6.00] Total: 13021 W: 2238 L: 2073 D: 8710STC http://tests.stockfishchess.org/tests/view/548436860ebc59331739b90c STC 4MB ELO: 2.41 +-2.2 (95%) LOS: 98.6% Total: 40000 W: 8175 L: 7897 D: 23928 LTC 16MB ELO: 1.78 +-2.0 (95%) LOS: 96.1% Total: 39683 W: 6763 L: 6560 D: 26360 Resolves #151 Bench: 8116521	2014-12-13 07:22:37 +00:00
lucasart	e60cdca9b0	Convert TT depth to int8_t Now that half plies have been removed from the engine, we can encode TT depth into an int8_t. Range is -128 to +127, so it goes still further than the previous limit of 121 plies (with ONE_PLY == 2 where depth - DEPTH_NONE was encoded as an uint8_t). No functional change. Resolved #60	2014-10-01 20:51:32 +01:00
Marco Costalba	a1b62d68ec	Trivial code style fixes Mainly to sync mine and official repo. No functional change.	2014-09-30 09:05:20 +02:00
lucasart	c192b692cf	size_t cast in TranspositionTable::first_entry() 32-bit truncation would make this function bogus when clusterCount >= 2^33 (ie. Hash >= 256 GB). No function change.	2014-07-03 18:23:56 +08:00
lucasart	24ba204931	Raise max Hash to 1TB And use size_t where appropriate, as suggested on FishCooking. No functional change.	2014-07-01 18:37:18 +08:00
Ron Britvich	ccd823a4ff	Pack 3 TT entries in 32 bytes cluster Idea from Ron Britvich Code reworked by Marco Costalba and Joona Kiiski Bench: 8095369 Resolves #3 Resolves #10	2014-06-28 14:06:32 -04:00
Marco Costalba	585655b16e	Tidy up tt.h Backport some non-functional changes found working on 'dense TT' patch. No functional change.	2014-05-25 00:02:09 +02:00
Marco Costalba	9e72e35942	Fix an incorrect 'friend' declaration Spotted by Lee David. No functional change.	2014-03-23 11:17:38 +01:00
mstembera	ffdf63ff7c	Refresh TT entries generation automatically on probe And other assorted simplifications, tested with SPRT[-3, 1] Passed both short TC LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 18814 W: 3600 L: 3475 D: 11739 And long TC LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 20731 W: 3217 L: 3096 D: 14418 No functional change.	2014-03-23 09:46:15 +01:00
mstembera	553ead429d	Some minor cleanup stuff I came across while browsing the code. No functional change.	2014-03-03 08:57:20 +01:00
Marco Costalba	41641e3b1e	Assorted tweaks from DON Mainly renames and some little code style improvment, inspired by looking at DON sources: https://github.com/erashid/DON No functional change.	2014-02-09 17:31:45 +01:00
Marco Costalba	40c863d41a	Increase max hash size to 16GB TCEC season 3, which is due to start in a few weeks, just had its server upgraded to 64GB RAM and will therefore allow 16GB hash to be used per engine. This is almost the upper limit without changing the type of size and hashMask. After this we need to move to uint64_t instead of uint32_t. No functional change.	2014-01-18 18:22:32 +01:00
Marco Costalba	c9dcda6ac4	Update copyright year No functional change.	2014-01-02 01:49:18 +01:00
Richard Lloyd	13a73f67c0	Big assorted spelling fixes No functional change.	2013-12-02 20:29:35 +01:00
Lucas Braesch	eed508b444	Futility pruning simplification 1/ eval margin and gains removed: 16bit are now free on TT entries, due to the removal of eval margin. may be useful in the future :) gains removed: use instead by Value(128). search() and qsearch() are now consistent in this regard. 2/ futility_margin() linear formula instead of complex (log(depth), movecount) formula. 3/ unify pre & post futility pruning pre futility pruning used depth < 7 plies, while post futility pruning used depth < 4 plies. Now it's always depth < 7. Tested with fixed number of games both at short TC: ELO: 0.82 +-2.1 (95%) LOS: 77.3% Total: 40000 W: 7939 L: 7845 D: 24216 And long TC ELO: 0.59 +-2.0 (95%) LOS: 71.9% Total: 40000 W: 6876 L: 6808 D: 26316 bench 7243575	2013-11-09 10:17:27 +01:00
Marco Costalba	343544f3f7	Revert "Retire eval margin and gains" This reverts commit `ecd07e51d0`. Patch was incorrect and partial. It will be reapplied in the correct form. bench: 9189063	2013-11-07 22:32:13 +01:00
Lucas Braesch	ecd07e51d0	Retire eval margin and gains 1/ eval margin and gains removed: - gains removed by Value(128): search() and qsearch() now behave consistently! 2/ futility_margin() - testing showed that there is no added value in this weird (log(depth), movecount) formula, and a much simpler linear formula is just as good. In fact, it is most likely better, as it is not yet optimally tuned. - the new simplified formula also means we get rid of FutilityMargins[], its initialization code, and more importantly ss->futilityMoveCount, and the hacky code that updates it throughout the search(). - the current formula gives negative futility margins, and there is a hidden interaction between the move coutn pruning formula and the futility margin one: what happens is that MCP is supposed to be triggered before we use the non-sensical negative futility margins. 3/ unify pre & post futility pruning - pre futility pruning (what SF calls value based pruning) used depth < 7 plies, while post futility pruning (what SF calls static null move pruning) used depth < 4 plies. - also the condition depth < 7 in pre futility pruning was not obvious, and it seemd to be depth < 16 (futility_margin() returns an infinite value when depth >= 7). Tested with fixed number of games both at short TC: ELO: 0.82 +-2.1 (95%) LOS: 77.3% Total: 40000 W: 7939 L: 7845 D: 24216 And long TC ELO: 0.59 +-2.0 (95%) LOS: 71.9% Total: 40000 W: 6876 L: 6808 D: 26316 bench: 10206576	2013-11-07 19:46:51 +01:00
Lucas Braesch	7f142d6817	Use prefix operators wherever possible No functional change.	2013-10-05 18:10:43 +02:00
homoSapiensSapiens	002062ae93	Use #ifndef instead of #if !defined And #ifdef instead of #if defined This is more standard form (see for example iostream file). No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2013-07-24 19:49:17 +02:00
Marco Costalba	203fdc9ac1	Use calloc() in TranspositionTable::set_size() Function calloc() already initializes memory to zero, so avoid calling clear() afterwards. Also some renaming while there (inspired by DiscoCheck). No functional change.	2013-06-29 11:23:07 +02:00
Marco Costalba	2c7ab488a8	Fix description of TT entry It was way outdated and wrong ! No functional change.	2013-06-14 08:21:02 +02:00
Marco Costalba	481eda4ca0	Re-add "Cache line aligned TT" But this time do not play with pointers, in particular do not assume that size_t is an unsigned type of the same width as pointers. This code should be fully portable. No functional change.	2013-05-01 23:42:16 +02:00
Marco Costalba	293c44bc09	Revert "Cache line aligned TT" This reverts commit `083fe58124` It seems to break Android build No functional change.	2013-04-30 19:42:21 +02:00
Marco Costalba	083fe58124	Cache line aligned TT Let TT clusters (16*4=64 bytes) to hold on a singe cache line. This avoids the need for the double prefetch. Original patches by Lucas and Jean-Francois that has also tested on his AMD FX: BIG HASHTABLE ./stockfish bench 1024 1 18 > /dev/null Before: 1437642 nps 1426519 nps 1438493 nps After: 1474482 nps 1476375 nps 1475877 nps SMALL HASHTABLE ./stockfish bench 128 1 18 > /dev/null Before: 1435207 nps 1435586 nps 1433741 nps After: 1479143 nps 1471042 nps 1472286 nps No functional change.	2013-04-26 19:38:11 +02:00
Marco Costalba	c5ec94d0f1	Update copyright year No functional change.	2013-02-19 07:54:14 +01:00
Marco Costalba	0be7b8c542	Further simplify first_entry() We can encode the ClusterSize directly in the hashMask, this allows to skip the left shift. There is no real change, but bench number is now different because instead of using the lowest order bits of the key to index the start of the cluster, now we don't use the last two lsb bits that are always set to zero (cluster size is 4). So for instance, if 10 bits are used to index the cluster, instead of bits [9..0] now we use bits [11..2]. This changes the positions that end up in the same cluster affecting TT hits and so bench is different. Also some renaming while there. bench: 5383795	2013-02-09 16:37:20 +01:00
Marco Costalba	c698362680	Microptimize first_entry() for 32bits Do a 32bit bitwise 'and' instead of a 64bit subtract and bitwise 'and'. This is possible because even in the biggest hash table case (8GB) the number of entries is 2^29 so storable in an unsigned int. No functional change.	2013-02-09 10:55:38 +01:00
Marco Costalba	fe3352665b	Retire TTCluster and simplify TT Also some renaming while there. No functional change.	2013-02-09 10:55:20 +01:00
jundery	88c3670edf	Rename posKey stored in the transposition table [Edit: Slightly extended by me] No functional change.	2013-02-06 08:03:37 +01:00
Marco Costalba	9b1cf3cf43	Have fun with union in book.cpp Fancy way to use an union to map polyglot zobrist keys in one go. Also some renaming while there. No functional change.	2013-01-06 12:06:19 +01:00
Marco Costalba	3cf6471738	Revert evaluation cache And return on using TT as backing store for position evaluations. Tests (even on single thread) show eval cache was a regression. In multi thread result should be even worst because eval cache is a per-thread struct, while TT is shared. After 4957 games at 15"+0.05 (single thread) eval cache vs master 969 - 1093 - 2895 -9 ELO So previous reported result of +18 ELO was probably due to an issue in the testing framework (a bug in cutechess-cli) that has been fixed in the meanwhile. bench: 5386711	2012-12-27 13:57:17 +01:00
Marco Costalba	a2f46446cf	Revert store of distinct upper and lower bounds Test by Joona prooves the new feature don't value 70 added lines of code. Grand totals after 10040 games (crashes: 0) for tt_both master_9edc7 - 6a93488_6a934: 1756 - 1688 - 6596 ELO +2 (+- 2.7) Confirmed by test of Gary: After 8680 games: ELO: 0.80 +- 99%: 9.62 95%: 7.31 LOS: 65.38% Wins: 1288 Losses: 1268 Draws: 6130 Thanks a lot to both for testing it !!! bench 5149248	2012-12-15 11:18:52 +01:00
Marco Costalba	da98a45bcb	Ensure valueLower <= valueUpper In case a TTEntry stores both an upper and a lower bound ensure that upper bound is not smaller than lower bound. bench 1813815	2012-12-09 14:14:44 +01:00
Marco Costalba	feeafb0a50	Store distinct upper and lower bound scores This is more complex than what I'd like but I was unable to split in small chunks. Here we add 2 slots to TTEntry (valueUpper and depthUpper) so that sizeof(TTEntry) returns to the original 16 bytes and we can pack exactly 4 entries in a 64 bytes cache line. Now we save an upper bound score alongside a lower (exact) score. The idea is to increase TT cut-offs rates becuase there is now an higher probability for a node to use TT info. This patch is highly experimental and probably needs further steps as is hinted by an unrealistic bench number: bench: 2022385	2012-12-09 13:15:50 +01:00
Marco Costalba	98cd8239cc	Don't save eval score in TT This patch completes the removal of eval info in TT table. No functional change.	2012-12-01 15:19:50 +01:00
Marco Costalba	32c504076f	Use std::vector to implement HashTable Allows some code semplification and avoids directly allocation and managing heap memory. Also the usual renaming while there. No functional change and no speed regression. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-03-31 19:07:11 +01:00
Marco Costalba	304deb5e83	Rename Materials and Pawns hash stuff No functional change. Signed-off-by: Marco Costalba <mcostalba@gmail.com>	2012-03-31 11:59:23 +01:00

1 2 3

112 commits