BadFish

mirror of https://github.com/sockspls/badfish synced 2025-04-29 16:23:09 +00:00

Author	SHA1	Message	Date
Shawn Xu	b7f17346e5	Fix Sanitizer Tests closes https://github.com/official-stockfish/Stockfish/pull/5692 No functional change	2024-11-22 23:41:34 +01:00
Shawn Xu	60351b9df9	Introduce Various Correction histories This patch introduces three additional correction histories, namely, Major Piece Correction History, Minor Piece Correction History, and Non-Pawn Correction History. Introduced by @mcthouacbb in Sirius (https://github.com/mcthouacbb/Sirius) chess engine. The Major Piece Correction History is indexed by side-to-move and the Zobrist key representing the position of the King, Rook, and Queen of both sides. Likewise, the Minor Piece Correction History is indexed by side-to-move and the Zobrist key representing the position of the King, Knight, and Bishop of both sides. Also See: `97b85bbaac` `3099cdef2f` Introduced by @zzzzz151 in Starzix (https://github.com/zzzzz151/Starzix) chess engine. Non-Pawn correction history consists of side-to-move, side of Zobrist key, and a Zobrist key representing of the position of all non-pawn pieces of one side. The non-pawn correction values for both key sides are then summed. Also See: `34911772f1` `33e0df8dd2` The weights on the final correction value of the above correction histories, as well as existing correction histories, are then tuned in two separate SPSA sessions, totaling 75k games. SPSA1: https://tests.stockfishchess.org/tests/view/66e5243886d5ee47d953a86b (Stopped early due to some weights reaching the maximum value) SPSA2: https://tests.stockfishchess.org/tests/view/66e6a26f86d5ee47d953a965 Also thanks to @martinnovaak, (Motor https://github.com/martinnovaak/motor author) for insights and suggestions. Passed STC: LLR: 2.94 (-2.94,2.94) <0.00,2.00> Total: 23328 W: 6197 L: 5901 D: 11230 Ptnml(0-2): 82, 2582, 6041, 2876, 83 https://tests.stockfishchess.org/tests/view/66e8787b86d5ee47d953ab6f Passed LTC: LLR: 2.94 (-2.94,2.94) <0.50,2.50> Total: 10626 W: 2826 L: 2560 D: 5240 Ptnml(0-2): 4, 1054, 2941, 1300, 14 https://tests.stockfishchess.org/tests/view/66e8ab2386d5ee47d953aba8 closes https://github.com/official-stockfish/Stockfish/pull/5598 Bench: 1011161	2024-09-17 21:01:43 +02:00
Disservin	a06e7004c1	Port instrumented testing to python Since an unknown amount of time the instrumented CI has been a bit flawed, explained here https://github.com/official-stockfish/Stockfish/issues/5185. It also experiences random timeout issues where restarting the workflow fixes it or very long run times (more than other workflows) and is not very portable. The intention of this commit is to port the instrumented.sh to python which also works on other operating systems. It should also be relatively easy for beginners to add new tests to assert stockfish's output and to run it. From the source directory the following command can be run. `python3 ../tests/instrumented.py --none ./stockfish` A test runner will go over the test suites and run the test cases. All instrumented tests should have been ported over. The required python version for this is should be 3.7 (untested) + the requests package, testing.py includes some infrastructure code which setups the testing. fixes https://github.com/official-stockfish/Stockfish/issues/5185 closes https://github.com/official-stockfish/Stockfish/pull/5583 No functional change	2024-09-17 20:24:17 +02:00
Joost VandeVondele	ab00c24c7e	Fix some of the tests due to https://github.com/official-stockfish/Stockfish/issues/5185 some CI tests are skipped. This patch fixes a few tests that need updating. closes https://github.com/official-stockfish/Stockfish/pull/5560 No functional change	2024-09-03 17:48:58 +02:00
Joost VandeVondele	3d92950859	Limit depth after extensions to avoid asserts. currently extensions can cause depth to exceed MAX_PLY. This triggers the assert near line 542 in search when running a binary compiled with `debug=yes` on a testcase like: ``` position fen 7K/P1p1p1p1/2P1P1Pk/6pP/3p2P1/1P6/3P4/8 w - - 0 1 go nodes 1000000 ``` passed STC https://tests.stockfishchess.org/tests/view/6668a56a602682471b064c8d LLR: 2.93 (-2.94,2.94) <-1.75,0.25> Total: 143936 W: 37338 L: 37238 D: 69360 Ptnml(0-2): 514, 16335, 38149, 16477, 493 closes https://github.com/official-stockfish/Stockfish/pull/5383 Bench: 1160467	2024-06-15 12:02:56 +02:00
Dubslow	c8213ba0d0	Simplify TT interface and avoid changing TT info This commit builds on the work and ideas of #5345, #5348, and #5364. Place as much as possible of the TT implementation in tt.cpp, rather than in the header. Some commentary is added to better document the public interface. Fix the search read-TT races, or at least contain them to within TT methods only. Passed SMP STC: https://tests.stockfishchess.org/tests/view/666134ab91e372763104b443 LLR: 2.94 (-2.94,2.94) <-1.75,0.25> Total: 512552 W: 132387 L: 132676 D: 247489 Ptnml(0-2): 469, 58429, 138771, 58136, 471 The unmerged version has bench identical to the other PR (see also #5348) and therefore those same-functionality tests: SMP LTC: https://tests.stockfishchess.org/tests/view/665c7021fd45fb0f907c214a SMP LTC: https://tests.stockfishchess.org/tests/view/665d28a7fd45fb0f907c5495 closes https://github.com/official-stockfish/Stockfish/pull/5369 bench 1205675	2024-06-12 09:17:04 +02:00
Disservin	56a9cc512e	Move ALSR change to CI Workflow file It makes more sense to not (potentially) change the developers alsr entropy setting to make the test run through. This should be an active choice even if the test then might fail locally for them. closes https://github.com/official-stockfish/Stockfish/pull/5182 No functional change	2024-04-21 14:49:11 +02:00
Joost VandeVondele	1a8de45b8c	Improve CI the recent refactoring has shown some limitations of our testing, hence we add a couple of more tests including: * expected mate score * expected mated score * expected in TB win score * expected in TB loss score * expected info line output * expected info line output (wdl) closes https://github.com/official-stockfish/Stockfish/pull/5181 No functional change	2024-04-21 14:46:01 +02:00
Disservin	fb07281f55	Fix false positives from ThreadSanitizer Since Linux Kernel 6.5 we are getting false positives from the ci, lower the ALSR entropy to disable ALSR, which works as a temporary workaround. https://github.com/google/sanitizers/issues/1716 https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2056762 closes https://github.com/official-stockfish/Stockfish/pull/5115 No functional change	2024-03-14 22:14:11 +01:00
FauziAkram	833a2e2bc0	Cleanup comments Tests used to derive some Elo worth comments: https://tests.stockfishchess.org/tests/view/656a7f4e136acbc573555a31 https://tests.stockfishchess.org/tests/view/6585fb455457644dc984620f closes https://github.com/official-stockfish/Stockfish/pull/4945 No functional change	2023-12-31 19:54:27 +01:00
Joost VandeVondele	8192945870	Improve testing coverage, remove unused code a) Add further tests to CI to cover most features. This uncovered a potential race in case setoption was sent between two searches. As the UCI protocol requires this sent to be went the engine is not searching, setoption now ensures that this is the case. b) Remove some unused code closes https://github.com/official-stockfish/Stockfish/pull/4730 No functional change	2023-08-11 19:27:46 +02:00
Joost VandeVondele	a3a91f3f9f	Build and test more binaries in CI use a fixed compiler on Linux and Windows (right now gcc 11). build avxvnni on Windows (Linux needs updated core utils) build x86-32 on Linux (Windows needs other mingw) fix a Makefile issue where a failed PGOBENCH would not stop the build reuse the WINE_PATH for SDE as we do for QEMU use WINE_PATH variable also for the signature verify the bench for each of the binaries do not build x86-64-avx2 on macos closes https://github.com/official-stockfish/Stockfish/pull/4682 No functional change	2023-07-15 09:15:16 +02:00
Joost VandeVondele	b7ee7290b5	Add network export to CI verify the network written by export_net matches the original closes https://github.com/official-stockfish/Stockfish/pull/4613 No functional change	2023-06-12 20:35:44 +02:00
Michael Ortmann	4b86ef8c4f	Fix typos in comments, adjust readme closes https://github.com/official-stockfish/Stockfish/pull/3822 also adjusts readme as requested in https://github.com/official-stockfish/Stockfish/pull/3816 No functional change	2021-12-01 18:07:30 +01:00
Tomasz Sobczyk	07e6ceacd6	Add basic github workflow move to github actions to replace travis CI. First version, testing on linux using gcc and clang. gcc build with sanitizers and valgrind. No functional change	2021-06-18 22:05:56 +02:00
Joost VandeVondele	98cbaa6c6b	Enhance CI to error on leaks Add flags to valgrind in our Continuous Integration scripts, to error on memory leaks. closes https://github.com/official-stockfish/Stockfish/pull/3525 No functional change.	2021-06-05 10:55:57 +02:00
Stéphane Nicolet	f90274d8ce	Small clean-ups - Comment for Countemove pruning -> Continuation history - Fix comment in input_slice.h - Shorter lines in Makefile - Comment for scale factor - Fix comment for pinners in see_ge() - Change Thread.id() signature to size_t - Trailing space in reprosearch.sh - Add Douglas Matos Gomes to the AUTHORS file - Introduce comment for undo_null_move() - Use Stockfish coding style for export_net() - Change date in AUTHORS file closes https://github.com/official-stockfish/Stockfish/pull/3416 No functional change	2021-05-17 10:47:14 +02:00
Dieter Dobbelaere	7ffae17f85	Add Stockfish namespace. fixes #3350 and is a small cleanup that might make it easier to use SF in separate projects, like a NNUE trainer or similar. closes https://github.com/official-stockfish/Stockfish/pull/3370 No functional change.	2021-03-07 14:26:54 +01:00
Joost VandeVondele	9382f854b3	Schedule threads fairly under valgrind fixes a rare case that can cause CI to fail when running multithreaded under valgrind. closes https://github.com/official-stockfish/Stockfish/pull/3165 No functional change.	2020-10-05 18:56:49 +02:00
Guy Vreuls	ea6220f381	This commit enables a mixed bench, to improve CI and allow for PGO (profile-build) of the NNUE part of the code. Joint work gvreuls / vondele * Download the default NNUE net in AppVeyor * Download net in travis CI `make net` * Adjust tests to cover more archs, speedup instrumented testing * Introduce 'mixed' bench as default, with further options: classical, NNUE, mixed. mixed (default) and NNUE require the default net to be present, which can be obtained with ``` make net ``` Further examples (first is equivalent to `./stockfish bench`): ``` ./stockfish bench 16 1 13 default depth mixed ./stockfish bench 16 1 13 default depth classical ./stockfish bench 16 1 13 default depth NNUE ``` The net is now downloaded automatically if needed for `profile-build` (usual `build` works fine without net present) PGO gives a nice speedup on fishtest: passed STC: LLR: 2.93 (-2.94,2.94) {-0.50,1.50} Total: 3360 W: 469 L: 343 D: 2548 Ptnml(0-2): 20, 246, 1030, 356, 28 https://tests.stockfishchess.org/tests/view/5f31b5499081672066537569 passed LTC: LLR: 2.97 (-2.94,2.94) {0.25,1.75} Total: 8824 W: 609 L: 502 D: 7713 Ptnml(0-2): 8, 430, 3438, 519, 17 https://tests.stockfishchess.org/tests/view/5f31c87b908167206653757c closes https://github.com/official-stockfish/Stockfish/pull/2931 fixes https://github.com/official-stockfish/Stockfish/issues/2907 requires fishtest updates before commit Bench: 4290577	2020-08-11 08:17:03 +02:00
Marco Costalba	05f7d59a9a	Assorted trivial cleanups 1/2019 To address #1862 No functional change.	2019-02-08 10:20:43 +01:00
Joost VandeVondele	d2acdac101	Small improvements to the CI infrastructure - avoid inlining for the debug testing so that suppressions work - provide more output for triggered errors No functional change.	2019-01-09 16:57:24 +01:00
MJZ1977	70880b8e24	Flag critical search tree in hash table Introducing new concept, saving principal lines into the transposition table to generate a "critical search tree" which we can reuse later for intelligent pruning/extension decisions. For instance in this patch we just reduce reduction for these lines. But a lot of other ideas are possible. To go further : tune some parameters, how to add or remove lines from the critical search tree, how to use these lines in search choices, etc. STC : LLR: 2.94 (-2.94,2.94) [0.50,4.50] Total: 59761 W: 13321 L: 12863 D: 33577 +2.23 ELO http://tests.stockfishchess.org/tests/view/5c34da5d0ebc596a450c53d3 LTC : LLR: 2.96 (-2.94,2.94) [0.00,3.50] Total: 26826 W: 4439 L: 4191 D: 18196 +2.9 ELO http://tests.stockfishchess.org/tests/view/5c35ceb00ebc596a450c65b2 Special thanks to Miguel Lahoz for his help in transposition table in/out. Bench: 3399866	2019-01-09 15:05:33 +01:00
Marco Costalba	eb6d7f537d	Assorted trivial cleanups (#1894 ) To address https://github.com/official-stockfish/Stockfish/issues/1862 No functional change.	2019-01-01 14:10:26 +01:00
ianfab	d44701be4b	Fix condition for error message of signature script Use obtained bench instead of reference bench when checking for crash. No functional change.	2018-07-27 10:16:33 +02:00
Joost VandeVondele	155d5417d9	Improve signature script Catch case of missing bench, indicative of a crash or assert. No functional change	2018-05-08 10:32:23 +02:00
Joost VandeVondele	c959871a59	Integrate syzygy in automated testing (v2). Extends valgrind/sanitizer testing to cover syzygy code. The script downloads 4 man syzygy as needed. The time needed for the additional testing is small (in fact hard to see a difference compared to the large fluctuations in testing time in travis). Possible follow-ups: * include more TB sensitive positions in bench. * include the test script of recent commit "Refactor tbprobe.cpp". * verify unchanged bench with TB (with a long run). * make the TB part of the continuation integration tests optional. Closes https://github.com/official-stockfish/Stockfish/pull/1518 and https://github.com/official-stockfish/Stockfish/pull/1490 No functional change.	2018-03-30 10:23:48 +02:00
Joost VandeVondele	9d95d43c57	Multi-threaded search testing with valgrind Also check with valgrind the multi-threaded search. On top of the fix for issue #1227 (PR #1235). No functional change.	2017-09-01 20:19:43 +02:00
Marco Costalba	45e254a0a0	Restore perft Rewrite perft to be placed naturally inside new bench code. In particular we don't have special custom code to run perft anymore but perft is just a new parameter of 'go' command. So user API is now changed, old style command: $perft 5 becomes $go perft 4 No functional change.	2017-08-18 09:04:38 -07:00
Marco Costalba	4d511512d2	Speed up Trevis CI Avoid a couple of redundant rebuilds and compile with 2 threads since travis gives 2vCPUs. Also enable -O1 optimization for valgrind and sanitizers, it should be safe withouth false positives and it gives a very sensible speed up, especially with valgrind. The spee dup allow us to increase testing to depth 10, useful for thread sanitizer. No functional change.	2017-08-18 03:07:41 -07:00
Joost VandeVondele	272e4d1ac7	Faster travis checks in the last month a couple of timeouts have been seen in travis valgrind testing, leading to undesired false positives. The precise cause of this is unclear: a normal valgrind instrumented run is about 6min, the timeout is 10min. Either there are rare hangs (not reproduced locally), or maybe the actual runtime fluctuates on the travis infrastructure (which uses VMs on AWS as far as I know). This patch leads to roughly a 2x speedup of the instrumented testing by reducing the depth from 10 to 9. If timeouts persist, it needs further analysis. No functional change. Closes #1171	2017-07-23 17:23:14 -07:00
Joost VandeVondele	7e897a64f2	Remove race suppression. Pull #1134 fixed another race, so that can be removed from the thread sanitizer suppressions. No functional change. Closes #1150	2017-06-28 17:06:52 -07:00
Joost VandeVondele	3cb0200459	Fix four data races. the nodes, tbHits, rootDepth and lastInfoTime variables are read by multiple threads, but not declared atomic, leading to data races as found by -fsanitize=thread. This patch fixes this issue. It is based on top of the CI-threading branch (PR #1129), and should fix the corresponding CI error messages. The patch passed an STC check for no regression: http://tests.stockfishchess.org/tests/view/5925d5590ebc59035df34b9f LLR: 2.96 (-2.94,2.94) [-3.00,1.00] Total: 169597 W: 29938 L: 30066 D: 109593 Whereas rootDepth and lastInfoTime are not performance critical, nodes and tbHits are. Indeed, an earlier version using relaxed atomic updates on the latter two variables failed STC testing (http://tests.stockfishchess.org/tests/view/592001700ebc59035df34924), which can be shown to be due to x86-32 (http://tests.stockfishchess.org/tests/view/592330ac0ebc59035df34a89). Indeed, the latter have no instruction to atomically update a 64bit variable. The proposed solution thus uses a variable in Position that is accessed only by one thread, which is copied every few thousand nodes to the shared variable in Thread. No functional change. Closes #1130 Closes #1129	2017-06-21 13:37:58 -07:00
Joost VandeVondele	c5de4080db	Introduce assert for stats update Make sure updates to the stats are done in a stable way. No functional change Closes #1038 Closes #1037	2017-03-25 17:57:07 -07:00
Marco Costalba	d9dd520896	Fix previous patch for OS X (#961 ) Use posix version of sed that is available on all platforms. No functional change.	2017-01-09 15:37:09 +01:00
Joost VandeVondele	ba15781be8	New shell scripts for testing, used for travis CI (#957 ) Perform more complex verification and validation. - signature.sh : extract and optionally compare Bench/Signature/Node count. - perft.sh : verify perft counts for a number of positions. - instrumented.sh : run a few commands or uci sequences through valgrind/sanitizer instrumented binaries. - reprosearch.sh : verify reproducibility of search. These script can be used from directly from the command line in the src directory. Update travis script to use these shell scripts. No functional change.	2017-01-09 10:30:57 +01:00

36 commits