Patch from Richard Lloyd (slightly edited by me); the list
of changes, as described by the author, follows:
src/Makefile:
- Added PREFIX and BINDIR for the install: rule.
- Added a "make hpux" line to the help: rule.
- Added "make test"/"make check" rule that runs the $(PGOBENCH) command.
- "make clean" now additionally removes core and bench.txt.
- Added an hpux: rule.
- Added an install: rule to mkdir $(BINDIR), copy $(EXE) to $(BINDIR) and
then strip it.
- "make strip" now ensures that $(EXE) is built first before trying to
strip it.
- Hide errors and output from the g++ command used by the .depend: rule and
then touch .depend in case g++ isn't available.
- Hide errors from the "include .depend" in case .depend doesn't exist
(e.g. directly after a "make clean").
src/book.cpp and src/book.h:
- HP-UX's aCC really didn't like the const keywords used for the
Book::file_name() definitions, so they were removed. I checked that this
didn't affect a Linux build and it was still fine.
src/misc.cpp:
- HP-UX uses <sys/pstat.h> and pstat_getdynamic() to determine the number of
CPU cores, so added conditional code for that (if pstat_getdynamic() fails,
set the number of cores to 1).
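A minimal sketch of the kind of conditional code meant here, based on
HP-UX's documented pstat interface (the cpu_count() wrapper name is
illustrative):

  #if defined(__hpux)
  #include <sys/pstat.h>

  // Number of active CPU cores, falling back to 1 if the call fails
  static int cpu_count() {

      struct pst_dynamic psd;
      if (pstat_getdynamic(&psd, sizeof(psd), 1, 0) == -1)
          return 1;

      return (int)psd.psd_proc_cnt;
  }
  #endif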
src/tt.cpp:
- <xmmintrin.h> and _mm_prefetch() seem highly specific to the Intel x86(_64)
and gcc platforms - neither exists on HP-UX, so that code is conditionally
avoided in HP-UX's case. Perhaps some sort of define is needed here instead,
such as -DHAS_MM_PREFETCH, that could be #ifdef'ed for?
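For instance, along these lines (a sketch; HAS_MM_PREFETCH is the
hypothetical define suggested above, to be set by the build on capable
platforms):

  #if defined(HAS_MM_PREFETCH)
  #include <xmmintrin.h>
  #endif

  static void prefetch(char* addr) {

  #if defined(HAS_MM_PREFETCH)
      _mm_prefetch(addr, _MM_HINT_T2); // Emits prefetcht2
  #else
      (void)addr; // No-op where the intrinsic doesn't exist
  #endif
  }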
Even after these changes, it's more convenient for HP-UX users to edit the
default: rule in the Makefile to run "$(MAKE) hpux" before they build
Stockfish, but that's not a big deal if they're warned about it first (the
same applies to all builds other than the standard "$(MAKE) gcc" one).
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
We had an overflow due to using an integer for the hash size;
now we use a size_t, as we should, so we can raise the
limit.
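Schematically (a sketch; table_size_bytes() is an illustrative name,
and the fix assumes a 64-bit size_t):

  #include <cstddef>

  // With a 32-bit int, 4096 * 1024 * 1024 wraps around to 0.
  // Computing entirely in size_t avoids the overflow on 64-bit builds.
  std::size_t table_size_bytes(std::size_t mbSize) {

      return mbSize * 1024 * 1024;
  }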
No functional change.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
When we reach ply == PLY_MAX we exit the function
after writing
pv[PLY_MAX] = MOVE_NONE;
And because SearchStack is defined as:
struct SearchStack {
Move pv[PLY_MAX];
Move currentMove;
.....
We end up with the unwanted assignment
SearchStack.currentMove = MOVE_NONE;
Fortunately this is harmless because currentMove is not used where
extract_pv() is called. But nevertheless this is a bug that
needs to be fixed.
Thanks to Uri Blass for spotting this.
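A condensed illustration of the overflow, with the clamp that fixes it
(types reduced to the bare minimum):

  enum Move { MOVE_NONE = 0 };
  const int PLY_MAX = 100;

  struct SearchStack {
      Move pv[PLY_MAX];
      Move currentMove;
  };

  void terminate_pv(SearchStack* ss, int ply) {

      // With ply == PLY_MAX the write is one past the end of pv[]
      // and, given the layout above, lands on currentMove instead.
      if (ply < PLY_MAX) // The fix: never step outside the array
          ss->pv[ply] = MOVE_NONE;
  }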
No functional change.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
In particular, don't use an array of StateInfo: this
avoids a possible overflow and is in any case redundant.
Also pass the size of the pv[] array as an argument, to avoid a
second possible overflow on that one.
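The shape of the fix, sketched (next_tt_move() is a hypothetical
stand-in for the TT walk done by the real function):

  enum Move { MOVE_NONE = 0 };

  static bool next_tt_move(Move*) { return false; } // hypothetical stub

  // pvSize is the capacity of pv[]; the loop keeps one slot free so
  // that the terminating write below is always in bounds.
  void extract_pv(Move pv[], int pvSize) {

      int ply = 0;
      while (ply < pvSize - 1 && next_tt_move(&pv[ply]))
          ply++;

      pv[ply] = MOVE_NONE;
  }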
Fix suggested by Joona.
No functional change.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
When a node fails low and bestValue is still equal to
the node's original static evaluation, save this fact
in the TT along with the usual info.
This will allow us to avoid a costly future evaluation() call.
This patch extends to failed-low nodes what we already do
for failed-high ones.
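Schematically, at the end of a failed-low node (a sketch of the idea,
not the exact TT interface; VALUE_TYPE_EV_UP is an illustrative name
for the combined "upper bound + reusable evaluation" type):

  // bestValue never improved on the static evaluation, so flag the
  // stored value as reusable in place of a future evaluate() call.
  ValueType vt = (bestValue == staticValue ? VALUE_TYPE_EV_UP
                                           : VALUE_TYPE_UPPER);
  TT.store(posKey, bestValue, vt, depth, MOVE_NONE);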
No functional change.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
It was due to a missing -msse compiler option!
Without this option the CPU silently discards
prefetcht2 instructions during execution.
Also added a (gcc-documented) hack to prevent the Intel
compiler from optimizing away the prefetches.
Special thanks to Heinz for testing and suggesting
improvements, and to Jim for testing icc on Windows.
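The hack in question wraps the intrinsic so the compiler cannot see
through it; something along these lines (a sketch):

  #include <xmmintrin.h>

  void prefetch(char* addr) {

  #if defined(__INTEL_COMPILER)
      // An empty asm statement prevents the Intel compiler from
      // optimizing the prefetch away; gcc and MSVC don't need it.
      __asm__ ("");
  #endif
      _mm_prefetch(addr, _MM_HINT_T2);
  }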
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
But this time with the guarantee of an always-aligned
access, so that prefetching is not adversely impacted.
On Joona's PC,
1+0, 64 MB hash:
Orig - Mod: 174 - 237 - 359
Instead, after 1000 games at 1+0 with a 128 MB hash size
we are at +1 ELO (just 4 games of difference).
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
After fixing the CPU frequency with the RightMark tool I was
able to speed-test all the different prefetch combinations.
Here are the results:
OS: Windows Vista 32bit, MSVC compile
CPU: Intel Core 2 Duo T5220 1.55 GHz
bench on depth 12, 1 thread, 26552844 nodes searched
results in nodes/sec
no-prefetch
402486, 402005, 402767, 401439, 403060
single prefetch (aligned 64)
410145, 409159, 408078, 410443, 409652
double prefetch (aligned 64) 0+32
414739, 411238, 413937, 414641, 413834
double prefetch (aligned 64) 0+64
413537, 414337, 413537, 414842, 414240
And now also some crazy stuff:
single prefetch (aligned 128)
410145, 407395, 406230, 410050, 409949
double prefetch (aligned 64) 0+0
409753, 410044, 409456
single prefetch (aligned 64) +32
408379, 408272, 406809
single prefetch (aligned 64) +64
408279, 409059, 407395
So it seems the best is a double prefetch at the address +32 or +64;
I will choose the latter because it seems more natural to me.
It is still a mystery why it doesn't work under Linux :-(
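The winning variant, sketched (assuming the entry address is already
64-byte aligned, as in the runs above):

  #include <xmmintrin.h>

  // Double prefetch at +0 and +64: pulls in the cache line holding
  // the entry and the following one.
  void prefetch_tt(char* addr) {

      _mm_prefetch(addr, _MM_HINT_T2);
      _mm_prefetch(addr + 64, _MM_HINT_T2);
  }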
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
Always prefetch from a cache line boundary. It seems
that if the prefetch address is not cache line aligned then
performance is adversely impacted.
Hopefully we will reuse those 32 bits of padding for something
useful in the future.
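Rounding the address down to its cache line is a one-line mask; a
sketch, assuming 64-byte lines:

  #include <cstdint>

  // Round an address down to the start of its 64-byte cache line so
  // the prefetch always starts from a line boundary.
  char* cache_line_start(char* addr) {

      return (char*)((uintptr_t)addr & ~(uintptr_t)63);
  }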
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
This fixes a compile error under Linux with gcc when
the Intel dev libraries are not present.
Also simplify the previous patch by moving the TT definition
from search.cpp to tt.cpp, so as to avoid passing a
pointer to TT to the current position.
Finally simplify do_move(): we now miss a prefetch in the
rare case of setting an en-passant square, but the code is
much cleaner and the performance penalty is almost zero.
No functional change.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
Move the prefetching code inside do_move() so as to allow
very early prefetching and to put as many instructions
as possible between the prefetch and the following retrieve().
With this patch retrieve() times are cut by another 25%.
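Schematically, inside do_move() (field and method names are
illustrative):

  // As soon as the new position key is known, start pulling the TT
  // cluster into cache; the rest of do_move() then executes while
  // the memory access is in flight.
  st->key = compute_new_key(m);
  TT.prefetch(st->key);
  // ... many more instructions run before anyone calls retrieve() ...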
No functional change.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
TT.retrieve() is the most time-consuming function
because it almost always involves a very slow RAM access.
The TT table is so big that it is never cached. This patch
prefetches TT data just after a move is done, so that
the subsequent TT.retrieve() will be very fast.
Profiling with VTune shows that TT.retrieve() times are
almost cut in half!
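A sketch of the idea, with illustrative names (first_entry() maps a
key to the address of its cluster):

  void TranspositionTable::prefetch(const Key posKey) const {

      // Touch the cache line of the cluster this key hashes to, so
      // the later retrieve(posKey) finds it already in cache.
      _mm_prefetch((char*)first_entry(posKey), _MM_HINT_T2);
  }

  // At the call-site, right after the move is made:
  pos.do_move(m, st);
  TT.prefetch(pos.get_key());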
No functional change.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
Shrink the key to 32 bits instead of 64. To still avoid
collisions, use the high 32 bits of the position key as the
TT key and the low 32 bits to compute the correct
cluster index in the table.
With this patch the size of TTEntry shrinks to 96 bits instead
of 128, and a cluster of 4 TTEntry sums to 48 bytes instead
of 64.
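Sketched, the key split looks like this (layout reduced to the
essentials; 'size' is the number of clusters, a power of two):

  #include <cstddef>
  #include <cstdint>

  typedef uint64_t Key;

  // The high 32 bits identify the position inside the entry...
  uint32_t tt_key(Key posKey) { return uint32_t(posKey >> 32); }

  // ...while the low 32 bits pick the cluster in the table.
  std::size_t cluster_index(Key posKey, std::size_t size) {

      return std::size_t(uint32_t(posKey)) & (size - 1);
  }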
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
Previously we used move_is_legal() to verify that the move from the TT
was legal, but the old version of move_is_legal() only works when
the side to move is not in check. Fixed this by adding a separate,
slower version of move_is_legal() which works even when the side to
move is in check.
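The slow path can simply fall back on full move generation; a sketch
of the shape of such a function (the generation helpers here are
illustrative, not the engine's exact API):

  bool move_is_legal(const Position& pos, Move m) {

      if (!pos.is_check())
          return fast_move_is_legal(pos, m); // Old, fast path

      // In check: generate every legal evasion and test membership.
      MoveStack mlist[256];
      MoveStack* last = generate_evasions(pos, mlist);
      for (MoveStack* cur = mlist; cur != last; cur++)
          if (cur->move == m)
              return true;

      return false;
  }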
Centralize all global resource management in a single
object and avoid a bunch of scattered exit() calls.
This is cleaner and more reliable, and sticks closer to C++
coding practices.
No functional change.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
Use just the key instead of a position, because the key is
all that we need.
The interface is clearer and also a tiny bit faster.
No functional change.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
Max hash size is 4096 MB, not 1024 MB; see the corresponding
"Hash" UCI parameter in ucioption.cpp.
No functional change.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
Strangely enough, it seems that this optimization doesn't work.
After 760 games at 1+0: +155 -184 =421, -13 ELO.
Probably the overhead, although small, of setting the flag
is not compensated by the saved evaluation calls.
This could be because, once a TT value is stored, if and when
we hit the position again the stored TT value is actually used
as a cut-off, so that we don't need to go on with another
search and the evaluation is avoided in any case.
No functional change.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
If we want to store a value of type VALUE_TYPE_EVAL for
a given position and we find an already existing entry
for the same position, then we skip the store.
We don't want to overwrite a more valuable score with a
lesser one. Note that even when the existing entry is
of VALUE_TYPE_EVAL type, the overwrite is useless because
we would store the same score again.
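In code terms, at the top of TT.store(), schematically (member names
are illustrative):

  // Storing an eval-only entry must never clobber an entry holding
  // real search information for the same position.
  if (valueType == VALUE_TYPE_EVAL && tte->key() == key32)
      return; // Keep the existing, at least as valuable, entry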
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
When the stored TT value equals the static value, set a
proper flag so that if we hit the same position again we use
the stored TT value instead of calling evaluation().
This is another trick to avoid calling the costly evaluation()
in qsearch.
No functional change.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
Fix a bug in the way a move is stored into and read from a TT entry.
We were using a mask of 19 bits instead of 17, so that the last
two bits in the TT entry ended up holding random data.
This bug would have bitten us once we started using these two
until now unused bits.
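The difference is one mask constant; a sketch of the decode (17 bits
hold a move, so 0x1FFFF is right, while 0x7FFFF lets two foreign bits
leak in):

  #include <cstdint>

  enum Move { MOVE_NONE = 0 };

  Move tt_entry_move(uint32_t data) {

      // 0x7FFFF (19 bits) would also pick up the two bits above the
      // move field; 0x1FFFF (17 bits) extracts exactly the move.
      return Move(data & 0x1FFFF);
  }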
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
Instead of just dropping the evaluation score after the stand-pat
logic, save it in the TT so it can be reused if the same position
occurs again.
Note that we NEVER use the cached value except to avoid an
evaluation call; in particular we never return to the caller
after a successful TT hit.
To accommodate this, a new value type VALUE_TYPE_EVAL has been
introduced, for which ok_to_use_TT() always returns false.
With this patch we cut about 15% of total evaluation calls.
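Schematically, in qsearch() after the stand-pat computation (a
sketch; the store arguments follow no exact signature):

  // Instead of throwing staticValue away, cache it. VALUE_TYPE_EVAL
  // entries never produce cut-offs, because ok_to_use_TT() returns
  // false for them; they only spare the evaluate() call on a revisit.
  TT.store(pos.get_key(), staticValue, VALUE_TYPE_EVAL, depth, MOVE_NONE);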
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
We implicitly assumed the minimum depth stored in the TT
to be Depth(0), but because we also store values in the TT
from qsearch(), where depth < 0, we need to use a negative
number as the minimum depth.
Bug spotted by Joona Kiiski.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
Note that some pawn and material info has been switched
from int8_t to int.
This wastes space, but it is not clear whether this makes the
code faster or slower (or changes nothing); some testing is
needed.
A few warnings are still alive.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
We don't back up anymore, but use the renamed StateInfo
argument passed to do_move() to store the new position
state when doing a move.
Undoing is now just reverting to the previous StateInfo, which
we know because we store a pointer to it.
Note that the backing store is now up to the caller; Position is
stateless in that regard, and state is accessed through a pointer.
This patch will let us remove all the backup/restore copying:
just a pointer switch is now necessary.
Note that do_null_move() still uses StateInfo as a backup.
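The resulting shape, sketched with reduced types (the previous
pointer is the whole trick):

  enum Move { MOVE_NONE = 0 };

  struct StateInfo {
      // ... per-move data (castle rights, ep square, key, ...) ...
      StateInfo* previous;
  };

  class Position {
      StateInfo* st; // Points into the caller-owned chain

  public:
      void do_move(Move m, StateInfo& newSt) {
          newSt.previous = st; // Link, don't copy the old state
          st = &newSt;
          // ... fill *st incrementally while making move m ...
      }

      void undo_move(Move m) {
          st = st->previous;   // Reverting is just a pointer switch
          // ... take back the board changes of move m ...
      }
  };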
No functional change.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>
Make the logic work as advertised in the function
description.
This is more fallout from the TT cleanup.
It should be less serious than the one in retrieve(),
but it's still a bug.
Signed-off-by: Marco Costalba <mcostalba@gmail.com>