-
Notifications
You must be signed in to change notification settings - Fork 0
/
Gomoku AI protocol.html
315 lines (284 loc) · 18.4 KB
/
Gomoku AI protocol.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
<!DOCTYPE html>
<html lang="en">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="description" content="New protocol for gomoku artificial intelligences which use pipes to communicate with gomoku tournament manager.">
<meta name="keywords" content="Gomocup,Gomoku,Renju,Piskvork,Gomoku AI" />
<meta name="author" content="Martin Petricek and Petr Lastovicka">
<title>Gomoku AI protocol</title>
<link rel="stylesheet" type="text/css" href="styl.css">
</head>
<body>
<h1>Gomoku AI protocol</h1>
<p class="center">
<a href="https://gomocup.org/detail-information/"><span class=bigbutton>Back to information</span></a>
</p><br>
<h2>Introduction</h2>
This document describes communication between a brain (an artificial intelligence) and a go-moku manager (a program which manages and controls a tournament). The manager creates two pipes. The first pipe is used to send commands from the manager to the brain. The second pipe is used to send replies from the brain to the manager. The brain uses standard input-output functions (scanf and printf in C, readln and
writeln in Pascal,...) therefore it can be written in any programming language. The brain must be a console application, not Windows GUI. Be careful, some run-time libraries buffer the console output. For example, it is necessary to call fflush(stdout) after printf in C programs. You can also use low-level functions ReadFile and WriteFile.
<p>
Each line contains exactly one command (there is only one exception). The manager puts bytes CR LF (0x0d, 0x0a) at the end of lines. The brain can send lines that are ended with CR LF, or just LF, or CR. The manager ignores lines that are empty. It must not crash if a line is too long, but it can silently cut very long lines.
<p>
If the brain has only one thread, it is very important not to read from input when the brain is required to think or respond to a command. It would lead to deadlock (both brain and manager are waiting). Manager will terminate the brain in this situation after the time for a turn is out. The brain can have two threads to avoid that problem. The first thread reads commands from input and the second thread thinks and writes responds to output. It is necessary to use some synchronization objects (events, locks or semaphores). The number of threads is not important for a tournament. One thread is sufficient for a tournament. But two threads are useful for human players. For example, someone may want to change time limits while the brain already started to think. The thinking can also be easily canceled at any time without terminating the brain.
<p>
The brain is required to process commands START, BEGIN, INFO, BOARD, TURN, END. The brain can ignore INFO commands which are not needed. The reply for every other command is UNKNOWN (Due to backward compatibility and possibility to extend the protocol).
</p>
<h2>Brain's name and temporary files</h2>
The name of the brain can contain only characters A-Z, a-z, 0-9, dash, underscore, dot. The name is required to begin with prefix "pbrain-". In the other case the brain is considered to use communication <a href="protocl1en.htm">via files (old method)</a>.
<pre>
Example:
pbrain-swine.exe
pbrain-pisq5.exe
</pre>
Only the executable file is required to begin with prefix "pbrain-". The name of ZIP file does not have this prefix. There can be both 32-bit and 64-bit exe in the ZIP file, the 64-bit exe file name must have substring 64. For example, MyGomo.ZIP can contain files pbrain-MyGomo.exe and pbrain-MyGomo64.exe. The manager launches pbrain-MyGomo.exe on 32-bit OS and pbrain-MyGomo64.exe on 64-bit OS.
<p>
Working directory is set by the manager. It doesn't have to be the directory where the brain's executable file is saved. The brain must specify full path to all data files which it uses. It can obtain the path from function GetModuleFileName or it can look at the beginning of its command line which can be discovered from the main function parameters. The manager must put name of brain's exe file on the command line in such a form so that the brain can open the file.
<p>
The brain can create a folder in the current directory to store its temporary files. The name of the folder must be the same as the name of the brain. The maximal allowed size of the folder will be announced on Gomocup web page (it is now 20MB). The manager can delete all temporary files when the manager exits or after a tournament finished. Command INFO folder is used to determine folder where persistent files can be saved.
</p>
<h2>Mandatory commands</h2>
<h3>START [size]</h3>
When the brain receives this command, it initializes itself and creates an empty board, but doesn't make any move yet. The parameter is size of the board. The brain must be able to play on board of size 20, because this size will be used in Gomocup tournaments. It is recommended but not required to support other board sizes. If the brain doesn't like the size, it responds ERROR. There can be a message after the ERROR word. The manager can try other sizes or it can display an error message to a user. The brain responds OK if it has been initialized successfully.
<pre>
Example:
The manager sends:
START 20
The brain answers:
OK - everything is good
ERROR message - unsupported size or other error
</pre>
<h3>TURN [X],[Y]</h3>
The parameters are coordinate of the opponent's move. All coordinates are numbered from zero.
<pre>Expected answer:
two comma-separated numbers - coordinates of the brain's move
Example:
The manager sends:
TURN 10,10
The brain answers:
11,10
</pre>
<h3>BEGIN</h3>
This command is send by the manager to one of the players (brains) at the beginning of a match. This means that the brain is expected to play (open the match) on the empty playing board. After that the other brain obtains the TURN command with the first opponent's move. The BEGIN command is not used when automatic openings are enabled, because in that case both brains receive BOARD commands instead.
<pre>
Expected answer:
two numbers separated by comma - coordinates of the brain's move
Example:
The manager sends:
BEGIN
The brain answers:
10,10
</pre>
<h3>BOARD</h3>
This command imposes entirely new playing field. It is suitable for continuation of an opened match or for undo/redo user commands. The BOARD command is usually send after START, RESTART or RECTSTART command when the board is empty. If there is any open match, the manager sends RESTART command before the BOARD command.
<p>
After this command the data forming the playing field are send. Every line is in the form:
<pre>
[X],[Y],[field]
</pre>
where [X] and [Y] are coordinates and [field] is either number 1 (own stone) or number 2 (opponent's stone) or number 3 (only if continuous game is enabled, stone is part of winning line or is forbidden according to renju rules).
<p>
If game rule is renju, then the manager must send these lines in the same order as moves were made. If game rule is Gomoku, then the manager may send moves in any order and the brain must somehow cope with it. Data are ended by <b>DONE</b> command. Then the brain is expected to answer such as to TURN or BEGIN command.
<pre>
Example:
The manager sends:
BOARD
10,10,1
10,11,2
11,11,1
9,10,2
DONE
The brain answers:
9,9
</pre>
<h3>INFO [key] [value]</h3>
The manager sends information to the brain. The brain can ignore it. However, the brain will lose if it exceeds the limits. The brain must cope with situations when the manager doesn't send all information which is mentioned in this document. Most of this information is sent at the beginning of a match. The time limits will not be changed in the middle of a match during a tournament. It is recommended to react on commands at any time, because the human opponent can change these values even when the brain is thinking.
<pre>
The key can be:
timeout_turn - <span>time limit for each move (milliseconds, 0=play as fast as possible)</span>
timeout_match - <span>time limit of a whole match (milliseconds, 0=no limit)</span>
max_memory - <span>memory limit (bytes, 0=no limit)</span>
time_left - <span>remaining time limit of a whole match (milliseconds)</span>
game_type - <span>0=opponent is human, 1=opponent is brain, 2=tournament, 3=network tournament</span>
rule - <span>bitmask or sum of 1=exactly five in a row win, 2=continuous game, 4=renju, 8=caro</span>
evaluate - <span>coordinates X,Y representing current position of the mouse cursor</span>
folder - <span>folder for persistent files</span>
</pre>
Information about time and memory limits is sent before the first move (after or before START command). Info time_left is sent before every move (before commands TURN, BEGIN, BOARD and SWAP2BOARD). The remaining time can be negative when the brain runs out of time. Remaining time is equal to 2147483647 if the time for a whole match is unlimited. The manager is required to send info time_left if the time is limited, so that the brain can ignore info timeout_match and only rely on info time_left.
<p>
Time for a match is measured from creating a process to the end of a game (but not during opponent's turn). Time for a turn includes processing of all commands except initialization (commands START, RECTSTART, RESTART). Turn limit equal to zero means that the brain should play as fast as possible (eg count only a static evaluation and don't search possible moves).
<p>
INFO folder is used to determine a folder for files that are permanent. Because this folder is common for all brains and maybe other applications, the brain must create its own subfolder which name must be the same as the name of the brain. If the manager does not send INFO folder, then the brain cannot store permanent files.
<p>
Only debug versions should respond to INFO evaluate. For example, it can print evaluation of the square to some window. It cannot be written to the standard output. Release versions should just ignore INFO evaluate.
<p>
How should the brain behave when obtains unknown INFO command ?<br>
- Ignore it, it is probably not important. If it was important, it is not in an
INFO command form.
<p>
How should behave the brain obtaining the unachievable INFO command?<br>
(for example too small memory limit)<br>
- The brain should wait with the output of the problem until the manager sends the first command not having an INFO form (TURN, BOARD or BEGIN). The manager does not read messages from the brain when sending INFO command.
<pre>
Example:
INFO timeout_match 300000
INFO timeout_turn 10000
INFO max_memory 83886080
Expected answer: none
</pre>
<h3>END</h3>
When the brain obtains this command, it must terminate as soon as possible. The manager waits until the brain is finished. If the time of termination is too long (e.g. 1 second), the brain will be terminated by the manager. The brain should not write anything to output after the END command. However, the manager should not close the pipe until the brain is ended.
<pre>
Expected answer: none
The brain should delete its temporary files.
</pre>
<h3>ABOUT</h3>
The brain is expected to send some information about itself on one line. Each info must be written as keyword, equals sign, text value in quotation marks. Recommended keywords are name, version, author, country, www, email. Values should be separated by commas that can be followed by spaces. The manager can use this info, but must cope with old brains that used to send only human-readable text.
<pre>
Example:
The manager sends:
ABOUT
The brain answers:
name="SomeBrain", version="1.0", author="Nymand", country="USA"
</pre>
<h2>Optional commands</h2>
The extended commands published in this section are not required to be
implemented in Gomocup tournament, however it can be useful especially for human players.
<h3>RECTSTART [width],[height]</h3>
This command is similar to START, but the board is rectangular. Parameters are two comma-separated numbers. Width corresponds to coordinate X, height belongs to coordinate Y. The manager must use START command if the board is squared.
<pre>
Example:
The manager sends:
RECTSTART 30,20
The brain answers:
OK - parameters are good
ERROR message - rectangular board is not supported or other error
</pre>
<h3>RESTART</h3>
This command is used after the match is finished or aborted. This command has no parameters. The board size remains unchanged. The brain releases previous board and other structures, creates a new empty board and prepares itself for a new match. Then the brain answers OK and further communication continues as after START command. If the brain answers UNKNOWN, the manager sends END and executes the brain again.
<pre>
Example:
The manager sends:
RESTART
The brain answers:
OK
</pre>
<h3>TAKEBACK [X],[Y]</h3>
This command is used to undo last move. The brain removes stone which has coordinate [X,Y] and then answers OK.
<pre>
Example:
The manager sends:
TAKEBACK 9,10
The brain answers:
OK
</pre>
<h3>PLAY [X],[Y]</h3>
It is used by the manager as a respond to SUGGEST command only. It imposes move [X],[Y] to the brain.
<br>Expected answer:
two comma-separated numbers which should be the same as PLAY parameters. If the brain doesn't like the coordinates sent by the manager, it can answer some other coordinates (but it is not recommended).
<pre>
Example:
The brain has sent:
SUGGEST 10,10
The manager sends:
PLAY 12,10
The brain moves onto 12,10 and answers:
12,10
</pre>
<h3>SWAP2BOARD</h3>
This command is used to deal with the opening stage of <b>Swap2</b> rule. It is sent once or twice to the AI between command START and command BOARD. Specifically, it has three cases, and we show examples for each of them.
<ul>
<li>Case 1. The manager asks for the first three stones.
<pre>
The manager sends:
SWAP2BOARD
DONE
The AI answers:
7,7 8,7 9,9
</pre>
<li>Case 2. The manager sends the coordinates of the first three stones and asks for the choice of options.
<pre>
The manager sends:
SWAP2BOARD
7,7
8,7
9,9
DONE
The AI answers:
SWAP - <span>if the AI decides to swap (option 1)</span>
8,8 - <span>output the coordinate of the 4th move if the AI decides to stay with its color (option 2)</span>
8,8 8,6 - <span>output the coordinates of the 4th and 5th stones if the AI decides to put two stones and let the opponent choose the color (option 3)</span>
</pre>
<li>Case 3. The manager sends the coordinates of the first five stones and asks for the choice of options.
<pre>
The manager sends:
SWAP2BOARD
7,7
8,7
9,9
8,8
8,6
DONE
The AI answers:
SWAP - <span>if the AI decides to swap (option 1)</span>
6,8 - <span>output the coordinate of the 6th move if the AI decides to stay with its color (option 2)</span>
</pre>
<li>After the opening stage, the stones on the board will be treated as an opening for a standard match. For example, following the above example in Case 3, assuming the AI chooses option 2, the manager will send the following messages to the other AI:
<pre>
BOARD
7,7,1
8,7,2
9,9,1
8,8,2
8,6,1
6,8,2
DONE
</pre>
<li>As another example, following Case 2's example, assuming the AI chooses option 1, the manager will send the following messages to the other AI:
<pre>
BOARD
7,7,2
8,7,1
9,9,2
DONE
</pre>
</ul>
<h2>Commands that are sent by the brain</h2>
The brain can use these commands if it needs them. The manager is required to know them.
<h3>UNKNOWN [error message]</h3>
The brain sends this as a respond to a command that is unknown or not yet implemented. That means the brain must not exit after receiving some strange line from the manager. The parameter after UNKNOWN keyword is message that can be displayed by the manager to a user. If the manager has sent some optional command which the brain does not implement, then the manager is required to try some mandatory command.
<h3>ERROR [error message]</h3>
The brain sends this when it receives some known command, but is not able to cope with it. For example the memory limit is too small or the board is too large. Parameter is a short message that the manager writes to a log window or to a log file. The manager can also change some game options and try action again.
<h3>MESSAGE [message]</h3>
The message for a user. The manager can write it to some log window or to a log file. The brain is expected to send messages just before respond to some command. There must not be a new line character inside a message. Multi-line text can be sent as a sequence of two or more MESSAGE commands.
<p>
It is recommended to send only English messages. If the brain chooses another language, it should detect code page that is used on the PC (Win32 function GetACP()) and should not send messages that cannot be displayed in that code page.
</p>
<h3>DEBUG [message]</h3>
It is similar to MESSAGE command, but it is used for debugging information that is useful only for the author of the brain. These messages will not be visible to public in Gomocup tournament.
<pre>
Example:
The manager sends:
TURN 10,15
The brain answers:
DEBUG The most promising move now is [10,14] alfa=10025 beta=8641
DEBUG The most promising move now is [11,14] alfa=10125 beta=8641
MESSAGE I will be the winner
10,16
</pre>
<h3>SUGGEST [X],[Y]</h3>
The brain can answer SUGGEST [X],[Y] instead of [X],[Y] and does not change it's internal state. The manager has a possibility to ignore brain's suggestion and force another move to the brain. Expected manager's answer is PLAY or END. In Gomocup tournament the manager always answers the move the brain has sent in the SUGGEST command. Most brains do not use SUGGEST command.
<h2>Version history</h2>
<ul>
<li>2023-03-20<br>INFO rule 8 - Caro</li>
<li>2022-12-20<br>SWAP2BOARD command</li>
<li>2016-02-07<br>coordinates in BOARD command must be in correct order if rule is renju<br>both 64-bit and 32-bit exe can be in ZIP
<li>2016-02-02<br>INFO rule 4 - renju
<li>2006-03-11<br>INFO rule 2 - option continuous game, also added value 3 to the BOARD command
<li>2005-12-19<br>INFO folder - for permanent files<br>ABOUT - format changed to keyword="value"<br>INFO timeout_turn 0 means as fast as possible
<li>2005-06-26<br>TAKEBACK - used for undo
<li>2005-06-03<br>INFO rule - option exactly five in a row
<li>2005-05-19<br>INFO game_type
<li>2005-04-21<br>RECTSTART - rectangular board<br>RESTART - clears the board<br>BOARD - it is mandatory command
</ul>
</body>
</html>