The World of Vampus - a cave consisting of halls connected by passages

Lecture



The world of the vampus is a cave consisting of halls connected by passages. Somewhere in this cave a vampus is roaming - a terrible beast that eats everyone who enters the hall where it is located. A vampus can be killed by an agent, but the agent has only one arrow. In some halls there are bottomless pits into which all who pass through these halls fall (all except the vampus, which is too large to fail in them). The only thing that comforts those living in this environment is the opportunity to find a bunch of gold. Although the world of the vampus is quite modest in terms of modern computer gaming standards, it represents an excellent testing environment for intelligent agents. The first to suggest using the vampus world for such a purpose was Michael Geneseret.

  • Performance Indicators. For the fact that the agent finds gold, he gets +1000 points, for the fact that the agent hits the pit or eats him vampus, he is assigned -1000 points, he gives -1 point for each action performed and -10 points - for using an arrow .
  • Wednesday. The halls are arranged in a 4x4 grid. The agent always starts from the square marked [1,1] and looks to the right. The locations of gold and vampus are chosen randomly with a uniform distribution from among the squares other than the initial square. In addition, in each square, different from the initial, with a probability of 0.2 there may be a hole.

  The World of Vampus - a cave consisting of halls connected by passages
An example of the world of vampus. The agent is in the lower left corner.

  • Executive mechanisms. The agent can move forward, turn left and right by 9 0 °. An agent dies a pitiful death if he enters a square where there is a pit or a living vampus. (It’s safe to enter a square with a dead vampus, but you also have to smell a bad smell.) The attempt to move forward remains unsuccessful if there is a wall in front of the agent. To grab an object that is in the same square where the agent is located, you can use the Grab action. Using the Shoot action, you can shoot an arrow in a straight line in the direction the agent is looking. The arrow continues to move until it either hits the vampus (and kills it), or hits the wall. The agent has only one arrow, so only the first action of Shoot has any effect.
  • Sensors. The agent has five sensors, each of which reports only one information item:
  • in the square where the vampus is located, and also in the squares directly (and not diagonally) adjacent to this square, the agent feels an unpleasant smell (perception of Stench);
  • in the squares immediately adjacent to the pit, the agent will feel the breeze (perception of Breeze);
  • in the square where the gold is, the agent sees the brilliance (perception of the Glit ter);
  • when the agent hits the wall, he feels a blow (perception bump).
  • before a vampus struck by an arrow dies, it emits a mournful cry that spreads throughout the cave (perception of the Scream).

The results of acts of perception are transmitted to the agent in the form of a list of five characters; for example, if there is an unpleasant smell and breeze, but there is no shine, blow or cry, the agent perceives the results of the act of perception [Stench, Breeze, None, None, None].

It is proposed to determine the vampus environment in accordance with various measurements. The main difficulty for this agent is that from the very beginning he does not know the configuration of his environment; apparently, to overcome this ignorance, it is impossible to do without logical reasoning. In most instances of the vampus world for an agent, it is possible to safely receive gold. But sometimes an agent has to choose: return home empty-handed or risk his life to find gold. About 21% of the variations of the environment are completely unfavorable, since gold is in a pit or surrounded by pits.

We trace the agent for the world of vampus, acting on the basis of knowledge, which studies the environment shown in the figure. The initial knowledge base of the agent contains the rules of existence in this environment, which were described above; in particular, the agent knows what is in the square [1,1] and that the square [1,1] is safe. We will see how the knowledge of the agent expands as the results of new acts of perception arrive and actions are performed.

The first perception is [None, None, None, None, None], on the basis of which the agent can conclude that the adjacent squares are safe. The figure, a, shows the state of knowledge of the agent at that moment. We will enumerate (some) statements from the knowledge base using letter symbols such as B (a breeze is felt) and OK (safely, neither pits nor vampus) in the corresponding squares. On the other hand, the figure depicts this world itself.

  The World of Vampus - a cave consisting of halls connected by passages
The first step performed by an agent in the vampus world: the initial situation that arose after perception / "None, None, None, None, None 7 (a); the situation after one move, in which perception was obtained /" None, Breeze, None, None, None] F)

Based on the fact that there was no unpleasant odor or breeze in the square [1,1], the agent can conclude that the squares [1,2] and [2,1] are free from danger. To indicate this, they are labeled OK. A cautious agent goes only into such a square, which is known to have an OK mark in it. Suppose that the agent decided to move forward, to the square [2,1], and the scene shown in the figure was created.

The agent detects a breeze in the square [2,1], so there must be a hole in one of the neighboring squares. According to the rules of the game, the pit cannot be in the square [1,1], therefore it must be in the square [2,2], or [3,1], or in both. Designation P? in the figure, b indicates the possibility of the presence of pits in these squares. At the moment, only one square is known with an OK mark, which has not yet been visited. Therefore, a prudent agent turns around and returns to the square [1,1], and then goes to the square [1,2].

The new perception in the square [1,2] is [Stench, None, None, None, None], which leads to the state of knowledge shown in the figure. The presence of an unpleasant odor in the square [1,2] means that there is a vampus somewhere nearby. But a vampus cannot be in the square [1,1] according to the rules of the game and cannot be in the square [2,2] (since the agent would detect an unpleasant smell being in the square [2,1]). Therefore, the agent can conclude that the vampus is in the square [1,3]. This is indicated by the designation W !. In addition, the lack of perception of Breeze in the square [1,2] implies that there is no hole in the square [2.2]. Nevertheless, we have already concluded that the pit should be in the square [2.2] or [3, 1], which means that it is indeed in the square [3,1]. This is a very complex logical conclusion, since it combines knowledge obtained at different times in different places, and it makes the decision to perform an important step based on the lack of a certain perception. Such a logical conclusion is superior to the abilities of most animals, but is typical of this kind of reasoning performed by logical agents.

  The World of Vampus - a cave consisting of halls connected by passages
The last two stages during the agent's activity: after the third move, when the perception was received / "Stench, None, None, None, None] (a); after the fifth move, when the perception was received /" Stench, Breeze, Glitter, None, None] F)

Now the agent has proved to himself that in the square [2,2] there is neither a pit nor a vampus, therefore, he can designate this square with the OK mark in order to go into it. We do not show the state of knowledge of the agent in the square [2, 2], but simply assume that the agent turned and went into the square [2, 3], as a result of which the state shown in the figure was obtained. In the square [2,3] the agent detects brilliance, so he must grab the gold and thereby end the game.

In each case, when the agent draws a conclusion from the information available to it, the correctness of this conclusion is guaranteed, if the information available is correct. This is the fundamental property of logical reasoning. The remainder of this chapter will show you how to create logical agents that can provide the necessary information and draw the conclusions described in the previous paragraphs.

created: 2014-09-23
updated: 2021-03-13
132765



Rating 9 of 10. count vote: 2
Are you satisfied?:



Comments


To leave a comment
If you have any suggestion, idea, thanks or comment, feel free to write. We really value feedback and are glad to hear your opinion.
To reply

Miscellaneous

Terms: Miscellaneous