COMP3702 Tutorial 3 Notes

Mutable vs Immutable Datatypes

Very simply:
- The value of mutable datatypes or objects can be changed after it is created, e.g. lists, dictionaries, sets and classes.
- The value of immutable datatypes or objects cannot be changed after it is created, e.g. strings

Exercise 3.1a - State Graph Representation

We begin by developing a state graph representation for our search problem - like a node in a search tree.

class GridWorld():
    ACTIONS = ['U', 'D', 'L', 'R']

    def __init__(self):
        self.n_rows = 9
        self.n_cols = 9

        # (row, column) where indexing is top to bottom, left to right (like a matrix)
        start = (8, 0) #bottom, left
        goal  = (0, 8)

        self.obstacles = [[0, 0, 0, 0, 0, 0, 0, 0, 0],
                          [0, 0, 0, 0, 0, 0, 0, 0, 0],
                          [0, 0, 1, 1, 1, 1, 1, 0, 0],
                          [0, 0, 0, 0, 0, 0, 1, 0, 0],
                          [0, 0, 0, 0, 0, 0, 1, 0, 0],
                          [0, 0, 0, 0, 0, 0, 1, 0, 0],
                          [0, 0, 0, 0, 0, 0, 1, 0, 0],
                          [0, 0, 0, 1, 1, 1, 1, 0, 0],
                          [0, 0, 0, 0, 0, 0, 0, 0, 0]]

        self.costs = [[1, 1,  1,  5,  5,  5,  5, 1, 1],
                      [1, 1,  1,  5,  5,  5,  5, 1, 1],
                      [1, 1, 10, 10, 10, 10, 10, 1, 1],
                      [1, 1,  1, 10, 10, 10, 10, 1, 1],
                      [1, 1,  1,  1,  1, 10, 10, 1, 1],
                      [1, 1,  1,  1,  1, 10, 10, 1, 1],
                      [1, 1,  1,  1, 10, 10, 10, 1, 1],
                      [1, 1,  1, 10, 10, 10, 10, 1, 1],
                      [1, 1,  1,  1,  1,  1,  1, 1, 1]]

Firstly, observe that we create a list of all valid action strings.
We then work on the constructor for the state.
- To make our life easier later, we save member variables that describe the size of the board.
- We next define the search area using a list-of-lists so that we can perform double indexing into it.
We then develop the transition function (in this case, called the step function).
- We first check if the actions are valid, and generate the new position.
- Given the new position, we want to check whether the new position is "in" an obstacle.
- If a collision occurs, we return:
  - Success = False
  - new state = original state
  - cost = cost of action in the original state.
- Otherwise, we return:
  - Success = True
  - new state = new state that we have moved to
  - cost = cost of action in new state.

def step(self, state, action):
    """
    :param state: (row, col) tuple
    :param action: 'U', 'D', 'L' or 'R'
    :return: (success [True/False], new state, action cost)
    """

    r, c = state

    if action == 'U':
        new_r = r - 1
        new_c = c
    elif action == 'D':
        new_r = r + 1
        new_c = c
    elif action == 'L':
        new_r = r
        new_c = c - 1
    elif action == 'R':
        new_r = r
        new_c = c + 1
    else:
        assert False, '!!! invalid action !!!'

    if (not (0 <= new_r < 9)) or (not (0 <= new_c < 9)) or self.obstacles[new_r][new_c] == 1:
        # collision occurs
        return False, (r, c), self.costs[r][c]
    else:
        return True, (new_r, new_c), self.costs[new_r][new_c]

3.1b - Breadth First Search

Before we implement Breadth-First Search, we first want to create a node like we did last week, that we can use to construct our search tree (and perform search operations on).

class StateNode:
    def __init__(self, env, state, actions, path_cost):
        self.env = env
        self.state = state
        self.actions = actions
        self.path_cost = path_cost

    def get_successors(self):
        successors = []
        for a in GridWorldEnv.ACTIONS:
            success, new_state, a_cost = self.env.step(self.state, a)
            if success:
                successors.append(StateNode(self.env,
                                            new_state,
                                            self.actions + [a],
                                            self.path_cost + self.env.get_state_cost(new_state)))
        return successors

Breadth-First Search Implementation

def bfs(env, verbose=True):
    container = [StateNode(env, env.init_state, [], 0)]
    visited = set()
    n_expanded = 0
    while len(container) > 0:
        # expand node
        node = container.pop(0)
        # test for goal
        if env.is_goal(node.state):
            if verbose:
                print(f'Visited Nodes: {len(visited)},\t\tExpanded Nodes: {n_expanded},\t\t'
                      f'Nodes in Container: {len(container)}')
                print(f'Cost of Path (with Costly Moves): {node.path_cost}')
            return node.actions
        # add successors
        successors = node.get_successors()
        for s in successors:
            if s.state not in visited:
                container.append(s)
                visited.add(s.state)
        n_expanded += 1
    return None

3.1c - IDDFS

Iterative-Deepening Depth First Search is an interesting technique, where we call depth-first search in a Breadth-First Search like technique. This combines the benefits of BFS and DFS.

DFS traverses through nodes going through successors of nodes.

Consumes O(d) space in O(b^m) time
- d is depth
- b is branching factor (average # of children each node has)
- m is maximum depth of search tree BFS traverses through nodes level by level
Consumes O(b^d) space with O(v+e) time

To implement IDDFS, we first need to implement depth-limited depth-first search (which stops at a certain depth).

def depth_limited_dfs(env, max_depth, verbose=True):
    container = [StateNode(env, env.init_state, [], 0)]
    # revisiting should be allowed if cost (depth) is lower than previous visit (needed for optimality)
    visited = {}    # dict mapping states to path cost (here equal to depth)    n_expanded = 0
    while len(container) > 0:
        # expand node
        node = container.pop(-1)
         if env.is_goal(node.state):        # test for goal
            if verbose:
                print(f'Visited Nodes: {len(visited.keys())},\t\tExpanded Nodes: {n_expanded},\t\t'
                      f'Nodes in Container: {len(container)}')
                print(f'Cost of Path (with Costly Moves): {node.path_cost}')
            return node.actions

        successors = node.get_successors()         # add successors
        for s in successors:
            if (s.state not in visited or
                    len(s.actions) < visited[s.state]) and len(s.actions) < max_depth:
                container.append(s)
                visited[s.state] = len(s.actions)
        n_expanded += 1
    return None

3.1d - Performance of BFS and IDDFS

In this case, both BFS and IDDFS have found solutions to this problem.

However, notice that IDDFS has significantly better performance than BFS in terms of the number of nodes expanded.
In this example however, the amount of time taken is still faster for BFS.

3.2 - Costly Moves

As a base, our previous implementation of BFS finds a path with a cost of 32.