Goal: In this assignment, you will implement data structures and algorithms for exploration using occupancy grid maps.
- Do not publicly share your solution (on GitHub or otherwise).
- Collaboration is encouraged, but you should write the final code on your own.
This repository uses Git LFS. Perform the following in a terminal on your computer:
git clone [email protected]:mr-cmu/assignment4-handout.git
cd assignment4-handout
git lfs install
git lfs pull
Now, create a Python virtual environment.
python3.8 -m venv .venv
Source the environment:
source .venv/bin/activate
You will need to install the following dependencies.
pip install cprint numpy matplotlib opencv-python scipy scikit-learn
We assume a point-shaped robot with a size equal to one cell in the occupancy grid map. The state of this robot is given by `PointRobotState` in `robot.py`. As in the mapping assignment, we assume that the robot is equipped with a 360-degree field-of-view range sensor (e.g., a 3D LiDAR). The robot can explore various environments provided in the `test_data/` folder.
To set up this functionality correctly, we will rely on the solution to the mapping assignment (Assignment 2). Please complete the following task as part of the setup.
Important
Task 0.1 (0 points): Please copy your solutions for all functions from `mapper_py/data_structures/grid.py` in Assignment 2 into the file `mapper_py/data_structures/grid.py`. Similarly, copy solutions for `mapper_py/data_structures/sensor.py` and `mapper_py/mapper.py` (except the function `update_logodds` in the `Mapper` class) into the respective files within this assignment. If you created any helper functions for your Assignment 2 solutions, make sure those are copied too.
Warning
Please do not directly copy and replace the files for the task above. Some helper functions in the `mapper_py` folder provided within this assignment have changed compared to the previous assignment. You should carefully replace only the parts that have been labelled as `TODO`.
Note
If you could not get the `traverse` function to work in Assignment 2, you are free to use any function from standard libraries. One option is `skimage.draw.line` from the `scikit-image` library. However, make sure you pass the `test_traversal` test from Assignment 2 after fixing your `traverse` implementation (either on your own or through standard library functions).
To measure the exploration progress, it is beneficial to track how the entropy of the occupancy grid map changes over time.
Important
Task 0.2 (5 points): Implement the functions `cell_entropy` and `map_entropy` in the `Grid3D` class from `mapper_py/data_structures/grid.py`.
Note
To check your implementation for Task 0.2, you can use the script `entropy_test.py`. If the implementation is correct, you will see the output:
[Task 0.2]: Full Credit.
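For intuition, here is a minimal standalone sketch of the math behind these functions. It is not the `Grid3D` interface: the plain list of occupancy probabilities and the function names below are illustrative assumptions, with entropy measured in bits.

```python
import math

def cell_entropy(p):
    """Shannon entropy (in bits) of one cell with occupancy probability p.

    A cell that is certainly free (p=0) or certainly occupied (p=1)
    contributes zero entropy; a fully unknown cell (p=0.5) contributes 1 bit.
    """
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))

def map_entropy(probs):
    """Total map entropy: the sum of per-cell entropies."""
    return sum(cell_entropy(p) for p in probs)

# Two certain cells and one unknown cell -> 1 bit total.
print(map_entropy([0.0, 1.0, 0.5]))  # 1.0
```

As the robot explores and cell probabilities move toward 0 or 1, the map entropy decreases, which is exactly the progress signal plotted by the test scripts.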
Note
No implementation is required in this section. However, this information is important to read and crucial to understand before implementing the motion planners.
Let us equip our point robot with a simple library of motion primitives that enables it to move in a 26-connected grid. For a 2D illustration, consider this scenario:
Here, the robot is at the position shown in red in the occupancy grid. The motion primitives, shown by red arrows, contain information about the starting position and the direction in which the robot can move. In this assignment, we constrain the robot to move at most one cell along these directions.
This motion primitive library is implemented through the classes `SimplePrimitive` and `PrimitiveLibrary` in `exploration.py`. Please read through the docstrings of these classes to understand the provided functionality.
Warning
You are not allowed to modify the `SimplePrimitive` and `PrimitiveLibrary` classes. The motion primitive design is assumed to be given.
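For intuition only (this is not the `PrimitiveLibrary` implementation), the direction set of a 26-connected 3D grid, the analogue of the 8-connected 2D case illustrated above, can be enumerated as every unit step except the zero motion:

```python
from itertools import product

# All (dx, dy, dz) steps with each component in {-1, 0, 1},
# excluding the no-motion step (0, 0, 0): 3^3 - 1 = 26 directions.
directions = [d for d in product((-1, 0, 1), repeat=3) if d != (0, 0, 0)]
print(len(directions))  # 26
```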
Using the motion primitive library, let us create a simple motion planning setup for exploration.
Take a look at the class `ExplorationPlanner` in `exploration.py` and read through the docstrings.
Warning
You are not allowed to modify the `take_action` method inside the `ExplorationPlanner` class or any of its derived classes. The robot is assumed to always move one cell at a time along the chosen primitive.
The first task is to ensure safety by avoiding actions that may result in collision with the environment.
Important
Task 2.1 (5 points): Implement the `is_feasible` function in `ExplorationPlanner`.
Note
To check your implementation for Task 2.1, you can use `collision_test.py`. If the implementation is correct, you will see the output:
[Task 2.1]: Full Credit.
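A hedged sketch of the feasibility idea follows. The real `is_feasible` operates on `PointRobotState` and the assignment's grid class; the 2D list-of-lists grid and the `FREE`/`OCCUPIED`/`UNKNOWN` encoding below are illustrative assumptions. The key point, which the grading criteria later make explicit, is that both occupied and unknown target cells are infeasible.

```python
FREE, OCCUPIED, UNKNOWN = 0, 1, 2  # illustrative cell states

def is_feasible(grid, pos, direction):
    """Return True if moving one cell from pos along direction is safe.

    Safe means the target cell is inside the map and known to be free;
    entering occupied *or* unknown space is disallowed.
    """
    target = (pos[0] + direction[0], pos[1] + direction[1])
    rows, cols = len(grid), len(grid[0])
    if not (0 <= target[0] < rows and 0 <= target[1] < cols):
        return False  # would leave the map
    return grid[target[0]][target[1]] == FREE

grid = [[FREE, UNKNOWN],
        [OCCUPIED, FREE]]
print(is_feasible(grid, (0, 0), (1, 1)))  # True  (diagonal into a free cell)
print(is_feasible(grid, (0, 0), (0, 1)))  # False (unknown cell)
print(is_feasible(grid, (0, 0), (1, 0)))  # False (occupied cell)
```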
Now that we have the capability to avoid collisions, let us enable the robot to take random actions while avoiding collisions. Intuitively, this is the most "naive" way in which the robot can explore its surroundings.
Important
Task 2.2 (10 points): Implement the `selection_policy` function in `ExplorationPlanner`.
Note
It is difficult to quantitatively test this function since the selection policy is random. We will describe how this problem is graded later in the Frontier-based Exploration section.
For a qualitative evaluation, you can run
python explore_test.py --planner-type random
By default, the maximum number of timesteps given to the robot for exploration is 200.
You should see a confused robot trying to (safely) figure out what exists in the world...
You will also see a plot for entropy of the explored map over time.
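The core of the random policy is simply a uniform choice among the feasible primitives. A minimal sketch under assumed names (the real method works with the primitive library and the planner's own feasibility check):

```python
import random

def selection_policy(primitives, feasible):
    """Pick a uniformly random primitive among the feasible ones.

    `primitives` is a list of candidate motions and `feasible` is a
    predicate on them; returns None if no motion is safe (robot boxed in).
    """
    candidates = [p for p in primitives if feasible(p)]
    return random.choice(candidates) if candidates else None

moves = [(0, 1), (1, 0), (-1, 0)]
# Pretend (1, 0) is blocked; the policy never picks it.
print(selection_policy(moves, lambda m: m != (1, 0)) in [(0, 1), (-1, 0)])  # True
```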
Clearly this is not a good exploration strategy. However, we now have the infrastructure to test out exploration planning algorithms. Let us start with implementing the frontier-based exploration method.
Take a look at the class `FrontierPlanner`, which is derived from the `ExplorationPlanner` class. We will override the method `selection_policy` to implement a new one. You can use any frontier-based exploration method (one studied in class or your own).
Important
Task 3.1 (30 points): Implement a frontier-based planner for exploration in the `selection_policy` method within the `FrontierPlanner` class. Declare helper functions within the class as needed. The return type needs to be the same as in Task 2.2. You may need to override the `update_map` function to incorporate updating frontiers.
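Whatever frontier method you choose, the common first step is detecting frontier cells: free cells bordering unknown space. A standalone sketch under assumed names (2D list grid, 4-connected neighborhood, `FREE`/`OCCUPIED`/`UNKNOWN` encoding are all illustrative, not the assignment's API):

```python
FREE, OCCUPIED, UNKNOWN = 0, 1, 2  # illustrative cell states

def find_frontiers(grid):
    """Return the free cells that border at least one unknown cell.

    These are the classic 'frontiers': observing from one is guaranteed
    to reveal some unknown space.
    """
    rows, cols = len(grid), len(grid[0])
    frontiers = []
    for r in range(rows):
        for c in range(cols):
            if grid[r][c] != FREE:
                continue
            neighbors = [(r - 1, c), (r + 1, c), (r, c - 1), (r, c + 1)]
            if any(0 <= nr < rows and 0 <= nc < cols
                   and grid[nr][nc] == UNKNOWN for nr, nc in neighbors):
                frontiers.append((r, c))
    return frontiers

grid = [[FREE, FREE, UNKNOWN],
        [FREE, OCCUPIED, UNKNOWN]]
print(find_frontiers(grid))  # [(0, 1)]
```

A typical `selection_policy` then picks the feasible primitive that moves the robot toward the nearest frontier, e.g. via a breadth-first search through free space.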
For a qualitative evaluation, you can run
python explore_test.py --planner-type frontier
You should see the robot exploring the environment much more effectively than with the random planner.
Important
You will receive full credit for Tasks 3.1 and 2.2 if the following conditions are satisfied for all environments in `test_data/`:
- The robot never collides with the occupied space
- The robot never enters unknown space at any point during exploration
and the following condition is satisfied for at least three environments:
- The robot explores faster in frontier-based case compared to the random case. In other words, the average entropy in the frontier-based case should be lower than the random case.
Warning
There is no partial credit for Tasks 3.1 and 2.2. The frontier-based planner must outperform the random planner in at least three environments.
You can run `exploration_comparison.py` to check whether your solution passes these requirements for all the environments in `test_data/`:
python exploration_comparison.py
You are expected to receive full credit if you see the output
[Tasks 3.1, 2.2, and 2.1]: Full Credit.
You will also see plots for the entropy reduction over timesteps.
The available environments in `test_data/` are `simple_box` and `i_love_mr` (the default is `simple_box`). For visualization, you can run frontier-based exploration with the `explore_test` script for any environment using the `--env` option:
python explore_test.py --planner-type frontier --env i_love_mr
In frontier-based exploration, the robot is intuitively "pushing" the "boundary" (frontier) between free and unknown space. However, it does not reason about what it might gain beyond that boundary. To improve performance further, we will now incorporate "information gain" as a "utility" of the frontier.
Take a look at the class `MIPlanner`. You will notice that the sensor model is required to compute the information that a potential observation at the frontier location may provide about the map. In the simplest implementation, if we assume a perfect sensor, this value corresponds to the total entropy of the cells overlapping the observation.
Important
Task 4.1 (20 points): Complete the `compute_mi` function inside `MIPlanner`. This function must compute the expected decrease in entropy if the sensor perfectly measures the occupancy values of the cells.
Some notes/hints:
- Occlusion handling: If the input position is not in the grid or is occupied, the return value is zero and the reward must not be computed further along this ray.
- Due to the occlusion-handling rule above and the assumption of a perfect sensor, the MI simply becomes the cumulative sum of the entropy contained in the cells traversed by the sensor observation at the input position.
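The per-ray computation these hints describe can be sketched in isolation. This is an illustrative fragment, not the `compute_mi` signature: it assumes the cells traversed by one ray are given as a list of occupancy probabilities ordered outward from the sensor, and uses an assumed occupancy threshold for the occlusion check.

```python
import math

def cell_entropy(p):
    """Shannon entropy (in bits) of a cell with occupancy probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))

def ray_information(probs, occupied_threshold=0.9):
    """Entropy a perfect sensor would remove along one ray.

    Accumulation stops at the first cell believed occupied (occlusion):
    cells behind it cannot be observed, so they contribute nothing.
    """
    info = 0.0
    for p in probs:
        if p >= occupied_threshold:
            break  # ray is blocked; stop accumulating reward
        info += cell_entropy(p)
    return info

# Two unknown cells, a wall, then more unknown space behind the wall:
# only the two visible unknown cells count toward the reward.
print(ray_information([0.5, 0.5, 1.0, 0.5, 0.5]))  # 2.0
```

The full `compute_mi` would then sum this quantity over all rays of the sensor at the candidate position (reusing your `traverse` for each ray), returning zero outright when the position itself is outside the grid or occupied.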
You can use the script `mi_reward_map_test.py` to test your solution for Task 4.1. A correct implementation should yield:
[Task 4.1]: Full Credit.
Using this reward function, we can now write the planner.
Important
Task 4.2 (30 points): Implement the `selection_policy` function inside `MIPlanner`, leveraging the `compute_mi` function completed in Task 4.1.
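One natural shape for this policy is a greedy argmax: score each feasible primitive by the information gain at the position it leads to, and pick the best. A sketch under assumed names (the real method calls `compute_mi` on grid positions; here the scoring function and candidates are abstract placeholders):

```python
def mi_selection_policy(primitives, feasible, reward):
    """Greedy policy: among feasible primitives, pick the one whose
    resulting position promises the largest information gain."""
    candidates = [p for p in primitives if feasible(p)]
    if not candidates:
        return None
    return max(candidates, key=reward)

moves = ["a", "b", "c"]
rewards = {"a": 0.2, "b": 1.5, "c": 0.7}
print(mi_selection_policy(moves, lambda m: True, rewards.get))  # b
```

Ties and purely greedy one-step lookahead can still leave the robot stuck in low-information regions; combining the MI reward with a frontier or distance term is a common refinement.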
Important
You will receive full credit for Task 4.2 if the following conditions are satisfied for all environments in `test_data/`:
- The robot never collides with the occupied space
- The robot never enters unknown space at any point during exploration
and the following condition is satisfied for at least three environments:
- The robot explores faster in this case compared to the random case. In other words, the average entropy in the information-theoretic case should be lower than the random case.
and the following condition is satisfied for at least two environments:
- The robot explores faster in this case compared to the frontier-based case. In other words, the average entropy in the information-theoretic case should be lower than the frontier-based case.
Your results for the three planners should look like the video below, with Random first, then Frontier, then MI.
hw4-example-3d.mp4
Just like in the frontier-based case, you can run
python exploration_comparison.py
to see your score, or `explore_test.py` for debugging.
Assuming you are in this assignment's directory, run this command after completing your solutions:
tar -C . -cvf handin.tar explore_py
Submit `handin.tar` on Autolab.
Autolab will run tests on each function you implement and you will receive a score out of 100. You may upload as many times as you like. Note that we may regrade submissions after the deadline passes.
Rebecca Martin, Andrew Jong, Kshitij Goel, Wennie Tabib