Computer Vision Project

Abstract

So far, you have successfully programmed robots (in simulation) using reactive control strategies and taken a deep dive into the robot localization problem. Next, you will be learning about computer vision and the role it plays in robotics. In contrast to previous projects, this time you will have a great deal of freedom to shape your project toward your interests and learning goals.

Learning Objectives

  • Self-directed learning of a new robotics algorithm.

  • Implementing a robotics system with minimal scaffolding.

Teaming

For this project, you should work with one other student. Since we have an odd number of students in the class, one team of three will be allowed.

Please fill out this Google Sheet when you have a project team and Github repo.

Project Topic

Your project should be about computer vision and its intersection with robotics. Since our robots exist only in simulation this semester due to the pandemic, this means that you should choose a computer vision topic that has some plausible application in robotics. You will carry out that project either using a robot simulator, a computer vision dataset, or some combination of both.

In your project proposal you will be coming up with an implementation plan. That is, if you are using a particular algorithm to solve a problem, which parts of the algorithm will you implement, and which will you use pre-built implementations for? Be strategic in these decisions to balance learning about algorithms with system building (e.g., programming the simulated Neatos (or another robot) to do something interesting). You have substantial latitude in shaping your project to focus on the parts you really want to learn (e.g., system design versus basic understanding of algorithms). That said, we expect that you will do some exploration of algorithms as part of this project (which could include implementation or perhaps substantial learning about an algorithm or class of algorithms).

Potential Algorithm Topics

  • Object tracking

  • Image segmentation

  • Object detection

  • Text recognition

  • Fiducial tracking

  • Visual odometry

  • Structure from motion

(Note: there are some resources for these topics later in the document; a small sketch of one simple approach follows below.)
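To give a rough sense of scale for a minimal starting point, here is a sketch of simple color-based object tracking with OpenCV (one basic approach to the first topic above). This is not a prescribed method, just an illustration: the HSV bounds are placeholders you would tune for your own target, and it assumes OpenCV 4's two-value `findContours` return.

```python
import cv2
import numpy as np

# Placeholder HSV bounds for a "red-ish" object; tune these for your target.
LOWER_HSV = np.array([0, 120, 100])
UPPER_HSV = np.array([10, 255, 255])

def track_largest_blob(frame_bgr):
    """Return (cx, cy) of the largest blob in the color range, or None."""
    hsv = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2HSV)
    mask = cv2.inRange(hsv, LOWER_HSV, UPPER_HSV)
    # Clean up speckle noise before looking for contours.
    mask = cv2.morphologyEx(mask, cv2.MORPH_OPEN, np.ones((5, 5), np.uint8))
    contours, _ = cv2.findContours(mask, cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    if not contours:
        return None
    largest = max(contours, key=cv2.contourArea)
    m = cv2.moments(largest)
    if m["m00"] == 0:
        return None
    return (int(m["m10"] / m["m00"]), int(m["m01"] / m["m00"]))

if __name__ == "__main__":
    cap = cv2.VideoCapture(0)  # webcam index 0; swap in a video file or a ROS image feed
    while True:
        ok, frame = cap.read()
        if not ok:
            break
        center = track_largest_blob(frame)
        if center is not None:
            cv2.circle(frame, center, 10, (0, 255, 0), 2)
        cv2.imshow("tracker", frame)
        if cv2.waitKey(1) & 0xFF == ord("q"):
            break
    cap.release()
    cv2.destroyAllWindows()
```

Even a simple baseline like this can be a useful MVP to build on before moving to more sophisticated tracking or detection methods.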

Robot Platform

  • One option is to continue to work with the Neato simulator in Gazebo and use the simulated world for your computer vision project. You will probably want to build a more complex world (see instructions for doing that in the how-tos for the Neato Simulator). While we’ve mostly been using simple shapes in previous projects, there are a lot of built-in models that you can use to build a world.

  • We’ve been working with the Neato simulator, but there are many other simulators out there, which you’re welcome to use in this project. You can find some of them here: Other Robot Simulators.

  • You may want to use an external dataset for your project. Here are some possible starting points.

    • Visual Data has a nice collection of computer vision datasets and projects.

    • If you want a huge (but very cool) dataset for self-driving vehicles, consider using Waymo’s Open Dataset.

    • Here's another list of datasets specifically for robotics (not all are about computer vision).

    • If you’re interested in machine learning for robot control, you might consider building off some of the datasets (and code) from the 2018 Robot Learning project. You also might want to check out the Google Slides presentation summarizing their results.

    • A lot of these datasets are big and fancy, but don’t be afraid to start with smaller, more classic datasets. That’s a great way to learn without having to deal with the greater complexity that comes with some of this data.

Deliverables

There are four deliverables for this project.

Project Proposal (due 10/26)

At a minimum, please include the answers to the following questions. You should include enough detail for us to be able to give you useful feedback.

  • Who is on your team?

  • What is the main idea of your project?

  • What are your learning goals for this project?

  • What algorithms or computer vision areas will you be exploring?

  • What components of the algorithm will you implement yourself, and which will you use built-in code for? Why?

  • What is your MVP?

  • What is a stretch goal?

  • What do you view as the biggest risks to you being successful (where success means achieving your learning goals) on this project?

  • What might you need from the teaching team for you to be successful on this project?

In-class Presentation / Demo (11/9)

We’d like each team to spend about 10 minutes presenting what they did for this project. You can structure the presentation in whatever manner you’d like; however, you should try to meet these goals:

  • Explain the goal of your project

  • At a high-level explain how your system works

  • Demonstrate your system in action (either in a video [recommended] or live). If your system doesn’t work completely yet, that is fine; try to show at least one component of your system in action.

  • This presentation / demo should be very informal. It will be assessed in a purely binary fashion (basically, did you do the things above?).

Code (due 11/9)

  • You should turn in your code and writeup via Github. Please fork your repo from this one.

Writeup (due 11/9)

In your ROS package create a README.md file to hold your project writeup. Your writeup should touch on the following topics. We expect this writeup to be done in such a way that you are proud to include it as part of your professional portfolio. As such, please make sure to write the report so that it is understandable to an external audience. Make sure to add pictures to your report, as well as links to YouTube videos and embedded animated GIFs (these can be recorded with the tool peek).

  • What was the goal of your project? Since everyone is doing a different project, you will have to spend some time setting this context.

  • How did you solve the problem (i.e., what methods / algorithms did you use and how do they work)? As above, since not everyone will be familiar with the algorithms you have chosen, you will need to spend some time explaining what you did and how everything works.

  • Describe a design decision you had to make when working on your project and what you ultimately did (and why). These design decisions could be particular choices for how you implemented some part of an algorithm or perhaps a decision regarding which of two external packages to use in your project.

  • What, if any, challenges did you face along the way?

  • What would you do to improve your project if you had more time?

  • Did you learn any interesting lessons for future robotic programming projects? These could relate to working on robotics projects in teams, working on more open-ended (and longer term) problems, or any other relevant topic.

Resources and Potential Project Directions

Resources

  • In-Class Activities from Past CompRobo Offerings (the instructions on running the code are out of date, but the ideas might help):
    • Object Tracking
    • 3D Structure from Motion
    • Image Filtering
    • Object Recognition

  • Convert ROS image messages to OpenCV using CvBridge (see the short sketch after this list)
  • Connecting a webcam to ROS and OpenCV
  • An overview of methods for object tracking
  • Slides from the CompRobo Learning Project 2018 Class Report Out
  • Mastering OpenCV with Practical Computer Vision Projects
  • Visual odometry resources (one example)
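As a point of reference for the CvBridge resource above, a minimal rospy node that converts incoming image messages to OpenCV arrays might look roughly like the sketch below. The /camera/image_raw topic name is an assumption; use whatever image topic your robot or simulator actually publishes.

```python
import rospy
import cv2
from cv_bridge import CvBridge
from sensor_msgs.msg import Image

class ImageViewer:
    """Subscribe to a ROS image topic and display it with OpenCV."""

    def __init__(self):
        rospy.init_node("image_viewer")
        self.bridge = CvBridge()
        self.latest_frame = None
        # Topic name is a placeholder; check what your camera/simulator publishes.
        rospy.Subscriber("/camera/image_raw", Image, self.on_image)

    def on_image(self, msg):
        # Convert the ROS Image message to a BGR OpenCV (Numpy) array.
        self.latest_frame = self.bridge.imgmsg_to_cv2(msg, desired_encoding="bgr8")

    def run(self):
        rate = rospy.Rate(30)
        while not rospy.is_shutdown():
            if self.latest_frame is not None:
                cv2.imshow("camera", self.latest_frame)
                cv2.waitKey(1)
            rate.sleep()

if __name__ == "__main__":
    ImageViewer().run()
```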

  • Canny edge detection (see the sketch after this list)
  • Contours
  • Template matching
  • Hough line transform
  • Basics of histograms and histogram equalization
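As a sketch of how a couple of these pieces fit together, the snippet below runs Canny edge detection followed by the probabilistic Hough transform to find line segments. The thresholds and the test.png filename are placeholders you would tune and replace for your own images.

```python
import cv2
import numpy as np

def find_line_segments(frame_bgr):
    """Return a list of (x1, y1, x2, y2) line segments found in the image."""
    gray = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2GRAY)
    blurred = cv2.GaussianBlur(gray, (5, 5), 0)
    # Canny thresholds are placeholders; tune them for your images.
    edges = cv2.Canny(blurred, 50, 150)
    # Probabilistic Hough transform: returns endpoints of detected segments.
    lines = cv2.HoughLinesP(edges, rho=1, theta=np.pi / 180, threshold=50,
                            minLineLength=40, maxLineGap=10)
    if lines is None:
        return []
    return [tuple(line[0]) for line in lines]

if __name__ == "__main__":
    image = cv2.imread("test.png")  # placeholder filename
    for (x1, y1, x2, y2) in find_line_segments(image):
        cv2.line(image, (x1, y1), (x2, y2), (0, 0, 255), 2)
    cv2.imwrite("lines.png", image)
```

A pipeline like this is a common building block for tasks such as lane following in the simulated world.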

  • Basic Numpy tutorials (see the small example after this list)
  • GUI Features in OpenCV
  • Basic Operations on Images
  • Arithmetic Operations on Images
  • Corner Detection
  • Numpy Examples List
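If you're brushing up on the basics, the small example below shows the kind of Numpy indexing and simple image arithmetic those tutorials cover: cropping a region of interest, zeroing a color channel, and blending two images. The filename and slice bounds are placeholders.

```python
import cv2
import numpy as np

image = cv2.imread("test.png")   # placeholder filename; BGR array of shape (rows, cols, 3)

# Images are just Numpy arrays, so slicing gives you a region of interest.
roi = image[100:200, 150:300]

# Channel operations are plain array indexing (channel order is B, G, R).
no_red = image.copy()
no_red[:, :, 2] = 0

# Weighted blend of two same-sized images (simple arithmetic on images).
overlay = np.zeros_like(image)
overlay[:] = (0, 255, 0)         # a solid green image
blended = cv2.addWeighted(image, 0.7, overlay, 0.3, 0)

print(image.shape, image.dtype, roi.shape)
```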

Previous Student Projects to Draw From

  • Visual Navigation for Flying Robots is a course on said topic. The linked page includes lectures and even some bag files.

  • Robot Learning Report out 2018
  • Computer Vision Project Writeups from 2017

    • Self Driving Neato
    • Computer Vision Emotion Detection
    • Visual Localization
    • Predicting Paths of Tracked Objects
    • Neato Keeper
    • Lane Follower
    • Meal Recognition
    • Pac Neato
    • Neato Augmented Reality Parking