Redirected from page "SolutionsInPerceptionChallenge"

Clear message

Solutions In Perception Challenge

The new solutions in perception site is at http://solutionsinperception.org/

Click here to go to the new site


Below is the page for the first contest in May of 2011

Update 23 May, 2011: Pictures!

Update 19 May, 2011: The ground truths for the NIST and Willow Garage data sets have been released. The ground truths have been packaged along with the Python script used to score the output CSV files from your TOD_Stub implementations, and can be found here. A guide for reading and understanding the report files generated by the scoring script is also available.

Update 16 May, 2011: The 2011 Solutions in Perception Challenge is over. Finalized reports have been released to all of the participating teams. A simplified version of the performance report can be found here (note that the team names and affiliations have been redacted).

Short form results:

Update 6 April, 2011: The code for tod_stub has been updated and development has been frozen. See the Starting Code section for more information.

Update 29 March, 2011: New data sets from Willow Garage and NIST are available in the Data section below.

Contact: add gmail.com to end of: solutionsinperception


This challenge aims, step by step, to remove the main bottleneck to agile robotics:
                            Reliable perception;
It seeks to develop and document solutions to pragmatic robotic sensing problems.

Problem Statement:

Most existing vision databases:

And the main existing vision challenges:

Are either

The existing challenges push on the "false positive" part of the recognition curve:

Existing challenges do not take advantage of robotic sensing where may have:

Proposed Answer

For robotic [and perhaps some cell phone] needs, the Solutions in Perception Challenge seeks to incrementally and progressively establish solutions to pragmatic sensing problems. This challenge addresses the "True Positive" part of the recognition curve:

Use of active depth sensing, 2D and 3D data is explicitly allowed, but not instrumentation of the objects.

This challenge is aimed at removing the perception barriers to advanced, sensing based, agile robotics.

The Competition

To Join:

Current Competition: ICRA 2011

The first competition will be held at ICRA 2011 in the Robot Challenge section. The goal of this first contest is to show that moderately: Lambertian, rigid objects that have a lot of texture on them can be reliably recognized and their pose in 3D determined using 2D and/or 3D sensors at close range (roughly personal robot arm work space, 0.5 to 1.25 meters). If accomplished, this challenge will demonstrate that robots can reliably find and manipulate such objects under indoor florescent lighting conditions.

This challenge will make use of the kinect sensor which gives 2D RGB and 3D point clouds at 1M pixel resolutions:

With the goal of recognizing the identity and pose of 50 objects:

Conditions of the first contest

The first contest will be held at ICRA 2011 in the Robot Challenge section.

  1. The goal of this contest will be to recognize the identity and pose of 50 fairly rigid, fairly Lambertian, textured objects. One example would be something like a paper tea box. 35 of the objects will be known beforehand, 15 will be new for the contest.

  2. The sensor will be a kinect sensor using drivers provided by prime sense and Willow Garage, see the ROS kinect page here.

  3. The data (see the "Data" subsection below) will consist of extensive training and test sequences with ground truth of the 35 known objects. The data will consist of 2D images, 3D point clouds and labeled object identity and pose. Fifteen additional objects will introduced at the contest (making for a total of 35+15=50 objects). The 15 new objects will be produced by NIST but will be Lambertian, rigid and textured like the rest of the objects.

    1. The lighting will be indoor, diffuse florescent lighting, conditions where the kinect works well.
  4. The code will retain the user's choice of license subject to the "Contestant's Code" subsection below.

    1. Contestants will write their code to run with ROS. A code stub is provided so that they contestant does not need to learn ROS. The stub will just get a point cloud and image, display them and give an example of filling out the object identity, certainty and pose (the filled out pose is randomly chosen in the stub). Using the provided code stubs, your code will run will automatically run on the PR2 robot as well.

    2. The code will implement training, test and object persistence functions.
  5. Scoring, see the "Contest Format" subsection below. There will be 2 elimination rounds followed by a real round.

    1. The starting rounds will have single objects at a time, then multiple objects where we look for object ID only.
    2. This will be followed by single objects where we stress the accuracy of precise pose estimation.
    3. Finally, there will be cluttered, fixtured scenes where the algorithm must recognize multiple objects and their pose.
    4. The top performing code will be demonstrated on a PR2 Robot grasping the items.

    5. All the scores will be cumulative when deciding the prize money.
    6. Conference organizers reserve the right to make judgement calls at any point and on any matter in the contest. These judgements will be final.
  6. Prizes will be awarded exponentially according to performance (rounding down), see the "Prizes" subsection below.

Code

Contestant's Code

This contest aims to establish and to advance what perception problems have workable solutions for robots. To do such advances, researchers must be able to train and test against previous code, ensuring fair and easily repeatable comparison and experimentation with past work. We also hope, however, to foster the growing use of robotics in society. To achieve this, contestants will:

  1. Use any license that they desire for their code. It may be closed, open, free or paid subject to:
  2. You will license the contest organizers, which include Willow Garage and NIST rights to the source code to build and run and maintain your code (which may include making minor alterations to your code in order to resolve bugs or issues with ROS) on servers of the contest organizers which specifically right now include Willow Garage and/or NIST's choice for the purposes of the contest.
  3. You will further license others to run your code on the contests servers which include right now Willow Garage and/or NIST servers. Researchers must be able to train and/or test old and new data sets against your code, but only on the servers specified by the contest organizers. This does not give other's the right to run your code on their system or products. It is just to have a means to compare newer algorithms directly against older algorithms. Again, the licensing of further uses of the contestant's code is up to the authors of that code.
  4. You will allow the contest organizers and external researchers to publish and track any of the results of such algorithm testing.

We are thinking about organizing a optional "perception apps" licensing store. As in existing apps stores, the apps may be posted for free or for a charge. The hope is that license terms will be such as to allow for mass use. This apps store is just an idea at this point and so it may or may not happen. Random thoughts about this are that it would look something like the mash-up visual imagery programming site Processing but would help put together solutions that could be toyed with while tracking the reasonable licensing fees for commercial use.

Starting Code

The contest will be run on ROS the "Robot Operating System". This is to enable maintainability, easy testing and allow running on a robot (PR2) without having to know the details of that robot: the recognition and pose code will automatically send messages that work with the PR2 grasping and manipulation pipeline.

These same instructions can be found in your local tod_stub repository in tod_stub/tod_stub.doc.txt.

Data

The data will consist of many ROS bag files which can collect all the data and messages sent out in a real time session.

35 Objects

Number

Object_

Size US

Size Metric

1

Odwalla Orange Juice

15.2 fl. oz

450 mL

2

Odwalla Summertime Lime

15.2 fl. oz

450 mL

3

Spam Original

12oz

340g

4

Learning OpenCV

--

--

5

Mop & Glow

1qt

946mL

6

Silk Original soy milk

1qt

946mL

7

Claritin-D 24 hour

10 tablets

10 tablets

8

Hershey's Coca Special Dark

8oz

226g

9

Tazo Organic Chai tea

1.9oz (20 bags)

54g (20 bags)

10

Good Earth Original

1.43oz (18 bags)

41g (18bags)

11

Snedd's Spread Country Crock (margarine)

15oz

425g

12

Kellogg's Raisin Brand

25.5 oz

723g

13

Tilex Mold & Mildew Remover

1qt

946mL

14

Arm & Hammer Detergent

50 fl oz

1.47L

15

Tide detergent

50 fl oz

1.47L

16

Downy detergent

34 fl oz

1.02L

17

All detergent

50 fl oz

1.47L

18

Nestle Coffee Mate French Vanilla

16 fl oz

473mL

19

Gillette complete skin care shaving cream

7oz

198g

20

All 3x ultra detergent

20 fl oz

0.59L

21

General Mills Oatmeal Crisp

17oz

481g

22

Del Monte Peas & Carrots

14.5 oz

411g

23

Campbell's soup at hand creamy tomato

10.75oz

305g

24

Clorox regular bleach

60 fl oz

1.77L

25

Coke

12 fl oz

355mL

26

Ziploc plastic bags 6 1/2 x 5 7/8in (16.5 x 14.9cm)

50 bags

50bags

27

Campbell's condensed tomato soup

10.75oz

305g

28

Progresso Traditional New England Clam Chowder

18.5 fl oz

524g

29

Campell's just heat & engjoy tomato soup

15.4oz

435g

30

Crest tooth paste tarter control plus scope

4.6oz

130g

31

Colgate toothpaste Icy Blast

4.6oz

130g

32

Bumble BEe Chunk White albacore tuna in water

5oz

142g

33

Band-Aid plastic strips Johnson&Johnson

3/4 x 3 in (60 band-aides)

1.9 x 7.6cm (60 band-aides)

34

Jell-o Strawberry

6oz

170g

35

Tropicana 100% pure Orange Juice with calcium (6 pack)

8 fl oz

240mL

Keys

Contest Format

Hardware

Contestant's code will ultimately need to run on a PR2 robot. You can read the PR2 Spec Sheet here.

The PR2 has 2 quad core i7 Xeon Processors with 8 cores each for a total of 16 cores. The summary specs are:

These are the detailed PR2 CPU specs.

Some robots have GPUs, but do not assume that we will have them. You can multi-thread your code as you see fit.

Prizes

Schedule

Schedule

Date_

Event

Feb 6

Stub code and preliminary data are out

Feb 18

Official training data is out

April 15

Upload you code onto the server. Your code should compile and output the required data structures.

May 1

All code is uploaded

May 9

Conference starts

May 16

Final Results are here

ENTERING THE CONTEST

Send an email with subject "Join" to: add ''gmail.com'' to end of: solutionsinperception

Older Material

Older text is stored here.


OpenCV User Wiki

OpenCVWiki: http:/opencv.willowgarage.com/wiki/SolutionsInPerceptionChallenge (last edited 2011-09-01 20:09:17 by Hedlly)