Robots Learning Like Babies

Andrea Censi

Laboratory for Information and Decision Systems
Massachusetts Institute of Technology

http://censi.mit.edu - [email protected]

2015 MIT ILP-EPOCH Taiwan Symposium (Epoch Foundation and MIT Industrial Liaison Program, Taiwan annual meeting)

Taipei, July 28, 2015

Meet the Future of Robotics and Machine Learning in Innovation Economy

Outline

1. What is learning?

2. What is the value proposition for robotics?

3. What are the challenges in designing learning systems?

What is learning?

‣ It’s about using data
  - …to make predictions
  - …to make decisions

Examples of learning

‣ Netflix suggests a good movie for you to watch.

‣ Example of a “recommender” system
  - other classical example: shopping suggestions

‣ Netflix Prize (2006-2009): $1M for help in making the learning algorithm better.

input → algorithm → output

data → learning algorithm → a decision rule

data:
John, “Godfather”,
Jane, “Godfather”,
Jane, “Terminator”,
...

a decision rule: (you, a movie) → rating?

learning algorithm

- parameters (high dimensional)
- candidate decision rules
- error on test data, one value per candidate rule: 0.10, 0.15, 0.25, 0.25, 0.01, 0.30, 0.50

data → best* decision rule

* on the data available
* in the family considered
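
The selection step above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the family of threshold rules, the toy data, and the names `threshold_rule`, `error_rate`, and `learn` are hypothetical and do not come from the talk. The learner scores every candidate rule on the available data and keeps the one with the lowest error, which is "best" only on that data and only within that family.

```python
from typing import Callable, List, Tuple

Example = Tuple[float, int]  # (input, label); hypothetical toy format


def threshold_rule(t: float) -> Callable[[float], int]:
    """One candidate decision rule: predict 1 when the input exceeds t."""
    return lambda x: 1 if x > t else 0


def error_rate(rule: Callable[[float], int], examples: List[Example]) -> float:
    """Fraction of examples on which the rule predicts the wrong label."""
    wrong = sum(1 for x, y in examples if rule(x) != y)
    return wrong / len(examples)


def learn(thresholds: List[float], examples: List[Example]) -> Tuple[float, float]:
    """Score every candidate in the family and return (best threshold, its error)."""
    scored = [(error_rate(threshold_rule(t), examples), t) for t in thresholds]
    best_error, best_t = min(scored)
    return best_t, best_error


# Hypothetical data and family of candidates (assumptions for illustration).
data = [(0.1, 0), (0.2, 0), (0.4, 0), (0.6, 1), (0.8, 1), (0.9, 1)]
family = [0.0, 0.3, 0.5, 0.7]

best_t, best_error = learn(family, data)
print(best_t, best_error)
```

A better rule may exist outside `family`, and a rule that is best on `data` may do worse on new data; those are the two caveats attached to the asterisk.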

‣ Nest Learning Thermostat
  - acquired by Google ($3.2B)
  - see also: Ecobee in the Apple ecosystem



~ NT 300 + learning = NT 8000

Boston, March 2015

“snow”: fun only on the first day

data → learning algorithm → a control policy

data:
- sensor data time series
- “I’m cold”, “I’m hot”

a control policy: (temperature, date) → turn on/off
- a model of the system: turn on/off → temperature change
- a model of the task: date → desired temperature
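
One way to read the decomposition above is that the control policy can be built from the two models. The Python sketch below is a hedged illustration, not the method of any actual thermostat: the function names and every number in it are assumptions, written by hand here, whereas a learning thermostat would estimate them from the sensor time series and the user's "I'm cold" / "I'm hot" feedback.

```python
def desired_temperature(hour: int) -> float:
    """Hypothetical model of the task: time of day -> desired temperature (deg C)."""
    return 21.0 if 7 <= hour < 23 else 17.0


def temperature_change(heater_on: bool) -> float:
    """Hypothetical model of the system: turn on/off -> temperature change per step."""
    return 0.5 if heater_on else -0.3


def control_policy(temperature: float, hour: int) -> bool:
    """(temperature, date) -> turn on/off.

    Picks the action whose predicted next temperature is closest to the
    desired one, using the two models above.
    """
    target = desired_temperature(hour)

    def predicted_gap(heater_on: bool) -> float:
        return abs(temperature + temperature_change(heater_on) - target)

    return predicted_gap(True) < predicted_gap(False)


print(control_policy(18.0, 9))   # cold room during the day
print(control_policy(22.0, 9))   # warm room during the day
print(control_policy(18.0, 2))   # same room at night, lower target
```

Keeping the two models separate means the task model can change (a new schedule) without relearning how the house heats up, and vice versa.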


‣ What is a robot?
  - has sensors
  - has actuators
  - interacts with the physical world



‣ QUIZ: Which one is the robot?

A B C D E


data → learning algorithm → a control policy

data:
- sensorimotor experience
- task examples (given by user)

a control policy: (observations, desired state) → command
- commands → change in state
- a model of the task: observations → desired state





‣ Kiva Systems (now Amazon Robotics)
  - Learning allows high performance with very cheap components




data → learning algorithm → (commands → change in state)

data:
- recorded commands
- recorded observations
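
As a minimal sketch of learning "commands → change in state" from recorded data, the Python below fits a single gain by least squares. Everything specific here is an assumption for illustration: the one-dimensional state, the linear model `change = gain * command`, the function names, and the recorded numbers are hypothetical, not taken from the talk or from any real robot. The idea is that a cheap actuator that delivers only part of the commanded motion can still be used precisely once its actual response has been estimated.

```python
from typing import List


def fit_gain(commands: List[float], observations: List[float]) -> float:
    """Least-squares fit of change = gain * command.

    observations holds the state before the first command and after each
    command, so it has one more entry than commands.
    """
    changes = [after - before for before, after in zip(observations, observations[1:])]
    numerator = sum(c * d for c, d in zip(commands, changes))
    denominator = sum(c * c for c in commands)
    return numerator / denominator


def predict_change(gain: float, command: float) -> float:
    """Learned model: command -> predicted change in state."""
    return gain * command


# Hypothetical log: the actuator delivers about 80% of what is commanded.
recorded_commands = [1.0, 1.0, -1.0, 2.0]
recorded_observations = [0.0, 0.8, 1.6, 0.8, 2.4]

gain = fit_gain(recorded_commands, recorded_observations)
print(round(gain, 3))

# To move by a desired amount, invert the learned model.
command_for_unit_step = 1.0 / gain
print(round(predict_change(gain, command_for_unit_step), 3))
```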


‣ Learning makes systems more robust.
  - interference
  - sabotage
  - faults / wear


‣ Dyson 360eye
  - learns a 3D map of your house



‣ “Learning by demonstration” [Muelling, Peters]

data → learning algorithm → control policy

data:
- user demonstrations
- robot’s own experience

control policy: current state → commands

Learning

‣ Value proposition for robotics:
  - add value: more complex functionality
  - reduce design cost
  - reduce building cost
  - reduce operating cost

but…

Downsides of Learning

‣ Adaptivity comes at a (computational) cost.

[Figure: spectrum from more adaptive to less adaptive, comparing Human with Fruit fly, and Dyson 360eye with iRobot’s Roomba]



‣ It might be harder to understand your design.

‣ Just like “genetic algorithms”…
  - 2006 NASA ST5 spacecraft antenna.



data → learning algorithm

data: (image, label), (image, label), (image, label), (image, label), (image, “gorilla”), (image, label)

image of person → “gorilla”

evidence: + eyes + hair - clothes


‣ Machine learning systems are hard to maintain.


Learning

‣ Value proposition for robotics:
  - add value: more complex functionality
  - reduce design cost
  - reduce building cost
  - reduce operating cost

‣ But:
  - systems need more computational resources
  - systems are hard to trust
  - systems are less maintainable

‣ Learn about learning: “The Master Algorithm” by Domingos (Sep’15)
