AConversationalAgent..

Download Report

Transcript AConversationalAgent..

A Conversational Agent to
Navigate in Virtual Worlds
http://www.cs.lth.se/home/Pierre_Nugues/Articles/twlt11/twlt11.pdf
CHI 2000 Workshop on Natural
Language Interfaces
The Hague, The Netherlands
Pierre Nugues, Christophe
Godéreaux, Pierre-Olivier El
Guedj, & Frédéric Revolta
Early Experiment - Shrdlu
• Shrdlu was written by Terry Winograd at
the M.I.T. Artificial Intelligence Laboratory
in 1968-70. Moves objects around and
makes inferences about the world.
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
•
Person: IS THERE A LARGE BLOCK BEHIND A PYRAMID?
Computer: YES, THREE OF THEM: A LARGE RED ONE, A LARGE GREEN CUBE,
AND THE BLUE ONE.
Person: PUT A SMALL ONE ONTO THE GREEN CUBE WHICH SUPPORTS A
PYRAMID.
Computer: OK. (does it)
…
Person: HOW MANY THINGS ARE ON TOP OF GREEN CUBES?
Computer: I'M NOT SURE WHAT YOU MEAN BY "ON TOP OF "IN THE PHRASE
"ON TOP OF GREEN CUBES " .
DO YOU MEAN:
1 - DIRECTLY ON THE SURFACE
2 - ANWHERE ON TOP OF ?
Person: 2
Computer: THREE OF THEM.
...
Person: DOES A STEEPLE
Computer interrupting: SORRY, I DON'T KNOW THE WORD "STEEPLE".
Person: A "STEEPLE" IS A STACK WHICH CONTAINS TWO GREEN CUBES AND A
PYRAMID.
Computer: I UNDERSTAND.
Person: ARE THERE ANY STEEPLES NOW?
Computer: NO.
Person: BUILD ONE.
Computer: OK. (does it)
Graphical Limitations
• Each command needs an interface
• Difficult to enumerate all the possible
combinations
• Takes up room to display interface
• Some commands are hard to find
• In the virtual world, some commands like
“Go through the door” is easier to say than
to navigate with a mouse.
Ulysses
Semantics & Syntax
• Trained on Corpora of 400 utterances
• 350 words plus morphological variations
• Phrase-based structure modified to handle
unknown and missing words
• Recognizes:
– Commands, such as “go to the house”
– deictic clarifications, such as “the yellow one”
– motion modifications, such as “faster”
– motion repetitions, such as “again”
Separates into five commands
• Go: Corresponds to a change of location with a
possible rotation of the embodiment. Includes
advance, enter, leave, climb etc.
• Return: Same as go, but the object visibility
does not matter.
• Rotate: Corresponds to the rotation of the user’s
embodiment head.
• Stop
• Continue
Some verbs, like “retourne”, are still ambiguous.
• Each sentence is broken down into
clauses.
• Each clause has a subject, verb group and
list of complements.
• Each complement has a semantic tag:
time, manner, location.
• Each verb group has the command and
possible object and adverbs.
Maps different styles of commands
together.
•
•
•
•
Go forward.
I want to go forward.
Can you go forward?
I want you to go forward.
Orders clauses based on
connectors and breaks down
clauses into commands.
• Go into the house and then go to the computer.
(1) Go into house
(2) Go to the computer
• Go to the house by passing in front of the flag
pole
(1) Go in front of the flag pole
(2) Go to the house
• Go back into the house
(1) Return to the house
(2) Go into the house
Geometric Reasoning
• For now objects very distinct – houses,
cars, blocks etc.
• Give an alignment to everything that has
alignment
• Distance is relative
Reference Resolution
• Uses a salience algorithm based off work
of Carla Huls.
http://www.cs.mu.oz.au/acl/J/J95/J951003.pdf
• Makes a list of possible objects.
• Places weight based on visibility, proximity
and focus.
• Asks for clarification if still ambiguous.
Dialogue Manager
•
•
•
•
Receives a list of possible parses.
Gets rid of non-executable ones.
If only one command executes it.
If not solicits clarification.
U
s
e
r
Turn around
Look at the
house
Go to the two
cars on the left
of the house
U
l
y
s
s
e
s
Viola
Okay
Viola
U
s
e
r
Turn to the right
Go to that car
Turn to the right
U
l
y
s
s
e
s
Yes
Voila
Viola
U
s
e
r
Go to the cube
Go to the small
ones
Go behind them
U
l
y
s
s
e
s
There are many
Voila
Viola
Questions?