Generative Lexicon

Download Report

Transcript Generative Lexicon

Generative Lexicon- Idea and Practicality
Debasri Chakrabarti
02408601
Guide: Prof.Milind S. Malshe
Co-Guide: Prof. Pushpak Bhattacharyya
1
Overview
• Introduction
• Polysemy and the Logical Problem of Polysemy
• Generative Lexicon Theory
• Lexicon Building
• Applications and Limitations of GLT
• Conclusion
2
Introduction
•
Lexicon— ideally collection of all words of a language
• Information stored in a lexicon Phonetic information
 pronunciation
 Semantic information
 meaning
 Morphological information
 transitivity and intransitivity (verbs) , count vs. mass (noun)
3
Lexicon (contd…)
Example of “eat” in the Oxford Advanced Learner’s Dictionary
eat /i:t/ v (pt ate /et/; pp eaten /i:tn/):1. sth (up) to food into the mouth,chew and swallow it: he was too ill to eat
Pronunciation
Meaning
Morphological information
Lexical entry
4
Mental Lexicon
• Mental Lexicon: information stored in the mind of a native speaker
• Native speakers store information
 Phonetic information
 pronunciation
 Semantic information
 meaning
 Morphological information
 transitivity vs.intransitivity (verbs), count vs. mass (noun)
• Additional information
 use of a word in a new context, syntactic environment of a word, wordformation rules
5
Example of Mental Lexicon
Example of eat in a native speaker’s mind
• Pronunciation: long /i:/ is used in eat
• Grammatical information: past tense is ate /et/
• Word-formation rules: /-s/ is the third person singular present tense marker as in
he eats
• Meaning: 1. Take in solid food: she ate a banana
2. Take a meal: we did not eat until 10 P.M.
3. Worry or cause anxiety in a persistent way: what’s eating you up.
• Syntactic Information: eat needs an agent to perform the action.
the agent role is obligatory.
6
Lexicon in Computational Linguistics
Lexicon meant for Natural Language Processing (NLP) must have the
following properties:
•
Morphological information

Parts of speech information

Rules should be there to deal with both regular and irregular forms
e.g ate (past tense of eat)
men (plural of man)
•
Semantic information

•
Can handle lexical ambiguity
Syntactic information

Action verbs will always have an agent
7
Polysemy and the Logical Problem of
Polysemy
Polysemy
•
An individual word can have indefinite number of subtle meaning
difference
•
Natural Languages are highly polysemous
•
This creates ambiguity
•
Weinreich distinguishes between two types of ambiguity


Contrastive ambiguity
Complementary polysemy
8
Polysemy and the Logical Problem of
Polysemy (contd…)
Contrastive Ambiguity
•
•
A lexical item carries two distinct unrelated meanings
This is a case of homonymy

words spelled or pronounced in the same way but have different
meanings
Example:


bank a financial institution
bank place beside a body of water.
9
Polysemy and the Logical Problem of
Polysemy (contd…)
Complementary Polysemy
•
Manifestation of the same basic sense
• Denotes a relation among different senses
Example,
John crawled through the window.
 The window is closed.

Sense 1. Apparatus
Sense 2. Physical Object
10
Sense Enumeration Lexicon (SEL)
•
Simplest model of lexical design to capture the logical polysemy.
•
Widely accepted in both computational and theoritical linguistics.
•
Direct approach to handle polysemy is to allow the lexicon to have
multiple listing of words, each annotated with a separate meaning
or lexical sense.
11
Sense Enumeration Lexicon (SEL)
•
Example of Contrastive Senses
bank1
CAT= count-noun
GENUS= financial-institution
bank2
CAT= count-noun
GENUS= shore
12
Sense Enumeration Lexicon (SEL)
•
Example of Complementary Polysemy
Window1
CAT= count-noun
GENUS= apparatus
Window2
CAT= count-noun
GENUS= artifact
13
Sense Enumeration Lexicon (SEL)
•
Possible Modification of Complementary Polysemy in
SEL
window
sense1
CAT= count-noun
GENUS= apparatus
sense2
CAT= count-noun
GENUS= artifact
14
Generative Lexicon Theory(GLT)
•
•
Major Problems for Lexical Semantics

to explain the polymorphic nature of language

to characterize the semanticality of natural language utterances

to capture the creative use of words in novel contexts

to develop a richer, co-compositional semantic representation
Generative Lexicon Theory

developed by James Pustejovsky

crucial aspect of GLT is the representation and treatment of polysemy

it examines the meaning of words to see the range of polysemy
15
Methodology of Generative Lexicon
Theory
Generative lexicon involves the following methodology
•
Argument Structure
 True Arguments
 Default Arguments
 Shadow Arguments
 True Adjuncts
•
Event Structure
•
Qualia Structure
 Formal
 Constitutive
 Telic
 Agentive
16
Argument Structure
•
True Arguments: syntactically realized parameters of the
lexical item
John arrived late
•
Default Arguments: logically present in the expressions
but are not necessarily expressed syntactically.
John built the house out of bricks
•
True Adjuncts:
 modify the logical expression
 part of the situational interpretation
She drove down to New York on Tuesday.
17
Argument Structure (contd…)
•
Shadow Arguments: semantically incorporated in the lexical
item and are expressed by discourse specification and contextual
factors
Mary buttered her toast

hidden argument is the material being spread on the toast

these are not optional arguments but expressible only under specific
conditions

refer to the semantic content that is not necessarily expressed in syntax
Example: Mary buttered her toast with margarine
18
Event Structure
•
•
event type of a lexical item and a phrase
events can be sub-classified into at least three sorts: State, Process
and Transition
Event Structure of build as found in the following expressions
They are building a new house
The house was built by John
build
EVENTSTR=
E1= process
E2= state
19
Qualia Structure
•
gives a relational force for a lexical item
•
composed of four qualia roles




Formal: This qualia role distinguishes a lexical item within a
larger domain.
Constitutive: This is a relation between an object and its constituent
parts.
Telic: This specifies the purpose and function of a lexical item.
Agentive: This indicates the factors involved in the origin of a
lexical item.
20
Qualia Structure (contd…)
Qualia Structure for novel
novel
Qualia
const = narrative
formal = book
telic = reading
agent = writing
21
Lexical Conceptual Paradigm (LCP)
•
The term is used by Pustejovsky and Anick (1988)
•
Refers to the ability of a lexical item to cluster multiple senses
Example,


•
John crawled through the window.
The window is closed.
Resulting LCP

phys-obj.aperture-lcp = [phys-obj]
[aperture]
22
Generative Device
•
Type Coercion

a lexical item or phrase is coerced to a semantic interpretation by a
governing item in the phrase, without changing its syntactic type
Mary wants John to leave
Mary wants to leave
Mary wants the book
•
Function Application with Coercion


different complement type of the verb
different interpretations of the verb that arise for the different
complements
23
Generative Device
•
Selective Binding

•
a lexical item or a phrase operates specifically on the substructure of a phrase,
without changing the overall type in the composition
a good knife: a knife that cuts well
Co-composition

multiple elements within a phrase behave as functors, generating new nonlexicalized senses for the words in composition
John baked the potato
John baked the cake
24
Lexicon Building
• Building of WordNet


lexical database organised in terms of concept
each concept is related to each other in terms of various semantic
relations
• Building of a Universal Word Dictionary


building a lexicon for Universal Networking Language
Universal Networking Language (UNL) is an electronic language
for computers to express and exchange all kinds of information
• Creation of Verb hierarchy Tree

creating a verb knowledge base for the UNL system
25
Building of WordNet
•
Different semantic relations in WordNet





•
Synonymy
Antonymy
Hypernymy and Hyponymy
Meronymy and Holonymy
Entailment and Troponymy
Multiple Hypernymy in Euro WordNet



Disjunctive Hypernym
Conjunctive Hypernym
Nonexclusive Hypernym
26
Building of WordNet
•
Disjunctive Hypernym


these are incompatible types that never apply simultaneously
found among nouns that refer to the participant in an event
but do not restrict for the type of entity participating
threat
- Role- Agent
- Has Hypernym
- Has Hypernym
- Has Hypernym
threaten
person; disjunctive
thing; disjunctive
idea; disjunctive
27
Building of WordNet
•
Conjunctive Hypernym


•
•
these are compatible types that always apply simultaneously
found for verbs in which multiple aspects are combined.
Dutch Example
doodschoppen to kick to death
- Has Hypernym doden (to kill); conjunctive
- Has Hypernym schoppen (to kick); conjunctive
Similar Hindi example
huMkarnaa: Dranao ko ilae jaaor ka Sabd krnaa (to shout to scare somebody)
- Has Hypernym Dranaa
- Has Hypernym icallaanaa
(to scare)
(to shout)
conjunctive
conjunctive
28
Building of WordNet
•
Non-exclusive Hypernym

either both aspects may apply simultaneously or one of both may
apply
knife
- Has Hypernym weapon
- Has Hypernym cutlery
29
Building of a Universal Word
Dictionary
Construction of Universal Word (UW) in Universal Networking
Language (UNL)
• UNL – electronic language for computers to express and exchange all kinds of
information
• UW – character strings representing unique concept
eat (icl>consume) as in he is eating
eat (icl> damage) as in the house was eaten up by the heat
represented by an English word
captures all the meanings conveyed by that word
restrictions are attached to create unique sense
• UNL Knowledge Base (KB)— performs the task of defining all possible
relationships between two UWs.
30
How to create an UW
I. First a category is decided
a. nominal concept (icl> thing) is attached
e.g swallow(icl> thing)
b. verbal concept
(icl>do) concept of an event caused by something or someone
change (icl>do) as in I changed my mind.
(icl>occur) concept of an event that happens of its own accord
change (icl>occur) as in The weather will change.
(icl>be) concept of a state verb
know(icl>be) as in I know you.
31
How to create a UW(contd…)
To handle the ambiguity of a UW
• For a nominal concept, a subordinate category from the uw hierarchy
should be used rather than a thing.
Example: swallow (icl>bird) as in the swallow is singing.
swallow(icl>action) as in he took the drink at [in] one swallow.
swallow(icl>quantity) as in take a swallow of water.
• For a verbal concept possible case relations are attached.
case relations are like obj>thing, obj>person, gol>thing
Example: spring(icl>occur(obj>liquid)): expresses gushing out as in to spring out
spring(icl>do(gol>place)): expresses jumping up as in to spring up
32
Creation of a verb hierarchal tree
Creation of the Verb knowledge base
Following :
1.Beth Levin’s methodology of verb alternation
example, a. Bill sold a car.
b. Bill sold Tom a car.
2. Hypernymy relation of English Wordnet
Hypernym denotes superset of a concept
example,
animal
Hypernym
cat
33
Creation of a verb hierarchal tree
contd…
• Beth Levin gives the syntactic information.
• Hypernymy gives the semantic information.
• The classification is in the following manner:
 "do(agt>thing,obj>thing {,gol>thing,src>thing,icl>do})"
 "argue({icl>do(}agt>thing,obj>thing,ptn>thing{)})"
34
Creation of a verb hierarchal tree
contd…
Format of the entry:
1Tab
"attack({icl>do(}agt>thing,obj>thing{)})"; Most wild animals
won't attack humans unless they are provoked. /Army forces have been
attacking (the town) since dawn with mortar and shell fire. / Napoleon
attacked Russia in 1812 and was defeated and forced to retreat. (to
make an attack on sb/sth)
2Tab
Tab"assault(icl>attack(agt>thing,obj>thing,man>emotionally))"
Nightmares assaulted him regularly.(to attack sb emotionally)
2Tab
Tab"assault(icl>attack(agt>thing,obj>thing,man>physically))"
;He got two year's imprisonment for assaulting a police officer.[Vn](to
attack sb physicaly and violently, esp when this is a crime)
35
Application of GLT
•
•
•
•
•
Formal role is similar with the hypernymy relation
Constitutive role is similar with the meronymy
relation
Telic role is similar with the functional link given
between a Noun and a Verb in the Hindi WordNet
LCP is used in the multi hypernymy process
Event structure is specified by the ontology nodes in
the Hindi WordNet
36
Application of GLT

English Wordnet (1.7.1) gives 63 senses for the verb sense of break
interrupt, break 1-- (end prematurely; break a lucky streak)
break, break off, discontinue, stop 10-- (prevent completion; stop the project; break the
silence)
break, break away 18-- (interrupt a continued activity; She had broken with the traditional
patterns)
break 31-- (stop or interrupt; He broke the engagement; We had to break our plans for a
trip to China)
separate, part, split up, split, break, break up 33-- (discontinue an association or
relation; go different ways; The business partners broke over a tax question; The couple
separated after 25 years of marriage; My friend and I split up)
37
Application of GLT
•
Merging of senses using GLT
Break
EVENTSTR
QUALIA
E: event
FORMAL: interruption
AGENTIVE: break_act
38
Limitations Of GLT
•
Attempts to distinguish between polysemy and
accidental homonymy
Example of bake
 baked a cake (creativity)
 baked a potato (change of state)
•
Pustejovsky’s suggestion
 cake-artifact
 potato-nat obj
Problem: how to deal with artifacts like knife, car?
39
Conclusion
• Generative mechanisms fail to predict polysemy or
generate polysemous sense
• Generative mechanisms along with ontology can be a
powerful device
• This implies the building of a rich ontology
40