INTEX as an educational
subject in the Master's
program in Computational
Linguistics at Sofia
University
Svetla Koeva, Svetlozara Lesseva,
Ivelina Stoyanova
6th INTEX Workshop, Sofia, 28-30 May 2003
The beginning
INTEX was included in the curriculum of
the Master's programme in
Computational Linguistics in the
academic year 2001-2002.
http://compling.ibl.bas.bg
INTEX is used in teaching the subject
Computer Systems for NLP.
Main goals
To expand the students' competence in
formal linguistic representation.
To help students grasp the
theoretical complexity and scope of
linguistic phenomena.
To develop the students' competence in
finite-state automata and their application
in natural language processing.
Standard tasks for all students
Enhancing the existing FST for delimiting
sentence boundaries
FSTs for the analytic cardinal and ordinal
numerals in Bulgarian
FSTs for dates written with Roman and Arabic
numerals
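The date task can be pictured as a small finite-state acceptor over tokens. The sketch below is only an illustration in Python, not an INTEX graph and not the students' actual solution. It assumes a day.month.year layout with "." as separator and a month given either as an Arabic numeral or as a Roman numeral; the names classify, TRANSITIONS and is_date are hypothetical.

```python
import re

# Assumption: months may be written as Roman numerals I..XII.
ROMAN_MONTHS = {"I", "II", "III", "IV", "V", "VI",
                "VII", "VIII", "IX", "X", "XI", "XII"}


def classify(token):
    """Map a token to an input symbol of the automaton."""
    if token == ".":
        return "SEP"
    if token.isdigit():
        if len(token) == 4:
            return "YEAR"
        n = int(token)
        if 1 <= n <= 12:
            return "DAY_OR_MONTH"  # ambiguous: valid as a day or a month
        if 13 <= n <= 31:
            return "DAY"
    if token in ROMAN_MONTHS:
        return "ROMAN_MONTH"
    return "OTHER"


# Transition table: (state, input symbol) -> next state.
TRANSITIONS = {
    ("start", "DAY"): "day",
    ("start", "DAY_OR_MONTH"): "day",
    ("day", "SEP"): "sep1",
    ("sep1", "DAY_OR_MONTH"): "month",
    ("sep1", "ROMAN_MONTH"): "month",
    ("month", "SEP"): "sep2",
    ("sep2", "YEAR"): "year",
}
ACCEPTING = {"year"}


def is_date(text):
    """Return True if the whole string is accepted as a date."""
    tokens = re.findall(r"\d+|[IVX]+|\.|\S", text)
    state = "start"
    for tok in tokens:
        state = TRANSITIONS.get((state, classify(tok)))
        if state is None:  # no transition defined: reject
            return False
    return state in ACCEPTING
```

For example, is_date("28.V.2003") and is_date("28.05.2003") are both accepted, while "28.XIII.2003" is rejected because XIII is not a month. A real graph would also have to handle other separators, month names and case-inflected forms.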
Individual tasks
DELAF and DELACF dictionaries for:
historical periods and events,
names of institutions and companies.
DELAF and DELACF dictionaries for:
terms for chemical compounds,
botanical and zoological terms, toponyms,
abbreviations.
Individual tasks
DELACF dictionary of phraseologisms.
DELACF dictionary of frozen expressions
Decision-making in presenting the
paradigms
Some examples
Modifier + Noun head
carbon dioxide:
въглероден диоксид,въглероден диоксид.N+M:s
въглеродния диоксид,въглероден диоксид.N+M:sh
въглеродният диоксид,въглероден диоксид.N+M:sl
Some examples
metal oxide:
метален оксид,метален оксид.N+M:s
металния оксид,метален оксид.N+M:sh
металният оксид,метален оксид.N+M:sl
метални оксиди,метален оксид.N+M:p
металните оксиди,метален оксид.N+M:pd
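The entries above follow the pattern inflected form, comma, lemma, full stop, grammatical codes, colon, inflectional code. A minimal Python sketch of how such a line could be split into fields is given below. It assumes that the form contains no comma and that the codes contain no full stop; the Entry type and parse_entry function are hypothetical helpers, not part of INTEX, and the codes are treated as opaque strings.

```python
from typing import List, NamedTuple


class Entry(NamedTuple):
    form: str            # inflected form as it appears in text
    lemma: str           # canonical (dictionary) form
    pos: str             # first grammatical code, e.g. "N"
    features: List[str]  # further codes joined with "+", e.g. ["M"]
    inflection: str      # code after the colon, e.g. "sh"


def parse_entry(line):
    """Split one dictionary line of the shape form,lemma.CODES:INFL."""
    form, _, rest = line.partition(",")
    # The last full stop separates the lemma from the codes.
    lemma, _, info = rest.rpartition(".")
    codes, _, inflection = info.partition(":")
    pos, *features = codes.split("+")
    return Entry(form.strip(), lemma.strip(), pos, features, inflection)


entry = parse_entry("металните оксиди,метален оксид.N+M:pd")
print(entry.form, "->", entry.lemma, entry.pos, entry.features, entry.inflection)
```

Splitting on the last full stop rather than the first keeps the sketch usable for lemmas that themselves contain one, such as abbreviations.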
Individual tasks
FSTs for the analytic forms in the grammatical
paradigms of verbs, nouns and adjectives.
FSTs for recognising analytic verb forms in
the indicative mood, active voice: the
present perfect, pluperfect, future, future perfect,
future in the past and future perfect in the past.
Students are currently developing
FSTs for the passive voice of the indicative
mood and for conditional mood forms, in
all tenses.
Individual tasks
Negative tensed forms are a special case:
negative forms are always analytic, so FSTs
are also devised for tenses whose affirmative
forms are synthetic. Negation and question
patterns have their own word order, which
has to be taken into account as well.
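As a rough illustration of what recognising an analytic form with optional negation involves, the Python sketch below looks for the pattern [не] + present-tense auxiliary съм + participle in a token list. It is a deliberately simplified stand-in for an INTEX graph: the participle test is a crude suffix check (an assumption that will also match unrelated words ending in -л, -ла, -ло, -ли), only this one word order is covered, and the names is_participle and find_perfect are hypothetical.

```python
# Present-tense forms of the auxiliary "съм" (to be).
AUX_PRESENT = {"съм", "си", "е", "сме", "сте", "са"}

# Assumption: a participle is approximated by its typical endings.
PARTICIPLE_ENDINGS = ("л", "ла", "ло", "ли")


def is_participle(word):
    """Crude heuristic: longer than two letters and ends like a participle."""
    return len(word) > 2 and word.endswith(PARTICIPLE_ENDINGS)


def find_perfect(tokens):
    """Return (start, end) token spans matching [не] AUX PARTICIPLE."""
    spans = []
    i = 0
    while i < len(tokens) - 1:
        if tokens[i] in AUX_PRESENT and is_participle(tokens[i + 1]):
            # Include a directly preceding negative particle in the span.
            start = i - 1 if i > 0 and tokens[i - 1] == "не" else i
            spans.append((start, i + 2))
            i += 2
        else:
            i += 1
    return spans


print(find_perfect("аз съм чел книгата".split()))
print(find_perfect("той не е писал".split()))
```

Question patterns and other clitic orders would need additional paths, which is where the word-order considerations mentioned above come in.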
Some examples
Future Perfect in the Past in
Bulgarian
Perfect tense in Bulgarian
Result of the application of
analytic_tenses.grf
Tasks
Expanding the existing linguistic corpus
Unifying the notations
Devising larger and richer libraries of
dictionaries and FSTs
Master's theses
A master's thesis was written on the
representation of Bulgarian compound
nouns in INTEX.
This year students are going to use INTEX
to analyse recognition errors for:
grammar checking,
OCR correction.
Future directions
Introducing a wider range of Bulgarian
researchers to INTEX.
Applying INTEX in a wider range of
activities.
Enhancing the system with more
resources.
Cooperation in expanding the
functionalities of INTEX.