MedAT - Knowledge Engineering Group

Download Report

Transcript MedAT - Knowledge Engineering Group

MedAT:
Medical Resources
Annotation Tool
Monika Žáková*, Olga Štěpánková*, Taťána Maříková
*Department
of Cybernetics, CTU Prague
Institute of Biology and Medical Genetics, Prague
[email protected], [email protected]
1
Outline
1.
Motivation
2.
System Description
3.
Creating Annotation
4.
Additional Functionalities
5.
Knowledge Representation
6.
Ontologies

Task Ontologies
 Domain Ontologies
7.
2 / 20
Results and Conclusion
KEG seminar 8.12.2005
Motivation

Patients’ records represent a valuable source of information

Records stored in semi-structured text files

For sharing and data mining format such as ontology or
relational database needed

Currently known methods for text mining not applicable,
since
Records heterogeneous – type of examination, personality of doctor
 Abbreviations used (some non-standard)
 Gazetteers not available in Czech

3 / 20
KEG seminar 8.12.2005
Motivation II

Grant “Relational ML for analysis of biomedical data” of the
Czech national research program Information Society in
cooperation with the Institute of Biology and Medical
Genetics, 2nd Medical Faculty of Charles University

Relational data mining using subgroup discovery
methodology

Need to transform data from text files into a form suitable
for relational data mining i.e. relational database and rules
4 / 20
KEG seminar 8.12.2005
System overview
Ontologies
Forms generator
Medical record
Knowledge base
5 / 20
Relational
database
KEG seminar 8.12.2005
System description

Creating semantic annotations of medical records

Based on Dynamic Narrative Authoring Tool

Modular architecture

Export to knowledge base in OCML, OWL

Export to relational database

Visualization – genealogical tree
6 / 20
KEG seminar 8.12.2005
MedAT GUI
7 / 20
KEG seminar 8.12.2005
Creating Annotations

Dynamically generated forms

A form  one major class in ontology  master table in
the database

E.g. Patient, Examination

Adding abbreviations and aliases to the ontology

Filling of forms

Automatically by parsing
 Drag and drop from records in text format
 Manually in case OCR not effective
8 / 20
KEG seminar 8.12.2005
Creating Annotations II
9 / 20
KEG seminar 8.12.2005
Additional functionalities

Exploration of data stored in the relational database



Pre-defined SQL queries – knowledge of SQL not required
Writing queries directly in SQL
Visualization

10 / 20
Genealogy tree
KEG seminar 8.12.2005
Knowledge representation

Core formalism – Apollet
Apollet

Frame-based formalism based on OCML

Formalism used by Apollo ontology editor => possibility to
use I/O modules of Apollo

Export to lisp available

Inference engine available

Disadvantage: rules very often just lisp functions
11 / 20
KEG seminar 8.12.2005
Relational database

Tables of the relational database generated automatically
from the ontology

Semantic description of the database given by an ontology

Export done in a batch for a particular version of ontology
and knowledge base

Export intended for a data mining experiment

Currently PostgreSQL database used
12 / 20
KEG seminar 8.12.2005
Ontologies
MedAT relies on ontologies on 2 levels:

Task ontologies
 Describe

structure of different medical records
Domain ontologies
 Formalize
knowledge about a specific domain
e.g. diseases, family relations, time points
13 / 20
KEG seminar 8.12.2005
Task Ontologies

Developed on basis of procedures and structure of medical
records in cooperation with medical doctors

Hierarchy induced by part-whole relationship
OCML – slots with facets
 OWL – hasPart, partOf (W3C Working Draft)


Serve as basis for generating of forms and tables in
relational database
14 / 20
KEG seminar 8.12.2005
Task Ontologies - Example

Classes - elements of medical
records e.g. object of
examination, therapy

Slots – description of
composition of medical
records e.g. class
examination has slots date,
doctor, has_therapy
15 / 20
KEG seminar 8.12.2005
Domain ontologies

Use of third party ontologies e.g. GALEN, Gene Ontology

Ontology of family relations

Need for rules e.g. hasHalfBrother(x,y)
hasHalfBro ther ( x, y )  male ( y ) 
a , b:( hasParent( x, q )  hasParent( y , a ))
 ( hasParent( x, b )  hasParent( y , b ))
OWL – no standardized rule language (ORL)
 OCML – lisp functions

16 / 20
KEG seminar 8.12.2005
Time ontology
Time ontology

Developed originally for historical narratives

Based on Allen’s algebra

Uncertain time points and intervals



Extended to cover time events specific for medical domain


Uncertain temporal position
Uncertain granularity
E.g. before surgery, during infancy
Available in OCML
17 / 20
KEG seminar 8.12.2005
Results

Easily transfer information from medical reports to
dynamically generated forms

Data from forms saved to a knowledge base and relational
database

Iterative extending of ontologies, adding aliases and
abbreviations

Tool currently being tested at the Institute of Biology and
Medical Genetics for patients with neurofibromatosis type 1
18 / 20
KEG seminar 8.12.2005
Future work

Text mining methods for semi-automatic annotation

Tool for semantic search and retrieval of a relevant subset
of data and visualization of retrieved data

Use of annotated data along with information about
genotype for data mining using subgroup discovery
methodology
19 / 20
KEG seminar 8.12.2005
Questions
Thank you for your attention
Questions???
20 / 20
KEG seminar 8.12.2005