Introducing Bayesian Network in

Transcript Introducing Bayesian Network in

Introducing Bayesian
Networks in
Neuropsychological
Measures
Presented at 2006 APS Annual Conference
Toshi Yumoto
University of Maryland, Abt Associate
Gregory Anderson
Xtria, Adler School of Professional Psychology
Daisy Wise
University of Maryland
Bayesian Networks
It is very difficult for practitioners to combine this
information objectively


Bayesian Networks provide an effective way to
combine different sources of information such as:



Paul Meehl, Clinical versus Statistical Prediction (1954)
Information from distinct domains
Demographic (e.g. gender, race/ethnicity) and Clinical
Information (e.g. previous diagnosis, different type of tests)
It is easy to add new information or modify existing
information
Bayesian Network (2)




A Bayes Net is suitable for discrete data
 Continuous data needs to be converted into categorical data
Complex models can be divided into several conditionally
independent parts
 Model estimation is easier
 Straightforward to add conditional independent data
Bayesian Network is visual and intuitive
 Class assignments are expressed as a proportion (e.g. percent)
 Provides a prediction for all parameters based on limited (or no)
information
 Allows both subjective and objective evaluation of a network
Microsoft Belief Networks (MSBNx, Kadie, Hovel, Horvitz, 2001)
was used to build a Bayesian Network
Steps in Network Development

Part I


Conversion of continuous scores to discrete
categories by Finite Mixture Approach
Part II


Create Bayesian Network based on hypothesized
model
Estimate conditional probabilities


This research used a Latent Class Model
Part III

Examine and Modify model
Measures and Sample




The ATLAS is a comprehensive assessment for
ADHD containing 7 different sections (Anderson &
Post, 2006). One of the sections is a series of
neuropsychological measures and observations of
performance during testing.
This paper examines three of the major traits of the
neuropsychological measures for ADHD.
Diagnoses of ADHD & LD were gathered from a
parent report.
The sample is around 220 subjects, 8-18 years of
age, from across the nation, gathered in the test
field trials.
Creation of Discrete scores

The assumption was made that test scores
are from more than one distribution


Cut scores were established using the
intersection of the distributions



A Finite Mixture Model was therefore utilized
# of mixture distribution = # of cut scores + 1
A discrete score was assigned based on a
person’s mixture distribution characteristic
For more information contact authors.
Finite Mixture Analysis for Trail A
0.2
Trail A time: Comparison of Observed and Mixture
Distributions
Trail A time: Two Normal Distributions
0.04
0.18
0.035
Observed
0.16
One Normal
0.03
Two Normal
0.14
Three Normal
Normal
0.12
Probability
Proportion
0.025
0.1
0.08
Abnormal
0.02
0.015
0.06
0.01
0.04
0.005
0.02
0
0
10
30
50
70
90
Time in seconds
110
130
150
10
30
50 Cut Score
70
90
Time in Seconds
110
130
150
Hypothesized Model
LD/ADHD
Conditionally Independent Given
Second Order Latent Class
Second Order
Latent Class
Visual Trace
Latent Class
Impulsive/Error
Latent Class
Memory
Latent Class
TAtime
TBtime
DA
TA
DA
Cancel
TAerror
Memory1
Memory3
Forward
Backward
SS
TBerror
CancelO
CancelC
Model Specification

Three Distinct Domains




Second Order Latent Class


Visual Trace/Sequence
 Three Latent Classes
Memory
 Three Latent Classes
Impulsive/Error
 Two Latent Classes
Four Latent Classes
Diagnosis (LD/ADHD)

Four manifest categories
 Typical, LD, ADHD, and LD/ADHD
Specification of the Bayesian
Network



For each domain conditional probability of
indicator variables are specified given latent
class membership (for that domain).
These probabilities are first specified based
on the assumption that we have no
information.
These are then updated, given information
such as test scores or clinical observations.
Partial Model: Visual Trace Network
No information known
Partial Model: Visual Trace Network
All item scores known

Probability of LD/ADHD state given




Middle Visual Trace level
Middle Memory level
High Impulsive level
SOClass and LD/ADHD states are unobserved

Proportions are expected class states

Probability of LD/ADHD state with Six test results




Trail A time (middle) and error (middle)
Trail B time (middle-low) and error (middle)
Word Memory 1 (low) and Word Memory 3 (low)
Other nodes have expected category distribution

Highest probability indicates most likely category



It is easy to combine additional information such as clinical observations and gender
to improve model prediction.
Clinical observation is conditionally independent from other nodes given LD/ADHD
states (i.e. only affect the probability of LD/ADHD node)
Gender has direct effect on LD/ADHD states and Impulsive/Error level, which
indirectly affect second order class states.

This type of information is harder to add later and should be included from the beginning, if
appropriate.
Summary





Bayesian Networks provide an effective way to express and
examine hypothesized models.
The model performance can be compared with precision of
prediction (e.g. LD/ADHD diagnoses in this research).
Any statistical procedures estimating expected scores (i.e.
probability of responses) may be used to build a network.
 Bayes Net uses discrete data, therefore latent class model and
latent trait model (with discrete proficiency levels) nicely fit model
development.
Combining additional information is straight forward and relatively
easy
 Understanding of conditional independence is the key
Bayes Net estimates expected probability from available
information
 Makes best possible diagnosis without complete data
Contacts

Toshi Yumoto


Gregory Anderson


[email protected]
[email protected]
Daisy Wise

[email protected]

Introducing Bayesian Network in

Transcript Introducing Bayesian Network in

Directory