Transcript Chapter 11
LEARNING FROM DATA
Lecture Ten
(Chapter 10, Notes;
Chapter 11, Textbook)
1
Chapter 10: Learning From Data
Outline
The Concept of Learning
Data Visualization
Neural Networks
The Basics
Supervised and Unsupervised Learning
Business Applications
Association Rules
Implications for Knowledge Management
2
Chapter 10: Learning From Data
The “Context” of Learning
The “value added” collaborative intelligence
layer of KM architecture
Relevant technologies are:
Artificial Intelligence
Experts Systems
Case-Based Reasoning
Data Warehousing
Intelligent agents
Neural Networks
3
Chapter 10: Learning From Data
The “Process” of Learning
A process of filtering and
transforming data into valid
and useful knowledge.
Automate via technology tools:
provide a collaborative
learning environment for
participants
enhance their ability to
understand the processes /
tasks they are dealing with
4
Chapter 10: Learning From Data
The “Goals” of Learning
Final goal is to improve the
qualities of communication and
decision making
Ways to achieve these goals:
Verify hypotheses (formed
from accumulated knowledge)
Discover new patterns in data
Predict future trends and
behaviour
5
Chapter 10: Learning From Data
Learning from Data
Build learning models that automatically
improve with experience.
Top-down approach
Generate ideas
Develop models
Validate models
Bottom-up approach
Discover new (unknown) patterns
Find key relationships in data
6
Chapter 10: Learning From Data
Top-down approach (Example)
Start with a hypothesis derived
from observation or prior
knowledge
“Tourists visiting Egypt earn an
annual income of at least
$50,000”
Hypothesis tested by querying
database followed by analysis
If tests not supportive,
hypothesis is revised and test
7
Chapter 10: Learning From Data
Bottom-up approach (Example)
No hypothesis to test
“Find unknown buying
patterns by analyzing the
shopping basket”
“ … showed married males,
age 21 to 27, shopped for
diapers also brought beer.
“store decided to stack beer
cases next to diaper shelf”
8
Chapter 10: Learning From Data
Data Visualization
Explore visually for trends in
data useful for making decision
Exploring data includes:
Identify key attributes and
their distribution
Identify outliers
Extract interesting grouping
of data subsets
Identify initial hypothesis
9
Chapter 10: Learning From Data
Example of Data Visualization
(John Snow and the Cholera outbreak in London, 1845)
10
Chapter 10: Learning From Data
Artificial Neural Network as
Learning Model
Modeled after human brain’s
network
Simulate biological
information processing via
networks of interconnected
neurons
Neural networks are analog
and parallel
11
Chapter 10: Learning From Data
Neurons – The Basic Elements
The neuron receives
inputs, determines their
weights (strengths), sums
the combined inputs, and
compares the total to a
threshold (transfer
function)
If total is greater than
threshold, the neuron fires
(sends an output)
12
Chapter 10: Learning From Data
A Neuron Model
13
Chapter 10: Learning From Data
Learning in Neural Network
Supervised
The NN needs a teacher
with a training set of
examples of input and
output
Unsupervised (or SelfSupervised)
Does not need a teacher
14
Chapter 10: Learning From Data
Supervised Learning
Each element in a training
set is paired with an
acceptable response
Network makes successive
passes through the
examples
The weights adjust toward
the goal state.
15
Chapter 10: Learning From Data
A Supervised Neural Network
(An Example)
16
Chapter 10: Learning From Data
Unsupervised Learning
No external factors can
influence adjustment of
input’s weights
No advanced indication
of correct or incorrect
answers
Adjusts through direct
confrontation with new
experiences
17
Chapter 10: Learning From Data
Business Applications (1)
Risk management
Appraise
commercial loan applications
NN
trained on thousands of applications,
half of which were approved and the other
half rejected by the bank’s loan officers
Through
supervised learning, NN learned
to pick risks that constitute a bad loan
Identifies
loan applicants who are likely to
default on their payments
18
Chapter 10: Learning From Data
Business Applications (2)
Predicting Foreign Exchange Fluctuations:
A set of relevant indicators were identified,
used as inputs to NN
NN was trained for exchange rates of US
dollar against Swiss franc and Japanese yen,
using data from first 6 months of 1990.
Then it was tested over an 8- to 11-week
period
Results revealed return on capital of about
20%
19
Chapter 10: Learning From Data
Business Applications (3)
Mortgage Appraisals:
Neural
network uses the data in the
mortgage loan application
It
estimates value of the property based
on the immediate neighborhood, the
city, and the country
The
system comes up with a valuation
for the property and a risk analysis for
the loan.
20
Chapter 10: Learning From Data
Association Rules
A KB tool that generates a set of
rules to help understanding
relationships that exist in data
Types:
Boolean
rule
Quantitative
rule
Multi-dimensional
Multi-level
rule
association rule
21
Chapter 10: Learning From Data
Boolean Rule (An Example)
A rule that examines the
presence or absence of items
For example, if a customer
buys a PC and a 17” monitor,
then he will buy a printer.
Presence of items (a PC and
17” monitor) implies
presence of the printer in
the customer’s buying list
22
Chapter 10: Learning From Data
Quantitative Rule (An
Example)
A rule that considers the
quantitative values of items
For example, if a customer
earns between $30,000 and
$50,000 and owns an
apartment worth between
$250,000 and $500,000, he will
buy a 4-door automobile
23
Chapter 10: Learning From Data
Multi-dimensional Rule
A rule that refers to a
multitude of dimensions
If a customer lives in a big
city and earns more than
$35,000, then he will buy a
cellular phone
This rule involves 3
attributes: living, earning,
and buying. Therefore, it is
a multi-dimensional rule
24
Chapter 10: Learning From Data
Multi-level Association Rule
25
Chapter 10: Learning From Data
Implications for Knowledge
Management
Cost / benefit analysis
Tangible costs - user training,
hardware + software, backup,
support, maintenance
Intangible costs - user resistance and
learning curve
Quality Assurance
Adequacy of initial design
Level and frequency of maintenance
26
Chapter 10: Learning From Data
.....
Layers of KM Architecture
1
2
3
User Interface
(Web browser software installed on each user’s PC)
Authorized access control
(e.g., security, passwords, firewalls, authentication)
Collaborative intelligence and filtering
(intelligent agents, network mining, customization, personalization)
Knowledge-enabling applications
4
5
(customized applications, skills directories, videoconferencing, decision support systems,
group decision support systems tools)
Transport
(e-mail, Internet/Web site, TCP/IP protocol to manage traffic flow)
Middleware
6
(specialized software for network management, security, etc.)
The Physical Layer
(repositories, cables)
7
Databases
Legacy applications
(e.g., payroll)
Groupware
(document exchange,
collaboration)
Data warehousing
(data cleansing,
data mining)
27