Chapter 10
Machine Learning
Learning
1. Rote learning
2. Learning by taking advice
3. Learning by problem solving
   - Parameter adjustment
   - Macro-operators
4. Learning from examples
   - Induction: Winston's learning program, p. 458
   - Version spaces: the candidate elimination algorithm
   - Decision trees
5. Explanation-based learning, p. 482
6. Formal learning theory

Winston’s learning program
Concept (p. 459):
1. Begin with a structural description of one known instance of the concept. Call this description the concept definition.
2. Examine descriptions of other known instances of the concept. Generalize the definition to include them.
3. Examine descriptions of near misses of the concept. Restrict the definition to exclude these.
(A toy sketch of this loop appears at the end of this section.)

Key figures (p. 458 ff.):
- Blocks world concepts: Figure 17.2, p. 459
- Structural description: Figure 17.3, p. 460
- The comparison of two arches: Figure 17.4, p. 461
- The arch description after two examples: Figure 17.5, p. 462
- The arch description after a near miss: Figure 17.6, p. 463

Winston's program:
- uses semantic networks to describe block structures
- uses a matching process to detect similarities and differences between structures
- uses an isa hierarchy to describe relationships among already-known objects
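
A minimal sketch of the generalize/specialize loop above, assuming (beyond the slides) that each structure's semantic network is flattened into a set of relation triples; the MUST-style links produced from a near miss are represented as a separate "required" set.

```python
# A toy near-miss learner in the spirit of Winston's program.
# Assumption (not from the slides): each structure's semantic network is
# flattened to a set of (object, relation, object) triples.

def generalize(definition, positive):
    """Keep only the triples shared with a new positive instance."""
    return definition & positive

def specialize(definition, required, near_miss):
    """A near miss differs in a few triples; mark each triple present in
    the definition but absent from the near miss as required (MUST)."""
    return required | (definition - near_miss)

# Arch-like toy data: an arch has two posts supporting a lintel.
arch1 = {('post1', 'supports', 'lintel'), ('post2', 'supports', 'lintel'),
         ('lintel', 'isa', 'brick')}
arch2 = {('post1', 'supports', 'lintel'), ('post2', 'supports', 'lintel'),
         ('lintel', 'isa', 'wedge')}
near_miss = {('lintel', 'isa', 'brick')}   # lintel on the ground: no supports

definition = set(arch1)                     # step 1: first known instance
definition = generalize(definition, arch2)  # step 2: include other instances
required = specialize(definition, set(), near_miss)  # step 3: exclude near miss
print('definition:', definition)
print('MUST links:', required)
```

Winston's actual program matches the networks structurally rather than by set intersection; the set operations here are only the simplest stand-in for that matching process.
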
Version Spaces
The goal: to produce a description that is consistent with all positive examples, but with no negative examples, in the training set.
A frame represents the concept; for the car concept see Figure 17.7, p. 463.
Features/slots: { value1, value2, ..., valueN }, e.g. origin: { Japan, USA, Britain }
Variables: X1, X2, X3, ...
Concept space: see Figure 17.11 Concept of version spaces, p. 466 (variables, target concept, all training instances).
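
A small illustration of this frame language (the dict encoding and the '?' convention below are assumptions, not from the slides): an instance fills every slot with a concrete value, a hypothesis may leave slots as variables, and a hypothesis covers an instance when every bound slot agrees.

```python
# Hypothetical encoding of the car frames; '?' plays the role of X1, X2, ...
hypothesis = {'origin': 'Japan', 'manufacturer': '?', 'color': '?',
              'decade': '?', 'type': 'Economy'}       # "Japanese economy car"
instance = {'origin': 'Japan', 'manufacturer': 'Honda', 'color': 'Blue',
            'decade': '1980', 'type': 'Economy'}

def covers(h, x):
    """Every non-variable slot of the hypothesis must match the instance."""
    return all(v == '?' or v == x[slot] for slot, v in h.items())

print(covers(hypothesis, instance))  # True
```
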
Version space = current hypothesis = a subset of the concept space = the largest collection of descriptions that is consistent with all the training examples seen so far.
The version space is bounded by two sets, G and S:
- G contains the most general descriptions consistent with the training examples seen so far.
- S contains the most specific descriptions consistent with the training examples seen so far.
A positive example (+) moves S toward more general descriptions.
A negative example (-) moves G toward more specific descriptions.
If the G and S sets converge, the hypothesis is a single concept description.
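
The "more general than" relation of Figure 17.10 is the partial order along which S and G move. A minimal sketch, using the same tuple encoding as the worked example below:

```python
# h1 is at least as general as h2 if every slot h1 constrains is
# constrained to the same value by h2 ('?' stands for a variable).
def more_general_or_equal(h1, h2):
    return all(a == '?' or a == b for a, b in zip(h1, h2))

japanese_car = ('Japan', '?', '?', '?', '?')
japanese_economy_car = ('Japan', '?', '?', '?', 'Economy')

print(more_general_or_equal(japanese_car, japanese_economy_car))   # True
print(more_general_or_equal(japanese_economy_car, japanese_car))   # False
# Every hypothesis in the version space lies between some member of S
# and some member of G under this ordering.
```
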

Candidate Elimination Algorithm (pp. 466-467)
- an algorithm used to narrow down the version space
- it removes from the sets G and S any descriptions that are inconsistent with the training examples
(A runnable sketch follows the worked trace below.)

Car example:
- Figure 17.7 The concept Car: p. 463
- Figure 17.8 Representation language for car: p. 464
- Figure 17.9 The concept Japanese car: p. 464
- Figure 17.10 Partial ordering of concepts: p. 465
- Figure 17.12 Positive and negative examples of car: p. 467

Worked example: we want the concept "Japanese economy car".
From Figure 17.12 Positive and negative examples of car, p. 467.
Representation: [origin = X1, manufacturer = X2, color = X3, decade = X4, type = X5]
Process EX1 (+), EX2 (-), EX3 (+), EX4 (-), EX5 (+) in turn.

EX1 (+):
G = {(X1, X2, X3, X4, X5)}
S = {(Japan, Honda, Blue, 1980, Economy)}   <- EX1 from Figure 17.12

EX2 (-): check G first.
G = {(X1, Honda, X3, X4, X5), (X1, X2, Blue, X4, X5), (X1, X2, X3, 1980, X5), (X1, X2, X3, X4, Economy)}
S = {(Japan, Honda, Blue, 1980, Economy)}   ** unchanged, because the example is negative

EX3 (+): check G first.
G = {(X1, X2, Blue, X4, X5), (X1, X2, X3, X4, Economy)}
S = {(Japan, X2, Blue, X4, Economy)}

EX4 (-): check G first.
G = {(Japan, X2, Blue, X4, X5), (Japan, X2, X3, X4, Economy)}
S = {(Japan, X2, Blue, X4, Economy)}   ** unchanged, because the example is negative

EX5 (+):
G = {(Japan, X2, X3, X4, Economy)}
S = {(Japan, X2, X3, X4, Economy)}
G and S have converged: the concept is "Japanese economy car".
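
A runnable sketch that reproduces this trace. The slides label EX2-EX5 only as (+)/(-); the concrete attribute values below are assumptions, reconstructed so that the S and G updates match the steps above, and '?' plays the role of the variables X1..X5.

```python
# Minimal candidate elimination over conjunctive hypotheses
# [origin, manufacturer, color, decade, type].

def matches(h, x):
    """A hypothesis matches an instance if every non-'?' slot agrees."""
    return all(hv == '?' or hv == xv for hv, xv in zip(h, x))

def candidate_elimination(examples, n_attrs=5):
    G = [tuple('?' for _ in range(n_attrs))]
    S = None                          # assumes the first example is positive
    for x, positive in examples:
        if positive:
            if S is None:
                S = tuple(x)          # S starts at the first positive example
            else:                     # minimally generalize S to cover x
                S = tuple(sv if sv == xv else '?' for sv, xv in zip(S, x))
            G = [g for g in G if matches(g, x)]  # drop G members that miss x
        else:
            new_G = []
            for g in G:
                if not matches(g, x):            # already excludes the negative
                    new_G.append(g)
                    continue
                # Minimally specialize g: pin one '?' slot to S's value, so the
                # result excludes x but still covers S ("check G first").
                for i in range(n_attrs):
                    if g[i] == '?' and S[i] != '?' and S[i] != x[i]:
                        new_G.append(g[:i] + (S[i],) + g[i + 1:])
            G = new_G
        print('S =', S, '  G =', G)
    return S, G

examples = [
    (('Japan', 'Honda',    'Blue',  '1980', 'Economy'), True),   # EX1 (+)
    (('Japan', 'Toyota',   'Green', '1970', 'Sports'),  False),  # EX2 (-), assumed
    (('Japan', 'Toyota',   'Blue',  '1990', 'Economy'), True),   # EX3 (+), assumed
    (('USA',   'Chrysler', 'Blue',  '1980', 'Economy'), False),  # EX4 (-), assumed
    (('Japan', 'Honda',    'White', '1980', 'Economy'), True),   # EX5 (+), assumed
]

candidate_elimination(examples)
# Converges to S = G = ('Japan', '?', '?', '?', 'Economy'): "Japanese economy car".
```
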
Note: the candidate elimination algorithm is a least-commitment algorithm: it commits to as little as possible at each step.
Problems:
1. S and G may not converge to a single hypothesis.
2. If there is noise (inconsistent data), the algorithm fails prematurely: we may prune the target concept too early. For example, if the third example were given a negative sign (-) instead of a positive sign (+), then no matter how much data we saw, we could never find the concept. One fix is to maintain several G and S sets, BUT this is costly and may suffer from the bounded-inconsistency problem.
3. We cannot use OR in the concept descriptions, e.g. "Italian sports car OR German luxury car".

Decision Tree
ID3 program: to classify a particular input, we start at the top of the tree and answer questions until we reach a leaf, where the classification is stored. See Figure 17.13 Decision tree, p. 470.
1. Choose a window: a random subset of the training examples, used for training.
2. Examples outside the window are used to test the decision tree.
3. Build up the decision tree iteratively, using this empirical evidence.
4. Building a node = choosing some attribute that divides the training instances into subsets.
Leaves carry the (+)/(-) classification, so decision trees can express OR concepts: changing a (-) leaf into a (+) leaf simply adds another disjunct. (A sketch follows below.)
Problems: noisy input; attribute values may be unknown; the decision tree may be large, making the relationships hard to understand (see Figure 17.13).
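
A minimal ID3 sketch. The slides do not name the attribute-selection rule; the information-gain criterion below is the standard ID3 choice (an assumption here), and the toy car data are hypothetical.

```python
import math
from collections import Counter

def entropy(examples):
    """Shannon entropy of the labels in a list of (attrs, label) pairs."""
    counts = Counter(label for _, label in examples)
    total = len(examples)
    return -sum(c / total * math.log2(c / total) for c in counts.values())

def information_gain(examples, attr):
    """Entropy reduction from splitting the examples on one attribute."""
    total = len(examples)
    remainder = sum(
        len(subset) / total * entropy(subset)
        for v in {x[attr] for x, _ in examples}
        for subset in [[(x, l) for x, l in examples if x[attr] == v]]
    )
    return entropy(examples) - remainder

def id3(examples, attributes):
    labels = {l for _, l in examples}
    if len(labels) == 1:                  # pure node: store the class at a leaf
        return labels.pop()
    if not attributes:                    # no attributes left: majority class
        return Counter(l for _, l in examples).most_common(1)[0][0]
    best = max(attributes, key=lambda a: information_gain(examples, a))
    return (best, {
        v: id3([(x, l) for x, l in examples if x[best] == v],
               [a for a in attributes if a != best])
        for v in {x[best] for x, _ in examples}
    })

# Hypothetical training window in the spirit of the car examples:
window = [
    ({'origin': 'Japan', 'type': 'Economy'}, True),
    ({'origin': 'Japan', 'type': 'Sports'}, False),
    ({'origin': 'USA', 'type': 'Economy'}, False),
    ({'origin': 'Japan', 'type': 'Economy'}, True),
]
print(id3(window, ['origin', 'type']))
```
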
Explanation-Based Learning
- learns by providing an explanation of why a training example is an instance of the concept
- depends on a domain theory / domain knowledge
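
A minimal explanation-based generalization sketch, under assumptions beyond the slides: the domain theory is a set of propositional Horn-style rules (the classic toy "cup" theory, hypothetical here), and the learned rule keeps exactly the primitive features the explanation used.

```python
# Toy domain theory: a concept holds if all of its subconditions hold;
# subconditions are either defined concepts or primitive features.
THEORY = {
    'cup': ['liftable', 'stable', 'open_vessel'],
    'liftable': ['light', 'has_handle'],
    'stable': ['flat_bottom'],
    'open_vessel': ['upward_concavity'],
}

def explain(goal, example, used):
    """Prove `goal` from the example's primitive features, recording the
    features the explanation actually uses."""
    if goal in example:             # primitive feature observed directly
        used.add(goal)
        return True
    if goal not in THEORY:
        return False
    return all(explain(sub, example, used) for sub in THEORY[goal])

# One training example, described by primitive features:
example = {'light', 'has_handle', 'flat_bottom', 'upward_concavity', 'red'}
used = set()
if explain('cup', example, used):
    # The generalized rule keeps only the features the explanation used;
    # irrelevant ones (like 'red') are dropped.
    print('cup <-', sorted(used))
```
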
Formal Learning Theory
Given positive and negative examples, produce an algorithm that will classify future examples correctly with probability 1 - 1/h, where h is the error tolerance.
Complexity of learning depends on:
- the error tolerance (h)
- the number of binary features present in the examples (t)
- the size of the rule necessary to make the discrimination (f)

If the number of training examples required is polynomial in h, t, and f, then the concept is learnable.
If few training examples are needed, the concept is learnable.
We can also restrict the learner to positive examples only.
See Figure 17.22 The concept of elephant, p. 483: elephant = "gray, mammal, large".
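
As a hedged aside (not on the slides): the standard Valiant-style sample bound behind this criterion says that a consistent learner with error tolerance 1/h and failure probability delta needs

```latex
m \;\ge\; h\left(\ln\lvert H\rvert + \ln\tfrac{1}{\delta}\right)
\qquad \text{training examples, with } \epsilon = \tfrac{1}{h}.
```

For conjunctive rules of size f over t binary features, ln|H| = O(f ln t), so m is polynomial in h, t, and f, matching the learnability criterion above.
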
Induction
Emphasis on all beans: all instances.
Induction: a method of reasoning by which one infers a generalization from a series of instances.
Inductive syllogisms are of the following form:
1. These beans are from this bag. (And these beans..., and these beans..., etc.)
2. These beans are (all) white.
3. Therefore, all beans from this bag are white.
In a much broader sense, induction can be thought to include various other forms of reasoning, including reasoning by analogy, inference to causes from symptoms, and confirmation of laws and theories.

1
29
344-471 AI & ES
2
Chapter 10 ML
Deduction
Emphasis on one bean: one instance.
Deduction: a method of reasoning by which one infers a conclusion from a set of sentences by employing the axioms and rules of inference for a given logical system.
The term 'deduction' is used in a general sense to denote the fact that a conclusion follows necessarily from the premises.
Deductive syllogisms in quantificational predicate calculus are of the following form:
1. All beans from this bag are white.
2. These beans are from this bag.
3. Therefore, these beans are white.

Abduction
Emphasis on the best explanation for the observed beans.
Abduction: a method of reasoning by which one infers to the best explanation; a heuristic procedure that reasons inductively from available empirical evidence to the discovery of the probable hypotheses that would best explain its occurrence.
Abductive syllogisms are of the following form:
1. All beans from this bag are white.
2. These beans are white.
3. Therefore, these beans are (probably) from this bag.

The End