The proposed ART-like clustering algorithm

Download Report

Transcript The proposed ART-like clustering algorithm

A hierarchical approach to ARTlike Clustering Algorithm
Author : Mu-Chun Su
Yi-Chun Liu
Graduate : Chien-Ming Hsiao
Outline

Motivation
Objective
Introduction

The proposed ART-like clustering algorithm

Conclusion

Personal Opinion


Motivation

Two crucial problems for most of
clustering algorithm
1.
2.
The determining of the optimal number of
clusters
The determining of the similarity measure base
on which patterns are assigned to
corresponding clusters
Objective

Propose a hierarchical approach to ART-like
clustering algorithm which is able to deal with
data consisting of arbitrarily geometrical-shaped
clusters

Feasible solution to the two problems of determining
the number of clusters and clustering
Introduction (1/2)
Cluster validity problem



The estimation of the number of clusters in the data set
To solving the cluster validity problem usually
involves
(1) increasing the number of clusters
(2) merging the existing cluster

computing some certain cluster validity measures in each
run, until partition into optimal number of clusters is
obtain
Introduction (2/2)




It is easy to consider the idea of a data cluster on a rather
informal basis
It is very difficult to give a formal and universal
definition of a cluster.
Most of the conventional clustering methods assume
that patterns having similar locations or constant density
create a single cluster
Different similarity measures will result in different
clustering results
The proposed ART-like clustering
algorithm

The adaptive resonance theory (ART) networks
are able to cluster data without pre-specifying the
number of clusters.


The vigilance parameter to some extent implicitly prespecifies
How much confidence do we have on the
clustering results created by ART networks ?
The proposed ART-like clustering
algorithm

Hierarchical clustering method



Agglomerative
Divisive
To use the advantage provided by a dendrogram to
increase our confidence on selecting a more
trustable clustering results created by ART-like
clustering algorithm
The proposed ART-like clustering
algorithm
Fig.1 The idea of using of a set of ellipses to approximate
an arbitrarily-shaped cluster
The proposed ART-like clustering
algorithm

Adopt the neurons with quadratic neuron-type
junctions


To cluster data into hyperellipsoids
The output of a quadratic neuron indicates the
similarity between the present input and the prototype
of the hyperellipsoid embedded in the synaptic
weights of the neurons
The proposed ART-like clustering
algorithm

The output of a quadratic neuron is computed by
using the following equations :
n
y jk  x    w jki xi
i 1
net j  x     y jk  x   b jk 
n
2
k 1
Out j  x   e
 s j 2 net j  x 
where b jk , s j , and w jki are adjustable weights, x  x1 ,, xn  is an input pattern,
T
y j   y j1 ,, y jn  is the transform ed version of the input pattern x, and Out j x 
T
is the output function of neuron j
The proposed ART-like clustering
algorithm

Stage one : generate a single-layer neural
network


Step 1.1 : specify the initial value of the vigilance parameter and
corresponding parameters
Step 1.2 : if the present input pattern is the first pattern then generate a
quadratic neuron whose weights are initialized
b1 0  x
1 if k  i
w1ki 0  
0 o.w

Step 1.3 find the wining neuron j among the neurons generated in the
network, using the following criterion:
*
j *  j  arg min Out j x 
j 1,, J
The proposed ART-like clustering
algorithm

Step1.4 : if the output of the winning neuron is larger than
the threshold. Then adjust the synaptic weights of the
winning neurons
b jk new  b jk old   b 2s 2j Out j x  y jk x   b jk 
w jki new  w jki old    w  2s 2j Out j x  y jk x   b jk xi 
otherwise, a new quadratic neuron is generated and
initialized as follows:
b1 0  x
1 if k  i
w1ki 0  
0 o.w

Step 1.5 : present next data pattern into the network and
repeat steps 1.3-1.4 until all data patterns are processed.
The proposed ART-like clustering
algorithm

Stage two : generate a dendrogram for the generated
neural network

Step 2.1: set the cycle number k to be 0, determine the value
of a real-valued constantαwhose value is in the range (0,1),
and initialize an N * J matrix E to be a zero matrix.
Step 2.2: fill out the elements in the matrix E
k+1

k+1
 1 if Out j x i    k 1
eij  
0 o.w.

for 1  i  N ,1  j  J
Step 2.3: find so-called equivalent sets by examining the
matrix E
Step 2.4: Increase the value of k by one and repeat steps 2.22.3 until the number of cycles reaches a pre-specified limit
or the number of equivalent sets, N , becomes just one.
k+1

K 1
e
The proposed ART-like clustering
algorithm
The proposed ART-like clustering
algorithm
Conclusion


Utilize the dendrogram to help us to
visually select believable clusters and
clustering.
Any clustering result must be viewed with
extreme suspicion since the performance of
most of the clustering algorithms more or
less depends on some parameters.
Personal Opinion

No clustering algorithm can claim it works
perfectly for any application.