Data Mining: Crossing the Chasm

Download Report

Transcript Data Mining: Crossing the Chasm

ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute of Science & Technology
ELEMENTARY MACHINE LEARNING FOR WEBMINING
A Technical paper
National
Institute
Submitted
byOf Science And Technology
Satya narayan Sahu
Satya narayan Sahu
ROLL:IT200117269
[1]
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute of Science & Technology
INTRODUCTION
Algorithms that automatically discovers all pages
in a website whose location is different from the
location where visitors expect to find them and
for selecting the
set of navigation links to
National Institute Of Science And Technology
optimize the benefit to the website are proposed.
 It is hard to organize a website such that pages
are located where visitors expect to find them.

Satya narayan Sahu
[2]
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute of Science & Technology
APPROACHES
New search mechanism called SmartSeek is
introduced
Machine learning
concepts Employs genetic
National Institute Of Science And Technology
algorithm (GA)
System accepts user feedback
Satya narayan Sahu
[3]
National Institute of Science & Technology
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
FUNCTIONING OF SMARTSEEK
 SmartSeek sets up a database about user
objective
 It extracts information about the common words
and phrases
 Looks for similarNational
pagesInstitute
(metasearch
engine)
Of Science And
Technology
 Comparison with other pages
 Feedback
 Further enhancement
Satya narayan Sahu
[4]
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute of Science & Technology
SMARTSEEK MODEL
National Institute Of Science And Technology
Satya narayan Sahu
[5]
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute of Science & Technology
IN DETAIL
Most Relevant
 Moderate Relevancy
 Irrelevant
National Institute Of Science And Technology
 Operators AND (&), OR (|),NOT (~) and CONCAT
(-).
 After n such iteration

Satya narayan Sahu
[6]
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute of Science & Technology
CROSSOVER
National Institute Of Science And Technology
Satya narayan Sahu
[7]
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute of Science & Technology
THE AI
 Artificial Intelligence (AI) is simply a way of
thinking intelligently aims at




Natural
language
processing
National
Institute Of
Science And Technology
Knowledge representation
Automated reasoning
Machine learning
Satya narayan Sahu
[8]
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute of Science & Technology
WEB MINING
Web Mining can be said to have three operations
of interest:
finding natural grouping
of users
pages
National Institute
Of or
Science
And Technology
 finding URLs which tend to be grouped together
 finding the order in which URLs tend to be accessed

Satya narayan Sahu
[9]
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute of Science & Technology
ALGORITHMIC APPROACH
Finding Expected
Locations
National Institute Of Science And Technology
Optimizing the set of Navigation Links
Satya narayan Sahu
[10]
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute of Science & Technology
FINDING EXPECTED LOCATIONS

Model of Visitor Search Patterns
1(a) Single Target
National Institute Of Science And Technology
1(b) Set of Targets

Identifying Target Pages
Satya narayan Sahu
[11]
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute of Science & Technology
EXAMPLE
National Institute Of Science And Technology
Website and Search Pattern
Satya narayan Sahu
[12]
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute of Science & Technology
OPTIMIZING SET OF NAVIGATION LINKS
1) First Only
2)Optimize Benefit
National Institute Of Science And Technology
3)Optimize time
Satya narayan Sahu
[13]
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute of Science & Technology
EXPERIMENTS
National Institute Of Science And Technology
Wharton Website Structure
Satya narayan Sahu
By Rakesh Agrawal
[14]
National Institute of Science & Technology
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
EXAMPLE:
National Institute Of Science And Technology
Satya narayan Sahu
[15]
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute of Science & Technology
CONCLUSION & FUTURE DIRECTIONS
Algorithms have been proposed
Websites without
clear
separation
National
Institute
Of Scienceof
Andcontent
Technology
and index page
Time Threshold to distinguish between two
activities
Satya narayan Sahu
[16]
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute of Science & Technology
REFERENCES
1.www.ibm.com
2.www.almaden.com
Of Science And Technology
3.Mastering Data National
MiningInstitute
by Helenz
Hulki
4.How internet works by K.Perl
Satya narayan Sahu
[17]
National Institute of Science & Technology
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
THANK YOU
National Institute Of Science And Technology
Satya narayan Sahu
[18]
National Institute of Science & Technology
ELEMENTARY MACHINE LEARNING
FOR WEBMINING
National Institute Of Science And Technology
Satya narayan Sahu
[19]