Personalized e-news monitoring agent system
Download
Report
Transcript Personalized e-news monitoring agent system
國立雲林科技大學
National Yunlin University of Science and Technology
N.Y.U.S.T.
I. M.
Personalized e-news monitoring agent system for
tracking user-interested Chinese news events
Presenter : Yu-hui Huang
Authors :Chih-Ming Chen · Chao-Yu Liu
AppInt 2009
1
Intelligent Database Systems Lab
Outline
Motivation
Objective
Methodology
Experiments
Conclusion
Comments
N.Y.U.S.T.
I. M.
2
Intelligent Database Systems Lab
Motivation
N.Y.U.S.T.
I. M.
reading or monitoring everyday news stories from the Internet is a
difficult and time-consuming job for modern humans.
But many irrelevant news events are retrieved, resulting in a high
recall rate and low precision rate.
3
Intelligent Database Systems Lab
Objective
N.Y.U.S.T.
I. M.
To improve the performance of Google news alert
Develop a personalized e-news monitoring agent system to tracking
user-interested news based on topic-based scheme.
And develop extension word segmentation system that is ECScanner.
4
Intelligent Database Systems Lab
Methodology
N.Y.U.S.T.
I. M.
System architecture
5
Intelligent Database Systems Lab
Methodology
N.Y.U.S.T.
I. M.
Word segmentation system-ECScanner
Step1:determine new work
Step2: judge new work or not by self-defining threshold
6
Intelligent Database Systems Lab
Methodology
N.Y.U.S.T.
I. M.
The news words management interface for linguistic experts
7
Intelligent Database Systems Lab
Methodology
N.Y.U.S.T.
I. M.
The news event monitoring agent by two-phase scheme for tracking
First phase:cosine measure
Measure similarity between the user-interested and news category
Measure similarity between the user-interested and other news
Ex:user-interested A (書雅,帥哥)=(1,1)
B (書雅,帥哥)=(1,1)
but if A (帥哥,書雅)=(1,1)
x
B (書雅,帥哥)=(1,1)
8
Intelligent Database Systems Lab
Methodology
Hamming distance
Modified cosine measure
N.Y.U.S.T.
I. M.
second phase-filter out misclassified & recommend highest Avg_Msim
9
Intelligent Database Systems Lab
Experiments
N.Y.U.S.T.
I. M.
10
Intelligent Database Systems Lab
Experiments
N.Y.U.S.T.
I. M.
11
Intelligent Database Systems Lab
Experiments
N.Y.U.S.T.
I. M.
12
Intelligent Database Systems Lab
Conclusion
N.Y.U.S.T.
I. M.
Strict threshold value will reduce the number of possible candidate
new words. On the contrary, a loose threshold value will lead to over
large number of candidate new words.
However the ECScanner performance is superior to CKIP
13
Intelligent Database Systems Lab
Comments
Advantage
This paper’s writing step by step is very clear .
Drawback
N.Y.U.S.T.
I. M.
…
Application
News tracking , blog mining , customer behavior analysis…
14
Intelligent Database Systems Lab