Transcript 第七章数据挖掘应用
数据挖掘应用
客户关系管理(CRM)
顾客生命周期
盈利
收入
寿命
支出
获取消费者
保持消费者
寿命
消费者分析和恢复
顾客数据
数据挖掘在CRM中的应用
Customer identification
CRM begins with customer identification. This phase involves
targeting the population who are most likely to become
customers or most profitable to the company.
It also involves analyzing customers who are being lost to
the competition and how they can be won back.
Elements for customer identification include target customer
analysis and customer segmentation.
Customer attraction
Organizations can direct effort and resources into
attracting the target customer segments.
Direct marketing is a promotion process which motivates
customers to place orders through various channels.
direct mail or coupon
目标营销
Customer retention
Central concern for CRM.
Customer satisfaction is the essential condition for retaining
customers.
Elements of customer retention include one-to-one marketing,
loyalty programs and complaints management.
One-to-one marketing refers to personalized marketing
campaigns which are supported by analyzing, detecting and
predicting changes in customer behaviors.
Loyalty programs involve campaigns or supporting activities
which aim at maintaining a long term relationship with customers.
Churn analysis, credit scoring, service quality or satisfaction form
part of loyalty programs.
客户流失分析
Customer development
Elements of customer development include customer lifetime
value analysis, up/cross selling and market basket analysis.
Customer lifetime value analysis is defined as the prediction of
the total net income a company can expect from a customer.
Up/Cross selling refers to promotion activities which aim at
augmenting the number of associated or closely related services
that a customer uses within a firm.
Market basket analysis aims at maximizing the customer
transaction intensity and value by revealing regularities in the
purchase behaviour of customers.
SPSS通讯行业分析专题
SPSS Modeler通讯行业分析模型
Personalized recommendation systems
Personalized recommendation
Personalization is defined as “the ability to provide
content and services tailored to individuals based on
knowledge about their preferences and behavior” or
“the use of technology and customer information to
tailor electronic commerce interactions between a
business and each individual customer”
Internet recommendation systems (Internet recommender systems) in electronic
commerce is to reduce irrelevant content and provide users with more pertinent
information or product.
A recommendation system is a computer-based system that uses profiles built
from past usage behavior to provide relevant recommendations.
Information filtering and recommendation
rule-based filtering, content-based filtering, and
collaborative filtering.
Rule-based filtering uses pre-specified if-then rules to select
relevant information for recommendation.
Content-based filtering uses keywords or other productrelated attributes to make recommendations.
Collaborative filtering uses preferences of similar users in
the same reference group as a basis for recommendation.
Typical personalization process
understanding customers through profile building
delivering personalized offering based on the knowledge about
the product and the customer
measuring personalization impact
Inadequate information in IR
One possible solution for overcoming the problem is to
expand the query by adding more semantic information to
better describe the concepts. Relevance feedbacks and
knowledge structure are used to add appropriate terms to
expand the queries.
Relevance feedbacks are information on the items selected
by the user from the output of previous queries.
Spreading Activation Model
In the Spreading Activation (SA) Model, concepts are expanded
based on the semantics in the process of identifying customer
profile and matching items and the model has been applied to
expand queries.
A personalized knowledge
recommendation system
A semantic-expansion
approach to build the user
profile by analyzing
documents previously read
by the person.
The semantic-expansion
approach that integrates
semantic information for
spreading expansion and
content-based filtering for
document recommendation.
A sample semantic-expansion network
Experimental results
An empirical study using master theses in the National Central
library in Taiwan shows that the semantic-expansion approach
outperforms the traditional keyword approach in catching user
interests.
构件库管理
自适应构件检索
构件检索是构件库研究中的重要问题,有效的构件检索机
制能够降低构件复用成本。
构件的复用者并不是构件的设计者或构件库的管理员,在
检索构件时对构件库的描述理解不充分,导致难以给出完
整和精确的检索需求。
用户选择构件的结果反映其真实需求,如果能够从用户的
检索行为以及用户对检索结果的反馈中推断出用户的非精
确检索条件与用户实际需要的精确检索条件之间内在联系
的模式,就可以提高系统的查准率。
基于关联挖掘的自适应构件检索
把关联规则挖掘方法引入构件检索,从用户检索行为以及
反馈中挖掘出非精确检索条件与精确检索结果之间的关联
规则,从而调整检索机制,提高构件检索的查准率。
实例
{windows} {windows ,SQL Server}
{Linux} {Linux ,Mysql}
{金融} {金融,SQL Server}
{windows ,金融} {windows ,金融,SQL Server}
供应链管理
零部件供应商选择
如何选择供应商不仅决定了产品的质量和成本,也决定了
产品的销售价格、维护费用和用户满意程度。
选择供应商一般以满足时间约束的条件下最小化物流成
本为目标,没有考虑零部件故障率与不同地域环境之间
的相关性。
基于关联规则的零部件供应商选择
使用关联规则挖掘算法,从产品维修记录中,寻找
不同供应商提供的产品零部件及其组合在不同地域
的频繁故障模式。
在生成供应商选择和配送方案过程中,利用这些频
繁故障模式,选择合适的零部件供应商组合,达到
物流成本与产品维护成本的联合优化。
人力资源管理
人力资源管理
人力资源在高科技公司中的地位相当重要。人力招聘直
接影响公司员工的素质,但传统的人力资源管理方法已
经不适应高科技公司的需要。
高科技行业知识不断变化,工作不易定界,跨职能任务
较多,工作过程趋于多元化。这些因素都对员工素质提
出了更高的要求,依靠传统方法获知竞聘者是否能够胜
任工作变得比较困难。
采用决策树挖掘出人员选拔规则
CHAID
Decision tree for predicting job performance
Improving education
Improving teaching and learning
Instructors can have trouble identifying their real
difficulties in learning.
Based on the students’ testing records, the system
works to identify and find those problems, and then
comes up with its suggestions for designing new
teaching strategies.
Assist teachers to identify students’ specific
difficulties and weaknesses in learning.
Helps the student to find out his or her weak points
in learning and offers improvement recommendations.
ESL recommender teaching and learning
Right/wrong answer statistical table
For every student, the system creates a
right/wrong answer statistical table: a
wrong answer is represented by 1 and a
right answer by 0.
Summary table of students’ wrong
answers
The right/wrong answer statistical tables for respective
students are integrated in a summary table
of students’ wrong answers, and the sum values in the
table are then ranked in descending order so as to show
the descending degrees of weaknesses the students have
collectively .
Hierarchical clustering
Hierarchical clustering
algorithm is then
applied to data
collected to segment
the students into a
certain number of
clusters, or categories,
each of which
includes students
sharing the same or
similar characteristics.
All students’ right/wrong answer
statistical tables
Clustering analysis
A clustering analysis is made of the data in All students’
right/wrong answer statistical tables. It is evident that the
students whose numbers are enclosed in the following separate
parentheses belong to different clusters respectively: (9,15, 6, 17, 13,
19, 14, 5); (22, 23, 4, 3, 21, 11, 24, 20, 7, 1);(12, 18, 2, 8, 25, 10, 16).
搜索引擎优化
搜索引擎优化
They are usually not search engines by
themselves.
The clustering engine uses one or more
traditional search engines to gather a number
of results; then, it does a form of postprocessing on these results in order to cluster
them into meaningful groups.
The post-processing step analyzes snippets,
i.e., short document abstracts returned by the
search engine, usually containing words
around query term occurrences.
E-Commerce Recommender Systems
Background
E-commerce has allowed businesses to
provide consumers with more choices.
Increasing choice, however,has also brought
about information overload.
E-commerce stores are applying mass
customization principles to their
presentation in on-line stores. One way to
achieve mass customization in e-commerce
is the use of recommender systems.
What is E-commerce Recommender Systems?
Recommender systems are used
by e-commerce sites to suggest
products to their customers and to
provide consumers with
information to help them decide
which products to purchase.In a
sense, recommender systems
enable the creation of a new store
personally designed for each
consumer(one-to-one marketing).
Tool for database marketing and
CRM
The Structure of Recommender Systems
•A typical e-commerce recommender application
includes the functional I/O, the recommendation
method.
Recommendation
Method
•Targeted customer inputs
•Community inputs
•Outputs
Targeted Customer Inputs
explicit navigation inputs are intentionally
made by the customer with the purpose
of informing the recommender
application of his or her preferences—
keywords search,registration etc.
Implicit inputs:specific item or items that the
customer is currently viewing or those items in
the customer's shopping cart(purchase history).
Community Inputs
community purchase history
best-seller lists
text comments
Output Recommendations
a set of suggestions:ordered list or unordered
lists
Ratings
meta-rating: rating the comments themselves
text comments
item-to-item correlation
user-to-user correlation
Top-N
Email marketing
Delivery and Presentation
Push methods reach a customer who is not
currently interacting with the system for
example, by sending e-mail, recommendations
for related products.
Pull methods notify customers that
personalized information is available but
display this information only when the
customer explicitly requests it.
Other types of visualization.
Recommendation Methods
Statistical summaries of community opinion withincommunity popularity measures and aggregate or
summary ratings Association analysis
Content-based recommendations: The user will
be recommended items similar to the ones the
user preferred in the past;
. Collaborative recommendations: The user will be
recommended items that people with similar
tastes and preferences liked in the past;
Hybrid approaches: These methods combine
collaborative
and content-based methods.
Examples?
Techniques for Recommendation
Many techniques from data mining can
be adapted to the scalability problem for
recommender systems:nearestneighbor,classifiers(rule induction, neural
networks, and Bayesian networks),
clustering,association
Web usage mining and more general
commerce-related data mining may
reveal techniques for exploiting complex
behavioral data.
Effectiveness of Recommendations
Developing good metrics to measure the
effectiveness of recommendations.
The performance evaluation of
recommendation algorithms is usually done
in terms of coverage and accuracy metrics.
Coverage measures the percentage of items
for which a recommender system is capable
of making predictions; Accuracy measures
can be either statistical or decision-support.
E-commerce Recommender Applications
E-recommender systems enhance Ecommerce sales (2 %-8 %) in the
following ways:
Converting Browsers into Buyers
Increasing Cross-sell
Building Credibility through Community
Inviting customers back
Give the type of feedback needed for marketing
professionals
Issues and Challenges(1)
Scalability and real-time performance:a large Web
site must produce each recommendation within a few tens of milliseconds
while serving hundreds or thousands of consumers simultaneously.
the extreme sparsity of ratings in recommender
systems
Limited Content Analysis
Overspecialization
New User Problem
not enough data is also a challenge for
recommender systems
Intrusiveness
Issues and Challenges(2)
Incorporating Rich Data:explicit ratings
and simple behavioral data such as
purchases,click stream and many other types of
data
Hybrid recommendation
Privacy Issues
E-commerce sites may learn a great deal about
customers without the customers' awareness or
consent.
Privacy policies
Technological approaches for automating enforcement of
privacy policies:Anonymizing techniques,the Platform
for Privacy Preferences (P3P)
Other issues…
Background
E-commerce has allowed businesses to
provide consumers with more choices.
Increasing choice, however,has also brought
about information overload.
E-commerce stores are applying mass
customization principles to their
presentation in on-line stores. One way to
achieve mass customization in e-commerce
is the use of recommender systems.
Recommender systems
Recommender systems form a
specific type of information
filtering technique that attempts to
predict and present items (movies,
music, books, news, images, web
pages) a user may be interested in.
What is E-commerce Recommender Systems?
Recommender systems are used by
e-commerce sites to suggest
products to their customers and to
provide consumers with information
to help them decide which products
to purchase.In a sense,
recommender systems enable the
creation of a new store personally
designed for each consumer(one-toone marketing).
Tool for database marketing and
CRM
The Structure of Recommender Systems
•A typical e-commerce recommender application
includes the functional I/O, the recommendation
method.
Recommendation
Method
•Targeted customer inputs
•Community inputs
•Outputs
Targeted Customer Inputs
explicit navigation inputs are intentionally
made by the customer with the purpose
of informing the recommender
application of his or her preferences—
keywords search,registration etc.
Implicit inputs:specific item or items that the
customer is currently viewing or those items in
the customer's shopping cart(purchase history).
Community Inputs
community purchase history
best-seller lists
text comments
Recommendations Output
a set of suggestions:ordered list or unordered
lists
Ratings
meta-rating: rating the comments themselves
text comments
item-to-item correlation
user-to-user correlation
Top-N
Email marketing
Delivery and Presentation
Push methods reach a customer who is not
currently interacting with the system for
example, by sending e-mail, recommendations
for related products.
Pull methods notify customers that
personalized information is available but
display this information only when the
customer explicitly requests it.
Other types of visualization.
Recommendation Methods
Statistical summaries of community opinion withincommunity popularity measures and aggregate or
summary ratings Association analysis
Content-based recommendations: The user will
be recommended items similar to the ones the
user preferred in the past;
. Collaborative recommendations: The user will be
recommended items that people with similar
tastes and preferences liked in the past;
Hybrid approaches: These methods combine
collaborative
and content-based methods
Social network-based social recommender system
Social recommender systems
Social network is a very important source
of information to profile users. Taking
part into social relationships may cause
individuals to modify their attitudes and
behaviors.
Individuals become interested in topics or
subjects that do not necessarily match
their personal preferences and tastes, but
that reflect those of their social network.
Social psychology
Joining in a network with other people
exposes individuals to social dynamics
which can influence their attitudes and
behaviors.
Individuals become interested in topics or
subjects that do not necessarily match
their personal pre-existing preferences
and tastes, but that reflect those of the
network.
Theories of social influence
Social Conformity: people belonging to a group usually
experience a “pressure to conform”, namely, they tend
to change their attitudes and behaviors to match the
expectations of the other members.
Social Comparison: people who are new to a certain
context or are not expert of certain domain should be
interested in topics which reflect the opinions of other
individuals in their network.
Social Facilitation: people who are interested in a certain
topic, but lack strong motivation, should appreciate
information showing that other people in their network
share their interest.
SoNARS algorithm
Techniques for Recommendation
Many techniques from data mining can
be adapted to the scalability problem for
recommender systems:nearestneighbor,classifiers(rule induction, neural
networks, and Bayesian networks),
clustering,association
Web usage mining and more general
commerce-related data mining may
reveal techniques for exploiting complex
behavioral data.
A recommender system for the cosmetic business
First of all, we can categorize the
customers with the clustering
algorithm.
Then we use content-based method
to set a basic average score for each
product in each cluster.
Finally, we apply the association
algorithm to the transaction data
assign more recommender scores to
existing associated products.
A location-aware recommender system for
mobile shopping environments
When receiving a
service request,
the on-line
subsystem
generates a list of
possibly
interesting web
pages based on
the customer’s
interests profile,
vendor data,and
the
instantaneous
position of the
customer
provided by the
location manager.
Other application fields
Recommending conference paper submission to
reviewing committee members
Smart recommendation for an Evolving E-learning
system
Recommendation of citations for research papers
Knowledge recommender system
……
Effectiveness of Recommendations
Developing good metrics to measure the
effectiveness of recommendations.
The performance evaluation of
recommendation algorithms is usually
done in terms of coverage and accuracy
metrics.
Coverage measures the percentage of
items for which a recommender system
is capable of making predictions;
Accuracy measures can be either
statistical or decision-support.
E-commerce Recommender Applications
E-recommender systems enhance Ecommerce sales (2 %-8 %) in the
following ways:
Converting Browsers into Buyers
Increasing Cross-sell
Building Credibility through Community
Inviting customers back
日用消费品
Give
the type of feedback needed for marketing
professionals
Issues and Challenges(1)
Scalability and real-time performance:a large Web
site must produce each recommendation within a few tens of milliseconds
while serving hundreds or thousands of consumers simultaneously.
the extreme sparsity of ratings in recommender
systems
Limited Content Analysis
Overspecialization
New User Problem
not enough data is also a challenge for
recommender systems
Intrusiveness
Issues and Challenges(2)
Incorporating Rich Data:explicit ratings
and simple behavioral data such as
purchases,click stream and many other types of
data
Hybrid recommendation
Privacy Issues
E-commerce sites may learn a great deal about
customers without the customers' awareness or
consent.
Privacy policies
Technological approaches for automating enforcement of
privacy policies:Anonymizing techniques,the Platform
for Privacy Preferences (P3P)
Other issues…
研讨题
阅读后面参考文献,分析案例使用的数据挖掘方
法以及解决的主要问题。
结合自己的实践,说明所在岗位对商务智能的需
求(针对软件工程硕士)。
典型参考文献(1)
Chen-Fu Chien, Li-Fei Chen. Data mining to improve personnel selection and enhance
human capital: a case study in high-technology industry. Expert Systems with
Application, 2008,34(1):280-290
Cristo´bal Romero, Sebastia´n Ventura, Enrique Garcı´a. Data mining in course
management systems: Moodle case study and tutorial. Computers & Education 51
(2008) 368–384
Yang, C. C. et al., Improving scheduling of emergency physicians using data mining
analysis, Expert Systems with Applications (2008), doi:10.1016/j.eswa.2008.02.069
Jang Hee Lee, Sang Chan Park. Intelligent profitable customers segmentation system
based on business intelligence tools. Expert Systems with Applications 29 (2005): 145–
152
Chih-Ming Chen, Ying-Ling Hsieh, Shih-Hsun Hsu. Mining learner profile utilizing
association rule for web-based learning diagnosis. Expert Systems with Applications 33
(2007) 6–22
Bong-Horng Vhu, Ming-Shian Tsai, Cheng-Seen Ho. Toward a hybrid data mining
model for cluster retention. Knowledge-Based Systems 20 (2007) 703–718
典型参考文献(2)
Daniela Grigoria, Fabio Casatib, Malu Castellanos, et al. Business process intelligence.
Computers in Industry 53 (2004) 321–343
Dursun Delen, Christie Fuller, Charles McCann. Analysis of healthcare coverage: A data
mining approach. Delen, D. et al., Analysis of healthcare coverage: A data mining
approach, Expert Systems with Applications (2007), doi:10.1016/j.eswa.2007.10.041
Mei-Hua Hsu.Proposing an ESL recommender teaching and learning system.Expert
Systems with Applications.2008,34(3):2102–2110
Yi-Fan Wang, Ding-An Chiang, Mei-Hua Hsu,et al. A recommender system to avoid
customer churn: A case study. Expert Systems with Applications,2009,36:8071–8075
倪日文,徐晓飞,邓胜春.基于关联规则的零部件供应商选择优化. 计算机集成制造系统,
2004, 10(3): 317-335
薛云皎 ,钱乐秋,花鸣等.一种基于关联挖掘的自适应构件检索方法. 电子学报,
2004,32(12A): 203-206
Ting-Peng Liang,Yung-Fang Yang,Deng-Neng Chen, et al. A semantic-expansion
approach to personalized knowledge recommendation.Decision Support
Systems,2008 ,45(3):401-412
Giansalvatore Mecca, Salvatore Raunich, Alessandro Pappalardo. A new algorithm for
clustering search results.Data & Knowledge Engineering 62 (2007) 504–522
典型参考文献(3)
Schafer, J.B., Konstan, J.A., and Riedl, J. 1999. Recommender Systems in ECommerce. In ACM Conference on Electronic Commerce (EC-99), pages 158-166.
Resnick and Varian. Recommender systems. Communications of the ACM ,
1997 ,40 (3) :56 – 58
SCHAFER J B , KONSTAN J ,RIEDL J . Recommender systems in e - commerce.
Proceedings of the First ACM Conference on Electronic Commerce. Denver , CO ,
1999. 158 - 166.
BEN J , KONSTAN J A , JOHN R. E - commerce recommendation applications.
University of Minnesota ,2001,pp1-24
Yu Li,Liu Lu. Research on personal ized recommendations in E- business.
Computer Integrated Manufacturing Systems,2004,10(10):1306-1313(in
Chinese)
TRAN T ,COHEN R. Hybrid recommender systems for elec2
tronic commerce [ R ] . Knowledge - Based Electronic Markets ,Papers from the
AAAI Workshop , AAAI Technical Report WS- 00 - 04. Menlo Park , CA : AAAI
Press ,78 - 83.
典型参考文献(4)
Gediminas Adomavicius,Alexander Tuzhilin.Toward the Next Generation of Recommender Systems: A
Survey of the State-of-the-Art and Possible Extensions. IEEE TRANSACTIONS ON KNOWLEDGE
AND DATA ENGINEERING, 2005,17(6):734-749
Gediminas Adomavicius,Alexander Tuzhilin. Using data mining methods to build
customer profiles.IEEE Computer, 2001:74-82
G.-J. Houben et al. (Eds.): UMAP 2009, LNCS 5535, pp. 223–234, 2009
Yi-Fan Wang, Yu-Liang Chuang, Mei-Hua Hsu,et al. A personalized recommender system
for the cosmetic business. Expert Systems with Applications 26 (2004) 427–434
Wan-Shiou Yang *, Hung-Chi Cheng, Jia-Ben Dia. A location-aware recommender system
for mobile shopping environments. Expert Systems with Applications 34 (2008) 437–445
Chumki Basu, Haym Hirsh, William W. Cohen, et al. Technical paper recommendation: a
study in combining multiple information sources. Journal of Artificial Intelligence
Research,1(2001):231-253
T. Tang and G. McCalla. Smart recommendation for an evolving e-learning system:
Architecture and experiment. International Journal on e-Learning, vol. 4, pp. 105-129,
2005
Sean M. McNee, Istvan Albert, Dan Cosley, et al. On the recommending of citations for
research papers. CSCW’02, 2002: 16-20
Lu Zhen, George Q. Huang, Zuhua Jiang. An inner-enterprise knowledge recommender
system. Expert Systems with Applications, 37(2010):1703-1712
Francesca Carmagnola, Fabiana Vernero, Pierluigi Grillo. SoNARS: a social networksbased algorithm for social recommender systems. G.-J. Houben et al.(Eds), LNCS 5535,
2009:223-234