SPSS presentation

Download Report

Transcript SPSS presentation

The SPSS Portfolio
Tim Daciuk
Services Manager, Canada
SPSS Inc.
April 12, 2016
1
SPSS At A Glance

Leadership



Stability


30+ year heritage in analytic technologies
Proven track record



Market leader in Predictive Analytics
Focus on online & offline customer data acquisition and analysis
250,000+ customers worldwide
NASDAQ: SPSS
Analytics standard



80% of Fortune 500 are SPSS customers
80% plus market share in Survey & Market Research sector
Ranked #1 Data Mining solution by KD Nuggets
2
Public Sector Customers
Centers for Disease Control
Internal Revenue Service
DHHS Office of Inspector General
TX Comptroller of Public Accounts
NY Department of Public Health
UK Gov’t Communications Bureau
Canada Revenue Agency
Department of Justice
Centers for Medicare & Medicaid
USAF Leaders Project
Florida Department of Revenue
US Department of State
HM Customs & Excise
US Army Recruiting Command
New York City Human Resources
US Joint Forces Command
US Dept. of Agriculture
3
Predictive Analytics: Defined
Predictive analysis helps connect data to effective
action by drawing reliable conclusions about
current conditions and future events.
— Gareth Herschel, research director, Gartner group
4
Core Technologies
1.
2.
3.
4.
5.
6.
Statistical Analysis
Data Mining
Text Mining
Web Analytics
Data Collection
Deployment
5
Statistical Analysis
6
Descriptive Analysis
Analytic software:

Data displays
(e.g., frequency
distributions)

Graphic displays of data
(e.g. histogram)

Measures of central
tendency (e.g., mean,
median)

Estimates of variance
(e.g., standard deviation)
Satisfaction with service 1-10
80
60
40
Frequency

20
Std. Dev = 1.65
Mean = 8.3
N = 248.00
0
3.0
SPSS product: SPSS Base
4.0
5.0
6.0
7.0
8.0
9.0
10.0
Satisfaction with service 1-10
7
Inferential Analysis

Predicting numerical or
categorical outcomes

Linear regression

GLM Multivariate/Repeated
Measures

Non-linear regression

Weighted least squares

Two Stage Least Squares

Survival Analysis/Cox regression

Structural Equation Modeling
SPSS products: SPSS Base,
Regression Models, Advanced
Models & AMOS
8
Reporting

Graphical software

Visually communicate your
results

Create more visually
compelling information
SPSS products: SPSS Base
(and Trinity)
9
Powerful Data Management

Control Panel for
OMS

Allows users to turn
output into:

XML
 HTML
 Text
 SPSS data file
10
Data Mining
11
Where does Data Mining fit?

Three classes
of data mining
algorithms



Prediction
Association
Clustering
Cluster
Group cases that
exhibit similar
characteristics.
What events occur
together? Given a
series of actions; what
action is likely to occur
next?
Data
Mining
Predict
Associate
Predict who is likely to
exhibit specific
behavior in the future.
12
Profile and Predict: Supervised Learning

Build a predictive profile of
the historical outcome
using a collection of
potential input fields.
Credit ranking (1=default)
Cat.
%
Bad 52.01
Good 47.99
Total (100.00)
n
168
155
323
Paid Weekly/Monthly
P-value=0.0000, Chi-square=179.6665, df=1
Weekly pay
Monthly salary
Cat.
%
n
Bad 86.67 143
Good 13.33 22
Total (51.08) 165
Cat.
%
n
Bad 15.82 25
Good 84.18 133
Total (48.92) 158
Age Categorical
P-value=0.0000, Chi-square=30.1113, df=1
Age Categorical
P-value=0.0000, Chi-square=58.7255, df=1
Young (< 25);Middle (25-35)
Cat.
%
n
Bad 90.51 143
Good 9.49
15
Total (48.92) 158
Old ( > 35)
Cat.
%
Bad
0.00
Good 100.00
Total (2.17)
n
0
7
7
Young (< 25)
Middle (25-35);Old ( > 35)
Cat.
%
n
Bad 48.98 24
Good 51.02 25
Total (15.17) 49
Cat.
%
n
Bad
0.92
1
Good 99.08 108
Total (33.75) 109
Social Class
P-value=0.0016, Chi-square=12.0388, df=1
Management;Clerical

Explores all combinations,
interactions and
contingencies.

Use this profile to
understand and predict
future cases.
Cat.
%
Bad
0.00
Good 100.00
Total (2.48)
n
0
8
8
Professional
Cat.
%
n
Bad 58.54 24
Good 41.46 17
Total (12.69) 41
13
Cluster and Associate: Unsupervised
Learning

Find emerging
patterns and unusual
cases.


Use data mining to examine
the differences and shifts
across all dimensions of the
data.
Select large groups to identify
common patterns. Select
small groups to identify
unusual patterns.
14
The Product: Clementine
15
Read your data in…
16
Define your Target and Predictors
17
Build a Rule-based Predictive Model
18
Text Mining
19
From Concepts to Predictive Analytics
Components
LexiQuest
Mine
Discover
concepts,
relationships
and trends
LexiQuest
Categorize
Linguistic
Terminology
Extractor
Understand
documents
and assign in
pre-defined
categories
Text Mining for
Clementine
Add text fields to
data mining for
better prediction
20
Underlying Technology is Linguistic
based
Text is:
Unstructured
Ambiguous
Language dependent
Linguistic Approach
Does not treat a document as a bag of words
Removes ambiguity by extracting structured
concepts
Concepts are the DNA of text
21
Core Technology…
Linguistics Based Search Technology

SPSS LexiMine

Concept Extraction and Query Building
Classification Document Management

SPSS Categorize

Supervised Learning in Existing Systems
22
Text Mining for Clementine
Text Mining for Clementine consists of three
nodes:

Text Mining Source - uses LexiQuest Mine to automatically extract
concepts, categories and frequencies from a set of documents

Text Mining Process - uses LexiQuest Mine to automatically extract
concepts, categories and frequencies from text data stored in a
database, and links these results to structured data

Document Viewer - displays the document or documents selected from
the Text Mining Source node
23
Web Analytics
24
Web Measurement Continuum
Insight
Value
ROI
•# Users
•# Visits
•# Page Views
•Top Pages
•Top Referrers
•# Errors
•Recency
•Frequency
•Average Visit Streak
•Campaign Sales
•Eventstream
•Sectionstream
•Predict Likelihood to Respond
•Automatic User Segments
•Content Clustering
•Significant Activity Sequences
•Content & Activity Associations
•Textbook Visits
•Homepage Bouncing
PREDICTIVE
WEB
ANALYTICS
WEB
STATS
Activity Counts
WEB
ANALYTICS
WEB
ANALYTICS
WEB
STATS
WEB
STATS
Business Insight
Customer Intimacy
25
Web Mining for Clementine (WM4C)

Takes web data preparation directly into
Clementine – removes the need for NetGenesis

Turns huge volumes of web logs into business
events data

Allows for very fast deployment of data mining on
top of web data
26
Data Collection
27
Dimensions Capabilities
28
Dimensions Objectives

Software and Development Platform, not
just a set of applications

Design Once use Many.

Powerful: handles the most complex
survey designs and multimodal
deployment.

Centralized: defines and translates
metadata that drives all downstream
processes
29
Dimensions Solutions

Survey Design


Interview Builder – web based (included w/ mrInterview)
Data Collection Methods

mrInterview data collection engine






web-based and/or Call-Center functions available
mrPaper – create paper surveys within MS Word
mrPaper/mrScan – scan solution with Readsoft EHF
CATI – call center solution software
mrDialer – call center automated/predictive dialers
Palm PDA –data collection with Techneos EntryWare
30
Dimensions Solutions

Analysis/ Reporting/ Publishing





mrTables – web based reporting and publishing
mrStudio – desktop and script based - automate processes
Dimensions Component Pack – server processing
Interview Reporter for real-time reporting of web data
mrTranslate : Managing Multiple Languages



Easy to use tool that does not require research knowledge to use
Writes directly to the questionnaire metadata
Supports all single and double byte character sets

European (Spanish, French, etc.)
 Double Byte (Japanese, Mandarin, Korean, etc.)
31
Deploying SPSS Models
32
Deployment Solutions
Data Collection
Reporting/Analysis
Deployment
Existing Data
SPSS Web Deployment
Framework
Survey Data
Web Behavior
Text Extraction
Text Mining for
Clementine
Web Mining for
Clementine
Web Based
Applications
Predictive Marketing
Predictive Call Center
33
Summary

Predictive analytics

Wide range of products





Collect data
Analyze data
Mine data
Score and handle data
Wide range of applications




Predict
Group
Associate
Anomalies
34
Questions
?
35
Contacts




Tim Daciuk
Services Manager, Canada
416-410-7921
[email protected]

Angie Mohr
 SPSS Sales, Canada
 613-599-3377

[email protected]
www.spss.com
36