New Data Sources
Download
Report
Transcript New Data Sources
NEW DATA SOURCES
2 Cases
• Varying Data Sources
• Simulation-Based
OBJECTIVES
■
■
■
needed to understand factors
affecting volume performance
needed to quantify impact of
each factor
possible forecast of immediate
future performance
volume decomposition
quantified impact of each
factor
effects of short/long term
activities (promo, media)
Classes of Data
Internal
Sales by Subcategories
Government
economic indicators ( GNP, inflation, P-$, unemployment, etc)
weather ( temperature, rainfall , humidity)
Competitive
Retail Audit
Consumer confidence
Media TARP
Events/Promotions
APPROACH
Pre- analysis
Estimate of Base Volume
Volume Decomposition
Effect of Short term activities
Simulations
PRE-ANALYSIS
– assigned dummy variables for events (
month of occurrence)
Group of Indicators
Economy
Weather
applied data reduction technique to
address problem on short time series and
no. parameters > no. of observations
used principal components per group of
indicators to come up with composite
scores
Humidity
Rainfall
Media
Distribution
Promotion
Events
Pricing
Competitor Act.
Predictive Analytics
All factors affect volume but of varying degrees
Economy
Weather
Media
Distribution
Promotion
Events
Pricing
Competitor
Activities
SALES
Publication
Monte Carlo Simulation
■ Stochastic methods to generate new configurations of a system
of interest – simulation of a phenomena
■ Monte Carlo: importance sampling or systems at equilibrium.
– Start: initial configuration of the system
■ can be data-based random variable generation
– Change the configuration
■ acceptance/rejection of changes
Monte Carlo Simulation
■ Given a data-generating mechanism
– Example: drawing colored balls from an urn, input-output
model, adaptive sampling, etc.
– model of the process you wish to understand
– produce new samples of simulated data, replicate current
data
– examine results of those samples
– may also amplify this procedure with additional
assumptions
Monte Carlo Simulation
■ Computer Simulation/Monte Carlo Models
– Not solved by mathematical analysis but are used for
numerical experimentation.
– Goal of Numerical Experimentation: Answer questions of
real world (What if-sensitivity analysis)
– Purpose of Sensitivity Analysis
■
Validation of the model
– Would the customers exhibit similar credit
behavior?
– Are their credit behavior similar?
Simulation
■ Big Data
Big Data
■ Machine
Learning/Modeling
Leading Indicators
■ Simulation
Validation
■ Estimation
■ Validation
Model-Based Estimation
Simulation Procedures (Resampling)
■ Construct a simulated universe
– composition similar to the universe whose behavior we wish to
describe and investigate.
■ Specify the procedure that produces a pseudo-sample
– simulates the real-life sample in which we are interested
– specify procedural rules by which the sample is drawn from the
simulated universe (purposive sampling)
■ Describe: if several simple events must be combined into a composite
event
■ Calculate the probability of interest
– estimate parameters
– test hypothesis
– Based on tabulation of outcomes of the resampling trials.
THANK YOU.