Hadoop - Center
Cloud
Creating New Business Opportunities
GE
Revenue
Growth
Business
Innovation
Operational
Efficiencies
Increases ad revenue by
processing 3.5 billion
events per day
Measures and ranks online
user influence by processing
3 billion signals per day
Uses sentiment analysis and
web analytics for its internal
cloud
Massive
Volumes
Cloud
Connectivity
Real-Time
Insight
Processes 464 billion rows
per quarter, with average
query time under 10 secs.
Connects across 13 social
networks via the cloud for
data and API access
Improves operational
decision making for IT
managers and users
Klout Case Study: http://www.microsoft.com/casestudies/Microsoft-SQL-Server-2012-Enterprise/Klout/Data-Services-Firm-Uses-Microsoft-BI-and-Hadoop-to-Boost-Insight-into-Big-Data/710000000129
McKinsey
Forrester
Research
Gartner
Enabling
Experimentation to
discover needs,
expose variability
and improve
performance
Replacing/Supporting human
decision making with
automated algorithms
Innovating new business models,
products and services with
big data
Big Data
Creating Transparency
Segmenting populations to
customize actions
Internal Data through APIs
Cost of Changes
Fetcher & Parser to enrich &
validate with external data
Faster Updates
Cost Reduction
Full dataset refresh possible
every week instead of a few
times per year
Significant reduction in
validation phone calls
Solution
High Velocity
changes
HADOOP Data
Repository
Solution Benefits
Business Challenges
Solution Benefits
Data Silos
Old solution not able to
handle incoming volumes of
data in timely manner
Scalable architecture to
support current and future
real-time insight needs
Faster Insights
Grow with Needs
Real-time handling of the
growing volume and velocity of
the data, which grows by at least
1 TB per year.
Solution scales with business
needs without upfront cost
Solution
HADOOP Real-Time
Architecture
Solution Benefits
Business Challenges
Solution Benefits
High Volume &
High Velocity
Faster Research &
Diagnostic Insight
Diagnoses can be validated
against previous diagnoses
New research ideas can be
checked across full image set
Scalable Image Library
Processing cluster to process
images
Cost Friendly
Reliability
Improvement
Inexpensive data duplication
over HADOOP storage nodes
provides needed reliability
improvements
Solution
Analysis of 30,000+ giant
images of medical scans of
the pancreas
HADOOP Data &
Processing Cluster
Solution Benefits
Business Challenges
Solution Benefits
High Variety &
High Volume
HDInsight - Microsoft’s approach to Big Data
Immersive Insight,
Wherever you are
Connecting
with the World’s Data
Any Data, Any Size
Anywhere
Analyze Big Data with familiar tools
Share your data with the world via
Azure Marketplace
Simplicity and manageability of
Windows to Hadoop
Enrich with social media data via Social
Analytics
Extended data warehousing with
Hadoop
Advanced analytics with Hadoop
Scale & elasticity of cloud
Immersive insights from any data
Simple JavaScript-based programming
Benefits
Key Features
Familiar BI tools with structured and
unstructured data
Hive Excel plug-in and ODBC driver integrate
Hadoop with SQL Server Analysis Services,
PowerPivot, and Power View
Benefits
Key Features
New business insights with
predictive analytics from Microsoft
Unlock rare patterns from bespoke data
mining models
Hive ODBC Driver connects Hadoop
to SQL Server Data Mining tools
Support for open source predictive
analytics tools such as R and Mahout
Benefits
Integration with
Microsoft Enterprise
Data Warehouses
Key Features
Integration with enterprise BI
solutions
Microsoft SQL Server connector
for Apache Hadoop with SQOOP
(SQL to Hadoop)
Deeper insights from
structured and
unstructured data
SQL Server Parallel Data Warehouse
connector for Apache Hadoop with
SQOOP
Benefits
Key Features
Sharing of data and
insights through Windows
Azure Marketplace
Mashing up of internal and
public data sets via Data
Explorer
Integration with Windows
Azure Marketplace
Integration with third-party
data and services
Benefits
Key Features
Stronger customer
relationships
Models augmented with
publicly available data
from social media sites
Integration of social
information with
business applications
Social Analytics
Integration with social
media sites
Benefits
Key Features
Microsoft HDInsight
Simplified
management of
Hadoop on Windows
Enterprise-class
security
Easy setup on-premises
and in the cloud
Smart packaging of
Hadoop on premises
Integration with
Microsoft System
Center
Integration with
Windows Server®
Active Directory
Fast deployment of
Hadoop on Azure
100% Microsoft Support
Benefits
Key Features
Microsoft HDInsight
Elastic peta-scale
analytics on Microsoft’s
cloud platform
Enterprise-class Big Data
platform on-premises
Hadoop-based Service on
Windows Azure platform
Hadoop-based distribution
on Windows Server
1. Take a large problem and divide it into sub-problems
2. MAP: Perform the same function on all sub-problems
DoWork() DoWork() DoWork() …
3. REDUCE: Combine the output from all sub-problems
Output
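The three steps above (divide, apply the same function to each sub-problem, combine the outputs) can be sketched in plain Python. This is a minimal single-process illustration, not the Hadoop API: the word-count task, the chunk size, and the names split_into_chunks, do_work, combine and word_count are assumptions chosen for the example, with do_work mirroring the DoWork() boxes on the slide.

```python
from functools import reduce


def split_into_chunks(lines, chunk_size):
    """Step 1: divide the large problem into sub-problems (chunks of lines)."""
    return [lines[i:i + chunk_size] for i in range(0, len(lines), chunk_size)]


def do_work(chunk):
    """Step 2 (map): the same function, applied to every sub-problem.

    Here it counts the words in one chunk and returns a partial result.
    """
    counts = {}
    for line in chunk:
        for word in line.lower().split():
            counts[word] = counts.get(word, 0) + 1
    return counts


def combine(left, right):
    """Step 3 (reduce): merge two partial results into one."""
    merged = dict(left)
    for word, n in right.items():
        merged[word] = merged.get(word, 0) + n
    return merged


def word_count(lines, chunk_size=2):
    """Run the three steps in sequence on an in-memory list of lines."""
    chunks = split_into_chunks(lines, chunk_size)
    partial_results = map(do_work, chunks)
    return reduce(combine, partial_results, {})


if __name__ == "__main__":
    sample = ["big data", "big insight", "data data"]
    print(word_count(sample))
```

On a real cluster the chunks would live on different storage nodes and do_work would run on each node in parallel; the sketch only shows the shape of the computation.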
Spanning relational and non-relational worlds
SELF-SERVICE
INSIGHTS
MOBILE
PREDICTIVE
COLLABORATIVE
DATA ENRICHMENT
DISCOVER
AND RECOMMEND
TRANSFORM
AND CLEAN
SHARE
AND GOVERN
DATA MANAGEMENT
RELATIONAL
NON-RELATIONAL
MULTIDIMENSIONAL
STREAMING
MARKETPLACE
REAL-TIME
External Data
and Services
OPERATIONAL
Objectives
Scoping
Starter Offer
Customer
Meeting to
discuss Big
Data Needs
& Scoping
for Starter
Offer
Structured (approx. 4 to 5 weeks)
engagement that
demonstrates the
capabilities of the
Microsoft Big Data
platform with a prototype
using real customer data
Who Delivers
Expected Outcome
Microsoft
Consulting
Services &
Industry
Experts
Define Big Data
Company Strategy
Implement Big Data
Prototype solution
At your service