Transcript Lecture 6

IS1825 Multimedia Development
for Internet Applications
Lecture 06: Big data and the Internet of
Things
Rob Gleasure
[email protected]
http://corvus2.ucc.ie/phd/rgleasure/index.html
IS1825

Today’s session
 Introduction to big data
 The 3 V’s of big data
 The Internet of Things
Big data

The idea is that the vast amounts of interaction data allow for
systems that are nuanced and responsive in ways that were
previously not possible

Also a realisation that, if it can be analysed, this data is a huge
commodity, meaning new business models are possible

So when is data ‘big data’
3 Vs of Big data

Volume
 Facebook generates 10TB of new data daily, Twitter 7TB
 A Boeing 737 generates 240 terabytes of flight data during a flight
from one side of the US to the other

We can use all of this data to tell us something, if we know the right
questions to ask
3 Vs of Big data
Traditional Approach
Analyzed
informatio
n
Big Data Approach
All available
information
analyzed
All available
information
Analyze small
subsets of data
Analyze all data
From http://www.slideshare.net/ibmcanada/big-dataturning-data-into-insights?qid=0b4c69bc-3db2-4e12-ae47-a362a25752eb&v=qf1&b=&from_search=3
3 Vs of Big data



Velocity
 Clickstreams and asynchronous data transfer can capture what
millions of users are doing right now
Think back to AirBnB – make a change, then watch the response.
No guesswork required up front as to what to gather, we can induce
the interesting stuff as we see it
3 Vs of Big data
Traditional Approach
Hypothesis
Question
Answer
Data
Start with hypothesis
and test against
selected data
Big Data Approach
Data
Exploration
Insight
Correlation
Explore all data and
identify correlations
From http://www.slideshare.net/ibmcanada/big-dataturning-data-into-insights?qid=0b4c69bc-3db2-4e12-ae47-a362a25752eb&v=qf1&b=&from_search=3
3 Vs of Big data

Variety
 Move from structured data to unstructured data, including image
recognition, text mining, etc.
 Gathered from users, applications, systems, sensors

Increasingly comprehensive data view of our ecosystem
 The Internet of Things
The Internet of Things
From http://www.pcworld.com/article/2039413/new-intel-ceo-creates-mysterious-new-devices-division.html
The Internet of Things

RFID sensors, bluetooth, microprocessors, wifi all becoming easier
to embed in ‘dumb’ devices

Move to mobile also means more data streaming from us at all
times, e.g. location, call activity, net use
The Internet of Things

Smart homes/smart cities
 Temperature, lighting, food stocks, energy, security

Smart cars
 Diagnostics, traffic suggestions, sensors, self-driving

Smart healthcare
 Worn and intravenous computing detects issues early and
monitors care outcomes remotely

Smart factories, farms
 Machines coordinated efficiently, linked dynamically to
consumption models
Big data

Success stories
 Books
 Barnes and Noble: Discovered that readers often quit
nonfiction books less than halfway through. Introduced highly
successful new series of short books on topical themes
 Amazon: originally used a panel of expert reviewers for
books. Data surplus allowed them to create increasingly
predictive recommendations. Panel has since been disbanded
and 1/3 of sales are now driven by the recommender system
Big data and the Internet of Things

Success stories (continued)
 Transport
 Flyontime.us: used historical weather and flight delay
information to predict likelihood of flights get delayed
 Farecast: looked at ticket prices for specific flights based on
historical data, then advised users to buy or wait according to
predicted fare costing trajectory
 UPS: Uses a range of traffic data to calculate most efficient
time/fuel efficient routes according to complex algorithm
Big data and the Internet of Things

Famous success stories (continued)
 Healthcare
 Modernizing Medicine EMA dermatology system
 https://www.youtube.com/watch?v=jMGaGtK9nzU
Big data and the Internet of Things

Famous success stories (continued)
 Social media
 Google (data for information relevance)
 Twitter (c.f. #RescuePH)
 Facebook (social data)
Issues with big data

Google Flu Trends
 Life imitating data, imitating life?

No one is really average height

Your Xbox knows you like that Katy Perry song

Also, Target called to say your teenage daughter is pregnant.

Icecream sales and shark attacks…
Icecream sales and shark attacks
continued (correlation, not causation)
From http://xkcd.com/552/
Target’s family monitoring continued
Readings


Mayer-Schönberger, V. and Cukier, K. (2014). Big Data: A
Revolution That Will Transform How We Live, Work, and Think,
John Murray Publishers, UK.
http://nextcity.org/daily/entry/rescuers-use-social-media-twitter-tofind-disaster-victims