Transcript Lecture 1

IS6145
Database Analysis and Design
Lecture 1: Introduction to IS6145 and the
changing nature of data
Rob Gleasure
[email protected]
www.robgleasure.com
IS6145

Today’s session
 Change of time/place for next week
 Course outline
 Data a few years ago
 Data now
 The cloud
 Big data
 Business Intelligence
 The case of Spotify
IS6145

Lecture times
 13.00-15.00, Wednesday (KANE B10A)

Contact me at
 Ext 2503
 Room 2.112
 [email protected]

Website for this course
 http:// girtab.ucc.ie/rgleasure/index.html
IS6145

Module content
 Data modelling is studied in its practical dimensions and
enterprise relational database applications (e.g. Oracle, MS SQLServer) are used to demonstrate the key issues in database
administration with an introduction to SQL.

Topics covered include
 Data modelling (ERDs and normalisation)
 Database technology and developing database systems
IS6145

Learning objective
 Analyse organisational activities to identify key data requirements
 Generate ERD models to identify data sources and their
relationships
 Employ normalisation processes to assist in meeting the data
integrity requirements
 Identify and utilise various relational data management systems
(RDMS) to meet an organisations functional requirements
 Demonstrate proficiency in basic SQL scripting, including the
ability to insert, delete and update database records.
IS6145

Course Assessment
 Continuous assessment: 30 marks
 In-class exam – 20 marks
 Group report – 30 marks
 Exam: 50 marks
IS6145

Some things to note
 Ask questions
 Help each other
 Make use of the Internet
 Use search engines (e.g. google) to find information on things
you want to know more about
 If you see cases or interesting stories you think we should talk
about in lectures, email me
IS6145

Some things to note
 This is essentially a skill-building course
 We’re going to have to discuss things and practice different
modelling techniques to improve
 As a starting point
 What is data?
Data a few years ago
Image from http://www.hotcleaner.com/web_storage.html
The Cloud

Capacity
Resources

Web is overtaking/has overtaken desktop
Mobile is replacing local
Utility-based computing is replacing once-off purchase
 Makes resources seem endless
 Lowers risk in terms of usage (pay as you go)
Demand
Resources

Capacity
Demand
Time
Static data center
Time
Data center in the cloud
Slide Credits: Berkeley RAD Lab
The Cloud

The ‘Internet of things’ was born in about 2009
 More devices connected to the Web than people…
Image from http://computinged.com/edge/become-part-of-the-cloud-computing-revolution/
The Cloud

This has meaningful implications for data in terms of
 Capacity
 Measurement
 Integration
 Security
 Privacy
Big data

All of this interaction with one linked information system means vast
quantities of data can be captured throughout user interaction, often
in real-time

‘big data’

The idea is that the vast amounts of interaction data allow for
systems that are nuanced and responsive in ways that were
previously not possible

Also a realisation that, if it can be analysed, this data is a huge
commodity, meaning new business models are possible

Firms like Google, eBay, LinkedIn, Facebook, etc. are based on the
principles of big data
3 Vs of Big data

Volume
 Facebook generates 10TB of new data daily, Twitter 7TB
 A Boeing 737 generates 240 terabytes of flight data during a flight
from one side of the US to the other

Velocity
 Clickstreams and asynchronous data transfer can capture what
millions of users are doing right now

Variety
 Move from structured data to unstructured data, including image
recognition, text mining, etc.
 Gathered from users, applications, systems, sensors
Big data

All of this means huge changes for a number of sectors
 e.g.
 Healthcare
 Trading
 Education
 Transport
Big data

E.g. in Healthcare
 Modernizing Medicine EMA dermatology system
 https://www.youtube.com/watch?v=jMGaGtK9nzU
Big data

E.g. in entertainment
 Amazon
 What did Amazon do that bricks-and-mortar bookshops didn’t?
Assignment 1

In groups, you are tasked with identifying and researching a
business that uses data in an interesting and creative way.
 The report should be approximately 2000 words and describe the
key values offered by the business to its consumers, how this
differentiates it from competitors, and how its use of data at
different points in the creation, delivery, and support of
products/services enables this differentiation.
 You don’t need to go into deep technical detail concerning how
data is handled, nor about the technologies used. However you
should discuss data-related processes at a high-level, insofar as
you understand them from the information you gathered

The report is due on the 23rd October, at which time a soft-bound
report should be handed into Ann O’Riordan in room 3.75
Assignment 1

The groups are as follows:








Group 1: Hartigan, Stephen John; Ojo, Afolabi; Liu, Yang;
Group 2: Li, Xiaochen; Cofalik, Emilia Agnieszka; Curtin, Peter
Laurence;
Group 3: Hayes, Brian James; Murphy, Charles Francis; Murphy, Laura;
Group 4: Kelleher, Shona; Wang, Pengcheng; Walsh, Bernard John;
Group 5: Carey, Caroline; Nolan, Ryan; Wu, Jiahua;
Group 6: Aslam, Usman; Quirke, David; Martin, James Michael;
Group 7: Wang, Meng; Cahill, Liam; Foley, Ciara Mary;
Group 8: Lee, David James; Lu, Zicheng; O Brien, Patrick Anthony;
Want to read more?



On Modernizing Medicine
 https://www.modmed.com/
On Spotify
 http://www.bigdata-startups.com/BigData-startup/big-dataenabled-spotify-change-music-industry/#!prettyPhoto
On the cloud and big data
 The Little Book of Cloud Computing 2013 edition, Lars Nielsen