Data Science Master March 2015x

Download Report

Transcript Data Science Master March 2015x

Data Science Master Track
Tom Heskes and Harmen Prins
Scientific questions you will study
• What is clustering?
• What is causality?
• What’s the magic behind deep learning?
• How can you efficiently search and rank?
• How do you build reliable models from complex data?
Why are these questions important?
To help and improve our society
iCIS data science groups
• Prof. Tom Heskes
machine learning theory and applications
• Prof. Peter Lucas
Bayesian networks and eHealth
• Dr. Elena Marchiori
complex networks and machine learning
• Prof. Theo van der Weide
information systems and retrieval
iCIS data science groups
• Prof. Wessel Kraaij
information retrieval and multimedia data analysis
• Prof. Mireille Hildebrandt
privacy and legacy aspects of data mining
• Prof. Nico Karssemeijer
computer-aided diagnosis and medical imaging
• but also: Antal van den Bosch, Bert Kappen, Lutgarde Buydens, Marcel van
Gerven, Maurits Kaptein, ...
Course outline
1st semester
2nd
semester
3rd
semester
4th semester
Track Basis
Track Basis
Track
Choice
Track Basis
Research
Seminar
Track
Choice
Research Project
CS &
Society
Master Thesis
Track
Choice
Free Choice
Track
Choice
Free Choice
External
Choice
External
choice
Track basis courses
• Mandatory, key methodological aspects
• Machine Learning in Practice (6 ec)
• Information Retrieval (6 ec)
• Bayesian Networks (6 ec)
Track choice courses
• Statistical Machine Learning (6 ec)
• Natural Computing (6 ec)
• Machine Learning (9 ec)
•
•
•
•
•
•
Computer aided diagnosis in medical imaging (6 ec)
Bayesian Neurocognitive Modeling (6 ec)
Bioinformatics (3 ec)
Pattern Recognition for Natural Sciences (3 ec)
Text Mining (6 ec)
Artificial Intelligence at the Web Scale (6 ec)
• Law in Cyberspace (6 ec)
• Foundations of Information Systems (6 ec)
• Business Rules Specification and Application (3 ec)
Theory and Tools
Applications
Other aspects
Research projects
• Join one of the research groups within iCIS/RU or do an internship at a company
• Can Google Trends predict outbreaks of influenza?
Nature paper correlating Google searches to influenza outbreaks
led to quite some discussion: a fluke or actual predictive power?
• What distinguishes an excellent RTS game player from an average one? The
SkillCraft data set contains many characteristics of various players that can be
mined for actual causal relationships
Master thesis projects
• Steffen Janssen developed a tool to predict productivity of software projects based
on neural networks for the Dutch tax authorities
• Thomas Janssen improved the fitting of hearing aids by machine
learning for the hearing aid company GN ReSound
• Louis Onrust studied a novel machine learning method for the extraction
of brain structure from neuroimaging data
Master thesis projects
• Niels Radstake investigated Bayesian approaches to analyze mammographic
images
• Jelle Schühmacher came up with a classifier-based method for
searching large document collections
• Tom de Ruyter works on his master thesis at Xerox in Grenoble
to improve dynamic pricing for parking in LA and other US cities
Do you want to study abroad? Or an internship?
For appointments
please mail to:
[email protected]
Room HG 00.508
But first contact your study advisor about the contents of your stay abroad!
Data Science vs Web and Language Interaction
• Overlap: text mining, information retrieval, machine learning in practice, AI at the
web scale
• Data Science: broader scope of application domains, slightly more emphasis on
methodological aspects
• Web and Language Interaction: dive deeper into the
psychological and neurological aspects of
human-human and human-computer interaction
• Can do both through a joint, double master program:
180 ec (3 years) for 2 master degrees!
Job perspective
• Start up your own company in data analytics, become a data analysis specialist or
consultant at a larger company, or go for a PhD
Rasa Jurgelenaite
Quantitative risk analyst
at ABN AMRO
Kristel Rösken
Business analyst
at VVV Nederland
Pavol Jancura
Software design engineer
at ASML
Laurens van de Wiel
Data scientist at FlxOne
Max Hinne and Wout Megchelenbrink
PhD students
Bart Bakker
Senior scientist
at Philips Research
Alex Slatman
Director at OBI4wan
Why Data Science at the Radboud University?
• Diversity: various aspects and applications of data science
• Flexibility: large choice of courses to shape
student interests
• Excellence: students are embedded in
research groups
Example: Machine Learning in Practice
• Basic idea: student teams enter an ongoing machine learning competition
• While trying to beat the other teams, students
learn the ins and outs of challenging
machine learning problems
• For example: recognize thousands of plankton
by their shape
Example: Statistical Machine Learning
• Theoretical underpinning of machine learning methods
- regression
- classification
- neural networks
- kernel methods
- mixture models and EM
• Programming and math exercises
• Demonstrations on actual data
Example: Text Mining
Learn all about the different areas of text mining:
• Text categorization
• Summarization
• Question answering
• Topic modeling
• Sentiment analysis
• Social media mining
And do an experiment on one of those topics yourself
Example: Bayesian Neurocognitive Modeling
• Use machine learning tools to understand our brain
• Example: decode fMRI data to
reconstruct the image the person is
looking at
• Pioneered by Gallant's lab at UCB
• In the course we implement similar
techniques for still images. And that
is just one week
My impressions
•
•
•
•
•
•
Is it fun?
Is it difficult?
Can you make a living?
Will you have options? Can you reconsider?
Study environment
Should you do it?
• Pro tips:
- Have a look at some statistics before starting the courses
- Always ask. Always.
Thanks!