From R to Python
Download
Report
Transcript From R to Python
From R to Python
Robert Mastrodomenico
Global Sports Statistics
(Edit via Slide Master) Name – Job Title
[email protected]
Schedule
About me
My experiences with R
Why I moved to Python
Whats so good about Python
Robert Mastrodomenico – Global Sports Statistics
[email protected]
About me
Completed a PhD in statistical genetics at
University of Reading in 2008
Worked at Smartodds from 2007 to 2011
Setup Global Sports Statistics in 2011
Robert Mastrodomenico – Global Sports Statistics
[email protected]
My experiences with R
Initially used R at PhD level integrated with
C to do analysis
R was the primary programming language
once I started work
R used for data analysis and maintenance
tasks
Robert Mastrodomenico – Global Sports Statistics
[email protected]
Why I moved to python
R was used due to the ease of data
manipulation and large statistical library
However R was also being used for other
tasks such web scraping and html/xml
parsing
R was becoming the only choice used for
scripting
Robert Mastrodomenico – Global Sports Statistics
[email protected]
Why I moved to python
I first used python for an XML parsing task
Code was easy to write
Results were great
This got me curious about what else
Python had to offer
Robert Mastrodomenico – Global Sports Statistics
[email protected]
Dive into Python
To learn more about Python I read “Dive
into Python”
Syntactically and stylistically Python was
much different to R
The more I learnt the more I used it
Robert Mastrodomenico – Global Sports Statistics
[email protected]
Setting up by myself
In 2011 Global Sports Statistics was
created
Everything had to be setup from scratch
Used the opporunity to do something
different
Robert Mastrodomenico – Global Sports Statistics
[email protected]
Why Change?
Development is faster in Python
Easy to write Object Orientated code
Fantastic Library of packages
Sometimes a change is good!
Robert Mastrodomenico – Global Sports Statistics
[email protected]
What Python has to offer
Pandas: Data Analysis Library
Django: High Level Web Framework
NumPy: Package for numerical computing
SciPy: Routines for numerical Integration
and Optimization
Robert Mastrodomenico – Global Sports Statistics
[email protected]
What Python has to offer
Matplotlib: 2-D plotting
Flask: Microframework
Mechanize: Programatic Web Browsring
BeautifulSoup: Html Scraping
PyQt: GUI building
Robert Mastrodomenico – Global Sports Statistics
[email protected]
Rpy2
But what if you want to use an R package
Rpy2 allows use of any R functionality from
within Python
It also supports use of any packages that R
has to offer
Robert Mastrodomenico – Global Sports Statistics
[email protected]
Why move to Python
Im not saying you need to
However I believe a lot of what people do
in R can be done faster in Python
Try it out whats the worst that can happen?
Robert Mastrodomenico – Global Sports Statistics
[email protected]
Thank you for you attention
Robert Mastrodomenico – Global Sports Statistics
[email protected]