Resources
Books
The Data Science Handbook
Field Cady. The Data Science Handbook. Wiley, 2017. Available for free as a PDF download
[here: download link](https://github.com/bsharvey/bsharvey.github.io/blob/master/assets/books/The%20Data%20Science%20Handbook.pdf)
Big Data Fundamentals
Erl, Thomas, Wajid Khattak, and Paul Buhler. Big data fundamentals: concepts, drivers & techniques. Prentice Hall Press, 2016. Available for free as a PDF download at
[here: download link](https://github.com/bsharvey/bsharvey.github.io/blob/master/assets/books/bigdatafundamentals-thomaserl-book.pdf)
Doing data science
Schutt, Rachel, and Cathy O'Neil. Doing data science: Straight talk from the frontline. " O'Reilly Media, Inc.", 2013.
Available for free as a PDF download at
[here: download link](https://github.com/bsharvey/bsharvey.github.io/blob/master/assets/books/Doing%20Data%20Science.pdf)
Machine learning for hackers
Conway, Drew, and John White. Machine learning for hackers. " O'Reilly Media, Inc.", 2012.
Available for free as a PDF download at
[here: download link](https://github.com/bsharvey/bsharvey.github.io/blob/master/assets/books/machine_learning_for_hackers.pdf)
Python for data analysis
McKinney, Wes. Python for data analysis: Data wrangling with Pandas, NumPy, and IPython. " O'Reilly Media, Inc.", 2012.
Available for free as a PDF download at
[here: download link](https://github.com/bsharvey/bsharvey.github.io/blob/master/assets/books/Python4DataAnalysis.pdf)
Data Science for Business
Provost, Foster, and Tom Fawcett. Data Science for Business: What you need to know about data mining and data-analytic thinking. " O'Reilly Media, Inc.", 2013.
Available for free as a PDF download at
[here: download link](https://github.com/bsharvey/bsharvey.github.io/blob/master/assets/books/data-science-for-business.pdf)
Introduction to data science
Stanton, Jeffrey M. "Introduction to data science." (2013).
Available for free as a PDF download at
[here: download link](https://github.com/bsharvey/bsharvey.github.io/blob/master/assets/books/DataScienceBook1_1.pdf)
Data
data.gov.in
{Add description of example an library here}
data.gov.in data example data repository
World Band
{Add description of example an library here}
World Bank data example data repository
Reserve Bank India
{Add description of example an library here}
Five Thirty Eight Datasets
{Add description of example an library here}
AWS dataset
{Add description of example an library here}
Google datasets
{Add description of example an library here}
Youtube labeled Video datasets
{Add description of example an library here}
UCI Machine Learning Repisitory
{Add description of example an library here}
Kaggle
{Add description of example an library here}
Analytics Vidhya
{Add description of example an library here}
Qandl
{Add description of example an library here}
Past KDD Cups
{Add description of example an library here}
Driven Data
{Add description of example an library here}
Tutorials
- initial steps toward reproducible research
- organizing data in spreadsheets
- git/github guide
- minimal make
- knitr in a knutshell
- R package primer
- simple site: making a simple website with GitHub Pages
-
hipsteR: re-educating people who learned R before it was cool
- intro to R [not really a tutorial, but there are some useful links here]
- intro to Perl [very old; I now recommend ruby or python]
- intro to C [not really a tutorial, but it includes a list of C programming books]
- top ten worst graphs