kevin-m-kent edited this page May 16, 2019 · 63 revisions

Welcome to my Data Science Resources wiki!

Learning R (general)

Best Practices - Krista DeStasio
R for Data Science
Stat 545
Data Science with R: how do I start?
What’s the best way to learn the programming language R? (Preferably, for free)
R for Data science curriculum (Stanford course)
How to Become a Data Scientist - On your own - Data Science Central
The best Data Science courses on the internet, ranked by your reviews
My Data Science Master’s
RStudio Cheat Sheets

Data Management, File Structure, Version Control

GitHub for Poets
Software Carpentry: Data Management
Guerrilla Analytics
Cookiecutter Data Science
Nice R Code
Project-Oriented Workflow

Open Datasets

Google Dataset Search

Scraping and API Interaction

Twitter API with R
#RTutorial: Using R to Harvest the Twitter STREAM API

Database Querying

Stanford Database Course

NLP

An overview of the NLP ecosystem in R

Feature Extraction

The Fundamental Difference Between Principal Component Analysis and Factor Analysis
Principal components analysis and exploratory factor analysis
Text Mining with R

Database Queries

SQLZOO
SQL Tutorial

Analysis

Research Blog: Harness the Power of Machine Learning in Your Browser with Deeplearn.js
Quick-R: Factor Analysis
Which linear model is best? | Real Data
The Fundamental Difference Between Principal Component Analysis and Factor Analysis
Calculate OLS regression manually using matrix algebra in R | the Tarzan
K-means R tutorial

Regex

R: Pattern Matching and Replacement
Regular Expressions in R

Data Management and Process

FAIR Data Management
CRISP-DM Process
Readings in data science
Data Organization in Spreadsheets

Wrangling

Ordering categories within ggplot2 facets
Visualizing Incomplete and Missing Data

Visualization

Gallery of ggplot2 plots
Data Visualization Tools and Books
Data Viz Tools
Scott Murray D3 Tutorial
Ggplot tutorial
Build an instant Twitter dashboard, with just a little code
Best practices for presenting plots
D3 tutorial list
Network Visualization in R
Sankey Graphs
RStudio's D3 Resources
Datawrapper
ComplexHeatmap Complete Reference
Viz Palette
The Data Visualisation Catalogue

Presenting Findings

R Markdown
Getting started with R Markdown

R Community

TidyTuesday
R for Data Science Online Community

Blogging

Blogdown package
Blogdown book
Jekyll
How to set up your own R blog with GitHub Pages and Jekyll Bootstrap
Tags on Jekyll and GitHub pages
Pygments CSS for Code Highlighting

Books

Visualization Analysis & Design
Text Mining with R
Python for Data Analysis - O'Reilly Media
R for Data Science