Going deeper with dplyr: New features in 0.3 and 0.4 (tutorial)
dplyr is an R package for data manipulation. Using a series of examples on a dataset you can download, this tutorial covers the new functionality in dplyr versions 0.3 and 0.4.
Watch the previous dplyr tutorial: http://youtu.be/jWjqLW-u3hc
View the R Markdown document: http://rpubs.com/justmarkham/dplyr-tutorial-part-2
Download the source document: https://github.com/justmarkham/dplyr-tutorial
Tutorial contents:
1. Introduction (starts at 0:00)
2. Loading dplyr and the nycflights13 dataset (starts at 1:12)
3. Choosing columns: `select`, `rename` (starts at 2:28)
4. Choosing rows: `filter`, `between`, `slice`, `sample_n`, `top_n`, `distinct` (starts at 5:40)
5. Adding new variables: `mutate`, `transmute`, `add_rownames` (starts at 12:38)
6. Grouping and counting: `summarise`, `tally`, `count`, `group_size`, `n_groups`, `ungroup` (starts at 15:20)
7. Creating data frames: `data_frame` (starts at 23:01)
8. Joining (merging) tables: `left_join`, `right_join`, `inner_join`, `full_join`, `semi_join`, `anti_join` (starts at 25:28)
9. Viewing more output: `print`, `View` (starts at 31:29)
10. Resources (starts at 34:41)
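A few of the verbs listed above can be sketched on a tiny example. This is a minimal illustration, not code from the video: the `flights_toy` and `airlines_toy` data frames are hypothetical stand-ins built with base R so the snippet runs without nycflights13 installed, and only verbs that exist in both the 0.3/0.4 releases and current dplyr are used.

```r
library(dplyr)

# Hypothetical toy data (an assumption for illustration, not the tutorial's dataset)
flights_toy <- data.frame(
  carrier   = c("AA", "AA", "UA", "UA", "DL"),
  dep_delay = c(5, 40, -3, 12, 0),
  distance  = c(500, 1200, 800, 800, 300),
  stringsAsFactors = FALSE
)
airlines_toy <- data.frame(
  carrier = c("AA", "UA", "B6"),
  name    = c("American", "United", "JetBlue"),
  stringsAsFactors = FALSE
)

# Choosing rows: between(x, left, right) is shorthand for x >= left & x <= right
mid_range <- flights_toy %>% filter(between(distance, 500, 1000))

# Grouping and counting: count() groups by carrier and tallies rows in one call
per_carrier <- flights_toy %>% count(carrier)

# Adding new variables: transmute() keeps only the columns named in the call
late_flags <- flights_toy %>% transmute(carrier, late = dep_delay > 10)

# Joining: left_join() keeps every row of the left table, filling NA where
# the right table has no match
joined <- flights_toy %>% left_join(airlines_toy, by = "carrier")

# Filtering join: anti_join() keeps left rows with no match in the right table
unmatched <- flights_toy %>% anti_join(airlines_toy, by = "carrier")
```

Here `unmatched` holds the single "DL" row, which is a quick way to spot keys missing from a lookup table before relying on a join.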
== RESOURCES ==
Release announcement (0.3): http://blog.rstudio.org/2014/10/13/dplyr-0-3-2/
Release announcement (0.4): http://blog.rstudio.org/2015/01/09/dplyr-0-4-0/
Reference manual and vignettes: http://cran.r-project.org/web/packages/dplyr/
Two-table vignette: http://cran.r-project.org/web/packages/dplyr/vignettes/two-table.html
Data Wrangling Cheat Sheet: http://www.rstudio.com/wp-content/uploads/2015/02/data-wrangling-cheatsheet.pdf
GitHub repo: https://github.com/hadley/dplyr
List of releases: https://github.com/hadley/dplyr/releases
== LET'S CONNECT! ==
Newsletter: https://www.dataschool.io/subscribe/
Twitter: https://twitter.com/justmarkham
Facebook: https://www.facebook.com/DataScienceSchool/
LinkedIn: https://www.linkedin.com/in/justmarkham/