Blogging A to Z: The A to Z of tidyverse
[This article was first published on Deeply Trivial, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Announcing my theme for this year’s blogging A to Z!Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
The tidyverse is a set of R packages for data science. The big thing about the tidyverse is making sure your data are tidy. What does that mean?
- Each row is an observation
- Each column is a variable
- Each cell contains only one value
When I first learned about the tidy approach, I thought, “Why is this special? Isn’t that what we should be doing?” But thinking about keeping your data tidy has really changed the way I approach my job, and has helped me solve some tricky data wrangling issues. When you really embrace this approach, merging data, creating new variables, and summarizing cases becomes much easier. And the syntax used is the tidyverse is much more intuitive than much of the code in R, making it easier to memorize many of the functions; they follow a predictable grammar, so you don’t need to constantly look things up.
See you tomorrow for the first post – A is for arrange!
To leave a comment for the author, please follow the link and comment on their blog: Deeply Trivial.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.