10 R packages every data scientist should know about
[This article was first published on Revolutions, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
The yhat blog lists 10 R packages they wish they'd known about earlier. Drew Conway calls them “10 reasons to always start your analysis in R”. They're all very useful R packages that every data scientist should be aware of. They are:
- sqldf (for selecting from data frames using SQL)
- forecast (for easy forecasting of time series)
- plyr (data aggregation)
- stringr (string manipulation)
- Database connection packages RPostgreSQL, RMYSQL, RMongo, RODBC, RSQLite
- lubridate (time and date manipulation)
- ggplot2 (data visulization)
- qcc (statistical quality control and QC charts)
- reshape2 (data restructuring)
- randomForest (random forest predictive models)
You can find links to all of these packages and tips on how to use them at link below.
yhat blog: 10 R packages I wish I knew about earlier
To leave a comment for the author, please follow the link and comment on their blog: Revolutions.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.