New package "SparkRext" – SparkR extension for closer to dplyr
[This article was first published on HOXO-M - anonymous data analyst group in Japan - , and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Apache Spark is one of the hottest products in data science.Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Spark 1.4.0 has formally adopted SparkR package which enables to handle Spark DataFrames on R.
SparkR is very useful and powerful.
One of the reasons is that SparkR DataFrames present an API similar to dplyr.
We launced our new package “SparkRext” to redefine the functions of SparkR to enable NSE(Non stanard Evaluation) inputs. As a result, the functions will be able to be used in the same way as dplyr.
If you want to know about SparkRext package in detail, please check our blog post here.
To leave a comment for the author, please follow the link and comment on their blog: HOXO-M - anonymous data analyst group in Japan - .
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.