Microsoft R Server tips from the Tiger Team
[This article was first published on Revolutions, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
The Microsoft R Server Tiger Team assists customers around the world to implement large-scale analyytic solutions. Along the way, they discover useful tips and best practices, and share them on the Tiger Team blog. Here are a few recent tips from the Tiger Team on using Microsoft R Server:
- Gather metadata and exlore numeric summaries of large data sets held in XDF files
- Filter XDF files with regular expression matching using the rxDataStep function in the RevoScaleR package
- Import DBase .dbf files into Microsoft R Server as an XDF file
- Optimize performance when using rxExec to parallelize R code across a server or cluster
- Perform various data wrangling tasks on XDF files, including aggregations, merges, and calculating column-level statistics
- Confine Microsoft R Server computations to a subset of a Hadoop cluster using node labels
- Quantify risk associated with loans, via in-database model scoring with SQL Server R Services
For more tips, including tips on operationalizing R scripts and using Microsoft R Server with data platforms including Teradata and Cloudera, check out thre Tiger Team blog at the link below.
To leave a comment for the author, please follow the link and comment on their blog: Revolutions.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.