Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
I was recently asked by a R user about how one could extract the “rule” in a classification/regression tree. The requirement was to obtain the path traced from the root node to the leaf nodes and obtain all the paths or “rules”
path.rpart()
function in the mvpart package provides this convenience
library(mvpart)
# Create a classification tree
ozone <- mvpart(Ozone ~ ., data=airquality)
print(ozone) # Gives you the various splits in the tree
# Issue the two commands below to see the graphical representation
plot(ozone)
text(ozone)
# To obtain the summary of the created tree
summary(ozone)
# To obtain the path to the leaf nodes
leafnodeRows <- grepl("leaf",ozone$frame$var)
nodevals <- as.numeric(rownames(ozone$frame)[leafnodeRows])
rules <- path.rpart(ozone,nodevals)
rulesdf <- do.call("rbind",lapply(rules,function(x)paste(x,collapse = " -AND- ")))
rulesdf <- data.frame(nodeNumber=rownames(rulesdf),rule=rulesdf[,1],stringsAsFactors=FALSE)
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.