Lending Club – naive data analysis
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Dataspora recently published a geographical analysis of Lending Club's data, using the data set distributed by the site.
In the site's own words: "Lending Club is an online financial community that brings together creditworthy borrowers and savvy investors so that both can benefit financially. We replace the high cost and complexity of bank lending with a faster, smarter way to borrow and invest."
Lending Club’s returns are very attractive (from the lenders’ point of view), and, at the same time, the Club lets borrowers avoid the high interest rates banks charge for similar loans. There are obviously some risks associated with the high returns (such as the costly recovery of money after a payment default), and one can ask whether those risks are properly reflected in the interest rate of each loan.
A few obvious things to note from the cute box charts:
Nothing really surprising so far. Some data on defaults is also available, and one could continue digging into the provided data to see if any pattern emerges.
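# Read the export distributed by the site, skipping the first line of the file
loans <- read.csv("data/lclub.csv", header=TRUE, skip = 1)
o <- data.frame(id=loans$Loan.ID)
# Revolving line utilization, bucketed into 10-point bins (0 = 0-9.99%, 1 = 10-19.99%, ...)
o$rul <- floor(as.numeric(gsub("%", "", loans$Revolving.Line.Utilization))/10)
# Interest rate as a number, with the percent sign stripped
o$rate <- as.numeric(gsub("%", "", loans$Interest.Rate))
o$grade <- loans$CREDIT.Grade
boxplot(rate ~ grade, data=o)
boxplot(rate ~ rul, data=o)
boxplot(rul ~ grade, data=o)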
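The box plots can be backed by a quick numeric summary. The sketch below is only an illustration: it uses a tiny made-up data frame in place of data/lclub.csv (the values and the single-letter grades are assumptions, not real Lending Club data), and the helper `pct` is a hypothetical name introduced here. It computes the median rate per grade, which is the center line of each box in the first plot, and a rank correlation between the utilization bucket and the rate.

```r
# Toy stand-in for data/lclub.csv: values are invented for illustration only.
loans <- data.frame(
  Loan.ID = 1:6,
  Interest.Rate = c("7.5%", "8.1%", "11.2%", "12.0%", "15.3%", "16.1%"),
  Revolving.Line.Utilization = c("5.0%", "12.3%", "45.0%", "51.7%", "88.8%", "93.1%"),
  CREDIT.Grade = c("A", "A", "C", "C", "F", "F"),
  stringsAsFactors = FALSE
)

# Hypothetical helper: strip the percent sign and convert to a number.
pct <- function(x) as.numeric(sub("%", "", x, fixed = TRUE))

o <- data.frame(
  id    = loans$Loan.ID,
  rate  = pct(loans$Interest.Rate),
  rul   = floor(pct(loans$Revolving.Line.Utilization) / 10),
  grade = loans$CREDIT.Grade
)

# Median interest rate per credit grade (one row per grade).
med_by_grade <- aggregate(rate ~ grade, data = o, FUN = median)
print(med_by_grade)

# Spearman rank correlation between utilization bucket and interest rate.
rho <- cor(o$rul, o$rate, method = "spearman")
print(rho)
```

On the real file, the same two calls would run unchanged on the `o` data frame built in the original script, provided the column names match those used there.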