Masters of Spin: Unraveling the web with R
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Here is a look at some of the masters of spin bowling in cricket. Specifically this post analyzes 3 giants of spin bowling in recent times, namely Shane Warne of Australia, Muthiah Muralitharan of Sri Lanka and our very own Anil Kumble of India. As to “who is the best leggie” has been a hot topic in cricket in recent years. As in my earlier post “Analyzing cricket’s batting legends: Through the mirage with R”, I was not interested in gross statistics like most wickets taken.
In this post I try to analyze how each bowler has performed over his entire test career. All bowlers have bowled around ~240 innings. All other things being equal, it does take a sense to look a little deeper into what their performance numbers reveal about them. As in my earlier posts the data has been taken from ESPN CricInfo’s Statguru
I have chosen these 3 spinners for the following reasons
Shane Warne : Clearly a deadly spinner who can turn the ball at absurd angles
Muthiah Muralitharan : While controversy dogged Muralitharan he was virtually unplayable on many cricketing venues
Anil Kumble: A master spinner whose chess like strategy usually outwitted the best of batsmen.
The King of Spin according to my analysis below is clearly Muthiah Muralitharan. This is clearly shown in the final charts where the performances of bowlers are plotted on a single graph. Muralitharan is clearly a much more lethal bowler and has a higher strike rate. In addition Muralitharan has the lowest mean economy rate amongst the 3 for wickets in the range 3 to 7. Feel free to add your own thoughts, comments and dissent.
The code for this implementation is available at GitHub at mastersOfSpin. Feel free to clone,fork or hack the code to your own needs. You should be able to use the code as-is on other bowlers with little or no modification
So here goes
Wickets frequency percentage vs Wickets plot
For this plot I determine how frequently the bowler takes ‘n’ wickets in his career and calculate the percentage over his entire career. In other words this is done as follows in R
# Create a table of Wickets vs the frequency of the wickts
colnames(wktsDF) <- c("Wickets","Freq")
# Calculate wickets percentage
wktsDF$freqPercent <- (wktsDF$Freq/sum(wktsDF$Freq)) * 100
and plot this as a graph.
This is shown for Warne below
1) Shane Warne – Wickets Frequency percentage vs Wickets plot
Wickets – Mean Economy rate chart
This chart plots the mean economy rate for ‘n’ wickets for the bowler. As an example to do this for 3 wickets for Shane Warne, a list is created of economy rates when Warne has taken 3 wickets in his entire career. The average of this list is then computed and stored against Warne’s 3 wickets. This is done for all wickets taken in Warne’s career. The R snippet for this implementation is shown below
econRate <- NULL
for (i in 0: max(as.numeric(as.character(bowler$Wkts)))) {
# Create a vector of Economy rate for number of wickets 'i'
a <- bowler[bowler$Wkts == i,]$Econ
b <- as.numeric(as.character(a))
# Compute the mean economy rate by using lapply on the list
econRate[i+1] <- lapply(list(b),mean)
print(econRate[i])
}
Shane Warne – Wickets vs Mean Economy rate
This plot for Shane Warne is shown below
The plots for M Muralithan and Anil Kumble are included below
2) M Muralitharan – Wickets Frequency percentage vs Wickets plot
M Muralitharan – Wickets vs Mean Economy rate
3) Anil Kumble – Wickets Frequency percentage vs Wickets plot
Anil Kumble – Wickets vs Mean Economy rate
Finally the relative performance of the bowlers is generated by creating a single chart where the wicket frequencies and the mean economy rate vs wickets is plotted.
This is shown below
As can be seen in the above 2 charts M Muralidharan not only has a higher strike rate as far as wickets in 3 to 7 range, he also has a much lower mean economy rate
You can clone/fork the R code from GitHub at mastersOfSpin
Conclusion: The performance of Muthiah Muralitharan is clearly superior to both Shane Warne and Kumble. In my opinion the king of spin is M Muralitharan, followed by Shane Warne and finally Anil Kumble
Feel free to dispute my claims. Comments, suggestions are more than welcome
Also see
1. Informed choices through Machine Learning : Analyzing Kohli, Tendulkar and Dravid
2. Informed choices through Machine Learning-2: Pitting together Kumble, Kapil, Chandra
3. Analyzing cricket’s batting legends – Through the mirage with R
You may also like
1. A peek into literacy in India:Statistical learning with R
2. A crime map of India in R: Crimes against women
3. What’s up Watson? Using IBM Watson’s QAAPI with Bluemix, NodeExpress – Part 1
4. Bend it like Bluemix, MongoDB with autoscaling – Part 2
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.