Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Now, there are about 30 mass murders in the U.S. each year (!), so the probability of finding at least four of those 30 events within 4 days of one another should be related to von Mises‘ birthday problem. For instance, Abramson and Moser derived in 1970 that the probability that at least two people (among n) have birthday within k days of one another (for an m days year) is
but I did not find an extension to the case of the four (to borrow from Conan Doyle!)… A quick approximation would be to turn the problem into a birthday problem with 364/4=91 days and count the probability that four share the same birthday
which is surprisingly large. So I checked with a R code in the plane:
T=10^5 four=rep(0,T) for (t in 1:T){ day=sample(1:365,30,rep=TRUE) four[t]=(max(apply((abs(outer(day,day,"-"))<4),1,sum))>4)} mean(four)
and found 0.0278, which means the above approximation is far from terrible! I think it may actually be “exact” in the sense that observing exactly four murders within four days of one another is given by this probability. The cases of five, six, &tc. murders are omitted but they are also highly negligible. And from this number, we can see that there is a 18% probability that the case of the four occurs within seven years. Not so unlikely, then.
Filed under: Books, R, Statistics, Travel Tagged: birthday problem, coincidence, Conan Doyle, mass murders, Pittsburgh, Richard von Mises, The Sign of Four, Toronto, USA Today
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.