The NYC Marathon
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
New York’s annual marathon took place yesterday. Watching a bit of it on television with my friends, I was struck by the much earlier starting time for women than men. Specifically, professional women started running yesterday at 9:10 AM, while professional men start running at 9:40 AM. (This information comes from the runner’s handbook.) I wanted to get a sense of how much this head start depended on real differences in their performance, because I found it very hard to imagine why professional women would run significantly slower than professional men.
Of course, I have seen discussions of the speed difference between men and women before, but I was still very surprised by it yesterday. To get a sense of the scope of the differences, I found some data this morning from the ING Marathon website and made a quick density estimate plot, which you can see below:
It’s clear that men and women had quite difference average speeds yesterday, and that their times had very different distributions. Of course, these plots are each based on 100 observations, so I’m hesitant to make any strong conclusions. Having confirmed for myself that there are real differences in the performance of men and women, I have to confess that I still find it surprising.
For those interested in following up on this, the code I used to produce this plot and the data set I used are both available on GitHub. I’m sure there are other interesting questions one can ask of this data beyond simple comparisons across genders.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.