Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
The rOpenSci project is a poster child for the fluid collaboration that has become increasingly common these days thanks to platforms like Twitter and GitHub. It has been really inspring to see open discussions take shape as rough ideas, which rapidly turn into prototype research software, all of which are now happening in the order of few days to weeks rather than months to years. The origins of this project itself lead back to a series of serendipitous conversations that occurred over Twitter three years ago. Today we are a rapidly growing community of scientists, students, software developers, and information researchers.
When we wrote our proposal to the Sloan foundation last year, we budgeted a small meeting (hackathon) for the four core developers and a few members of the advisory board members. But pretty early on into our funding cycle, we realized that such a meeting could be far more impactful, both in terms of building community and prototyping software, if we were a lot more inclusive. Thanks to a series of fortuitous circumstances we were able to hold exactly such an event last month.
We held rOpenHack, a coding un-conference at GitHub's HQ in San Francisco from March 30th to April 1st. In addition to the core members, we invited several valuable contributors from the R community and put out an open call for other to join. All in all, we had ~ 30 people attend in person with a handful of folks contributing remotely. The attendees were a mix of academics, developers (both industry and non-profit) and several independent scientists, with ~ 40% of the participants being women.
Since this was our first time planning an event like this, we loosely followed an unconference model (similar to Science Online) and asked participants to propose projects on the hackathon repo in advance of the event. This approach was a really great opportunity for folks to identify projects and fellow participants, and allowed us to quickly gauge where there was the most interest. While we didn't work on every single idea over those two days, we did manage to get quite a bit done. Here are some projects where we made a lot of headway:
What we hacked on
Gavin Simpson wrote a package called
Dependencies
to track package dependencies (which is still a challenge in R).Ciera Martinez, Ben Marwick, Ed Szocs, Juan Batiz-Benet and Jeff Hollister worked on a comprehensive reproducibility guide.
Thomas Kluyver from the IPython project (with help from Jeroen Ooms and Ramnath Vaidyanathan) made great headway in finishing up a full iteration of the R kernel (which works great at this writing)
Martin Fenner, Gavin Simpson and Carl Boettiger built an rOpenSci cookbook of software recipes but this project has much wider applications. Once a few remaining bugs are ironed out, it will be an amazingly cool new knitr plugin for Jekyll along with a Ruby gem wrapping the knitr R package.
testdat
– An R package to run unit tests (check for missing data, missing dates, bad formatting, outliers, non-ascii characters etc) on tabular data (Karthik Ram, Hilary Parker, Aylssa Frazee, Anya Bida, Dave Harris and Winston Chang)A best practices guide for working with APIs in R (Hadley Wickham
Jure Triglav was an awesome remote contributor and built a Ruby gem to search for citations to R packages.