A CEO’s View of Open Source Data Science in the Enterprise
Highlights from my interview with Art Steinmetz
Art earlier shared an in-depth perspective on Open Source Data Science in Investment Management as a guest contributor to this blog, so I was curious to learn more about his experience, both as an CEO encouraging his teams to use open source data science, and as an R user himself.
How and why did you get started with open source data science?
Art shared that he started using R, a major language for open source data science, when he became frustrated with the limitations of Excel. As he describes it,
“One of the things that really bugged me was my current self had no idea what my past self did, when I opened a spreadsheet from a year or two prior,”
and he was forced to puzzle through the obscure formulas and the critical dependencies between spreadsheets. He started using R more and more, because he found he was “getting answers faster, and with reusable code.”
Is open source software appropriate for enterprise-level data science?
From Art’s perspective, it is absolutely appropriate, because it is “a great way to boost productivity, by empowering all the interested parties in the organization”. Art related that because of the reach and availability of open source, there were many different people at his organization working on analytic problems. Open source “lets a thousand flowers bloom”, but critically this can be done in a managed, curated way that addresses IT’s concerns, using platforms like RStudio Team to support the full data science production life cycle.
How do you build support for open source software within an organization?
Finally, I asked Art for his advice on how to build support for open source with an organization. While he says this is much easier than it used to be, as open source software has become more accepted, his primary advice was:
- Start small, with quick projects to demonstrate value.
- Inspire others, who will want the same power and flexibility.
- Don’t “go rogue” and appear to be rejecting IT standards. Instead, work with IT as much as possible.
To Learn More
- Watch the full interview with Art on YouTube here.
- Read Art’s previous blog post on Open Source Data Science in Investment Management, where Art relates how OppenheimerFunds struggled to get full value from their data until they adopted an open source data science approach.
- To hear more stories of how organizations are driving change and impact with their open source data science, read some of our customer stories, or join one of our RStudio Enterprise Meetups.