R-bloggers

Best Before Dates by Bass

[This article was first published on R – Win Vector LLC, and kindly contributed to R-bloggers.]

I was searching for one last real-world example for my upcoming video talk on time series forecasting on March 13th.




Hope to see you there! Or reach out to Win Vector LLC for custom training!


I had the seemingly harmless thought: “Let’s look at Stack Overflow trends”. In particular their pre-built “data science and big data trends” query seemed fun.

This is a graph of the percentage of Stack Overflow questions tagged with data science terms such as R, Pandas, and so on. It seems to show exploding interest in R and Pandas, and maybe even TensorFlow. Pandas was likely chosen as a proxy for interest in Python for data science (versus a general interest in Python). I’d prefer view counts over question percentages as a proxy for interest, but it is what it is.

Then I thought, let’s see if they have newer data. They do, and it is horrifying (though not unexpected to those of us in the industry).

The graph appears to show interest in R and Pandas rapidly falling. I know there are alternatives to Pandas (such as Polars), but my spot checks didn’t show any of them taking Pandas’s place. Likely we are seeing a large-scale replacement of data science courses with LLM coursework and projects. A relevant point is that ChatGPT was released in November of 2022.

For laughs I digitized the results from the graph into numbers and used Sanjiv Ranjan Das’s excellent book chapter “Product Market Forecasting using the Bass Model” to fit a good old Bass product diffusion model to the data. The joke is that the Bass model assumes all products die. That isn’t so much a prediction of the Bass model as one of its assumptions. The idea is: products go obsolete, and Bass helps estimate when.
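For readers who have not met the model, here is a minimal Python sketch of the standard continuous-time Bass adoption-rate curve. It is only an illustration, not the code behind this post (that analysis was done in R), and the values chosen for p (innovation), q (imitation), and m (market size) are made up.

```python
import math

def bass_rate(t, p, q, m=1.0):
    """Bass adoption rate m * f(t) at time t >= 0.

    f(t) = ((p+q)^2 / p) * e^{-(p+q)t} / (1 + (q/p) e^{-(p+q)t})^2
    """
    e = math.exp(-(p + q) * t)
    return m * ((p + q) ** 2 / p) * e / (1.0 + (q / p) * e) ** 2

def bass_peak_time(p, q):
    """Time of peak adoption, ln(q/p) / (p+q); positive only when q > p."""
    return math.log(q / p) / (p + q)

# Hypothetical parameters, chosen only for illustration.
p, q, m = 0.01, 0.4, 100.0

t_peak = bass_peak_time(p, q)                        # about 9.0 periods
curve = [bass_rate(t, p, q, m) for t in range(31)]   # rate at t = 0..30

print(f"peak at t = {t_peak:.2f}")
print(f"rate at t=0: {curve[0]:.3f}, t=9: {curve[9]:.3f}, t=30: {curve[30]:.4f}")
```

With any positive p and q the rate eventually decays toward zero, which is the "all products die" assumption in concrete form.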

The Bass methodology gave me the following graph. Keep in mind: it forces this parabola-like rise-and-fall shape no matter what the data look like.
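To see where the forced shape comes from, here is a hedged Python sketch of one common way to fit the model: regress each period's adoption on the prior cumulative adoption and its square, then map the coefficients back to (p, q, m). This is an assumption about the general technique, not a reproduction of my R analysis. The helper names (`solve3`, `fit_bass`, `simulate_bass`) and the synthetic data are hypothetical.

```python
import math

def solve3(A, b):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]
    n = 3
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for k in range(col, n + 1):
                M[r][k] -= f * M[col][k]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))) / M[i][i]
    return x

def simulate_bass(p, q, m, periods):
    """Discrete Bass recursion: s_t = (p + q*S/m) * (m - S), S = prior cumulative."""
    sales, S = [], 0.0
    for _ in range(periods):
        s = (p + q * S / m) * (m - S)
        sales.append(s)
        S += s
    return sales

def fit_bass(sales):
    """Least-squares fit of s_t = a + b*S + c*S^2, mapped back to (p, q, m).

    Assumes the fitted c is negative (a hump); otherwise the square root
    or the division below will not give meaningful values.
    """
    cum, prev = [], 0.0
    for s in sales:
        cum.append(prev)          # cumulative adoption before this period
        prev += s
    X = [[1.0, S, S * S] for S in cum]
    # Normal equations (X'X) beta = X'y
    XtX = [[sum(r[i] * r[j] for r in X) for j in range(3)] for i in range(3)]
    Xty = [sum(r[i] * y for r, y in zip(X, sales)) for i in range(3)]
    a, b, c = solve3(XtX, Xty)
    m = (-b - math.sqrt(b * b - 4.0 * a * c)) / (2.0 * c)
    return a / m, -c * m, m       # p, q, m

# Synthetic, noiseless data from made-up parameters.
true_p, true_q, true_m = 0.03, 0.4, 10.0
sales = simulate_bass(true_p, true_q, true_m, 25)
p_hat, q_hat, m_hat = fit_bass(sales)
print(p_hat, q_hat, m_hat)
```

Because the fitted form is a downward-opening quadratic in cumulative adoption, the implied adoption rate must fall to zero as cumulative adoption approaches m. The hump comes from the model, not from the data.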

As dire as the Bass curves are, they are not that far off yet. I did the analysis in R, so I am pleased it chose R (itself) to outlast the other systems :). All jokes aside: forecasting helps you plan and adapt.

