The German fuel prices data set
[This article was first published on Quantitative Consulting - R analytics blog, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
data:image/s3,"s3://crabby-images/565dd/565dd483bbf16e6225563362e6381205025c8ee2" alt=""
The data set on German fuel prices contains the fuel prices, but not the sales, from more than 14000 fuel stations in Germany since June 2014. It is made available by the webservice Tankerkoenig as a Postgres dump (from June 2014 onwards) under CC4.0.
This is a particularly interesting data set
-
from the general public point of view as
- it deals with a topic under strong public scrutiny and is already intensively studied by economists
- the data is also constantly updated and allows for continuous monitoring for changes in trends or new phenomena
-
from the data science point of view as
- this is a real life data set with all the little quirks and bugs whose correction forms large part of an analysts working hours
- the data is small enough (~ 3GB in the original data dump) to be effectively prepared and analysed on a single computer, but big enough to allow to test different techniques for scaling and speeding up the analysis in a cluster
- it allows for the integration of many other (open) data sources (socio-economic data, weather, traffic, holidays). See the graph below.
- interesting results can already be obtained using purely descriptive methods, but the data can also be analysed and modelled under spatial–, time series–, panel– and many other aspects.
Overview of the available data and structure
To leave a comment for the author, please follow the link and comment on their blog: Quantitative Consulting - R analytics blog.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.