Site icon R-bloggers

A real-world, messy dataset to practice on

[This article was first published on R programming – Oscar Baruffa, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

At some point you may be looking for a “real world” dataset to practice analysis on or to give to students.

The value of such data is that it gives analysts a chance to develop skills they need for their work, but are hard to master when given “clean” datasets, especially inside a guided course.

I’ve found this dataset below which, apart from being actual, real-life data, has a few characteristics that makes it a good set to learn about data cleaning and then further analyzing.

The data is a Salary Survey from AskAManager.org. It’s US-centric-ish but does allow for a range of country inputs. I find salary surveys inherently interesting, but here’s some other notable aspects of this dataset.

If you’re using R, you can read the sheet using the googlesheets4 package.

You can of course make a copy of the sheet directly in Google sheets, or you can download it in multiple formats.

Happy analyzing!

Social media preview image: Photo by Wonderlane on Unsplash

< !-- Begin Mailchimp Signup Form -->
  • Don’t miss any updates, sign up below. I don’t post very often

    * indicates required
    < !-- real people should not fill this in and expect good things - do not remove this or risk form bot signups-->
    < !--End mc_embed_signup-->

    The post A real-world, messy dataset to practice on appeared first on Oscar Baruffa.

    To leave a comment for the author, please follow the link and comment on their blog: R programming – Oscar Baruffa.

    R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
    Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.