Articles by R on The Data Sandbox

Network Graphs in R

July 11, 2022 | R on The Data Sandbox

Introduction Network graphs are an important tool for network analysis. They illustrate points, referred to as nodes, with connecting lines, referred to as edges. Since network graphs are such useful tools, there are many options for graph generation. In this posting, I will demonstrate three different techniques for developing network ...
[Read more...]

Relationship Extraction with Spacyr

July 3, 2022 | R on The Data Sandbox

This is the continuation of the previous project were we scrapped the Cooper Mind website with the rvest package. Please refer to that posting for the necessary steps to obtain the verified character names. As a reminder, this project was inspired by the work of Thu Vu were she created ... [Read more...]

Webscraping in R with Rvest

June 21, 2022 | R on The Data Sandbox

Web scraping has become an incredibly important tool in data science, as an easy way to generate new data. The main advantage is the automation of some pretty repetitive tasks. Web scrapping can also be a good way of keeping up with new data on a webs... [Read more...]

Text Prediction Shiny App pt 2

June 7, 2022 | R on The Data Sandbox

Description This is the second part for the creation of a text prediction Shiny Application. From the previous post, we have developed and Corpus of text to start creating text prediction applications. We have also explored the corpus, looking at the...
[Read more...]

Text Prediction Shiny App pt 1

May 30, 2022 | R on The Data Sandbox

This Shiny App was first written in May of 2021 Description The goal of this project was to create an N-gram based model to predict the word to follow the user’s input. This project was to complete the Capstone project for the Johns Hopkins Univers... [Read more...]

Level up your programming skills

April 30, 2022 | R on The Data Sandbox

How do you become a better programmer? Well, there is strong scientific evidence for the support of the principle of deliberate practice. Deliberate practice is a method of skill development first written by Anders Ericsson in the book “Peak: Secrets ... [Read more...]

Dashboards in R with Shiny Dashboard

April 19, 2022 | R on The Data Sandbox

In a previous post, I explore the Flex dashboard library for the creation of a clean and interactive dashboard. That post can be found here. Unknown to me at the time, but I sort of skipped over the more natural progression of creating a dashboard with R Shiny. This is ...
[Read more...]

Benchmarking Data Tables

April 12, 2022 | R on The Data Sandbox

When I started learning R, I heard vague tales of the use of Data Tables. Really just whisperers, of something to consider in the future after I’ve become more proficient. Well now is the time to learn what if anything I’ve been missing out on. Intr... [Read more...]

R-Bloggers site

April 10, 2022 | R on The Data Sandbox

I would like to take the time to mention the r-bloggers site. It is a vast collection of Blogs on everything that has to do will the R language. I would very much like to contribute to their work with this blog. Just to keep it interesting, I would l... [Read more...]

Underrated CRAN Packages

March 30, 2022 | R on The Data Sandbox

I sit here looking for inspiration, nothing interesting to write about. Perhaps there are some popular R packages on CRAN that I don’t know about? You can explore the data on downloads from CRAN with the cranlogs package. Top CRAN downloads With the...
[Read more...]

Creating Dashboards in R

March 9, 2022 | R on The Data Sandbox

Dashboards are a great way to demonstrate knowledge and engage decision makers. Their utility has made PowerBI and Tableau household names. And while these solutions do support R and Python scripts and visualizations, the Flexdashboard package seeks ... [Read more...]

Python in R Markdown

March 2, 2022 | R on The Data Sandbox

Photo by David Clode on Unsplash The main advantage of using the R Markdown format is the utility of running R code within the text. This is clearly more advantageous than just writing code in a Markdown file. R Markdown is however limited to R cod...
[Read more...]

Bike shares in Toronto

February 26, 2022 | R on The Data Sandbox

Photo by Maarten van den Heuvel on Unsplash This article is based on a project written on 01/14/2021 Bike Rental Shiny App This application use the data collected from the Toronto Open Data to generate a histogram of the usage of rental bikes in ... [Read more...]

New features in R

February 22, 2022 | R on The Data Sandbox

Photo by Clint Patterson on Unsplash Recently I had updated my RStudio client and with it came a new update to R. This is an exploration of some of the most interesting changes from R 4.0 to R 4.1. Native Pipe Function Due to the extreme popularity of the magrittr pipe (‘%__%’), R ...
[Read more...]

Speed cameras in Toronto

February 15, 2022 | R on The Data Sandbox

Photo by Sepideh Golchin Rad on Unsplash This project was originally written on 02/01/2021 as part of the Data Products course for the Data Science Specialization from Johns Hopkins University on Coursera Objective This report plots the speed cameras in the Greater Toronto Area from the data provided by Open Toronto, ... [Read more...]

Fancy Tables in R

February 10, 2022 | R on The Data Sandbox

Photo by Juan Gomez on Unsplash Introduction As a continuation from my previous post exploring the use of the Stargazer library to create better looking tables, I thought I would look into the GT library. The GT library takes a different approach by creating an object class with the GT ...
[Read more...]

Job posting analysis

February 5, 2022 | R on The Data Sandbox

Recently, there was a post on medium about the use of Natural Language Processing (NLP) to study a job posting for keywords. I found that this article was very similar to R shiny App that I created a while ago. 1 Introduction Technology has changed the job application process, making it ... [Read more...]

Fitness Tracker Modeling: ML

January 28, 2022 | R on The Data Sandbox

The original paper was written on 12/18/2020 Executive Summary This report analyzes collected data on different users preforming barbell lifts performed at different levels of quality. A machine learning algorithm was used to create a model to dete...
[Read more...]

Never miss an update!
Subscribe to R-bloggers to receive
e-mails with the latest R posts.
(You will not see this message again.)

Click here to close (This popup will not appear again)