Short Attention Span Theatre: Reproducing Axios’ “1 Big Thing” Google Trends 2019 News In Review with {ggplot2}
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
I woke up to Axios’ “1 Big Thing” ridgeline chart showing the crazy that was the 2019 news cycle:
and, I decided to reproduce it in {ggplot2}.
Getting The Data
First, I had to find the data. The Axios chart is interactive, so I assumed the visualization was built on-load. It was, but the data was embedded in a javascript file vs loaded as JSON via an XHR request:
which was easy enough to turn into JSON anyone can use.
library(ggalt)
library(hrbrthemes) # hrbrmstr/hrbrthemes
library(tidyverse)
jsonlite::fromJSON("https://rud.is/dl/2019-axios-news.json") %>% 
  as_tibble() -> xdf
xdf
## # A tibble: 31 x 3
##    name                    avg data      
##    <chr>                 <dbl> <list>    
##  1 Gov't shutdown        20.5  <int [51]>
##  2 Mexico-U.S. border    22.8  <int [51]>
##  3 Green New Deal        11.3  <int [51]>
##  4 Blackface              9.61 <int [51]>
##  5 N. Korea-Hanoi Summit 11.2  <int [51]>
##  6 Boeing 737 Max         4.79 <int [51]>
##  7 Brexit                28.5  <int [51]>
##  8 Israel                42.1  <int [51]>
##  9 SpaceX                24.1  <int [51]>
## 10 Game of Thrones       16.8  <int [51]>
## # … with 21 more rows
This is pretty tidy already, but we’ll need to expand the data column and give each week an index:
unnest(xdf, data) %>% group_by(name) %>% mutate(idx = 1:n()) %>% ungroup() %>% mutate(name = fct_inorder(name)) -> xdf # making a factor foe strip/panel ordering xdf ## # A tibble: 1,581 x 4 ## name avg data idx ## <fct> <dbl> <int> <int> ## 1 Gov't shutdown 20.5 69 1 ## 2 Gov't shutdown 20.5 100 2 ## 3 Gov't shutdown 20.5 96 3 ## 4 Gov't shutdown 20.5 100 4 ## 5 Gov't shutdown 20.5 19 5 ## 6 Gov't shutdown 20.5 9 6 ## 7 Gov't shutdown 20.5 17 7 ## 8 Gov't shutdown 20.5 3 8 ## 9 Gov't shutdown 20.5 2 9 ## 10 Gov't shutdown 20.5 1 10 ## # … with 1,571 more rows
We’ll take this opportunity to find the first week of each month (via rle()) so we can have decent axis labels:
# get index placement for each month axis label
sprintf("2019-%02s-1", 1:51) %>% 
  as.Date(format = "%Y-%W-%w") %>% 
  format("%b") %>% 
  rle() -> mons
mons
## Run Length Encoding
##   lengths: int [1:12] 4 4 4 5 4 4 5 4 5 4 ...
##   values : chr [1:12] "Jan" "Feb" "Mar" "Apr" "May" "Jun" "Jul" "Aug" "Sep" "Oct" ...
month_idx <- cumsum(mons$lengths)-3
month_idx
##  [1]  1  5  9 14 18 22 27 31 36 40 44 48
We’ve got all we need to make a {ggplot2} version of the chart. Here’s the plan:
- use geom_area()and map colour and fill toavg(like Axios did), using an medium alpha value so we can still see below the overlapped areas
- also use an xspline()stat withgeom_area()so we get smooth lines vs pointy ones
- use geom_hline()vs an axis line so we can map a colour aesthetic toavgas well
- make a custom x-axis scale so we can place the labels we just made
- expand the y-axis upper limit to avoid cutting off any part of the geoms
- use the infernoviridis palette, but not the extremes of it
- make facets/panels on the name, positioning the labels on the left
- finally, tweak strip positioning so we get overlapped charts
ggplot(xdf, aes(idx, data)) +
  geom_area(alpha = 1/2, stat = "xspline", aes(fill = avg, colour = avg)) +
  geom_hline(
    data = distinct(xdf, name, avg),
    aes(yintercept = 0, colour = avg), size = 0.5
  ) +
  scale_x_continuous(
    expand = c(0,0.125), limits = c(1, 51),
    breaks = month_idx, labels = month.abb
  ) +
  scale_y_continuous(expand = c(0,0), limits = c(0, 105)) +
  scale_colour_viridis_c(option = "inferno", direction = -1, begin = 0.1, end = 0.9) +
  scale_fill_viridis_c(option = "inferno", direction = -1, begin = 0.1, end = 0.9) +
  facet_wrap(~name, ncol = 1, strip.position = "left", dir = "h") +
  labs(
    x = NULL, y = NULL, fill = NULL, colour = NULL,
    title = "1 big thing: The insane news cycles of 2019",
    subtitle = "Height is search interest in a given topic, indexed to 100.\nColor is average search interest between Dec. 30, 2018–Dec. 20, 2019",
    caption = "Source: Axios <https://www.axios.com/newsletters/axios-am-1d9cd913-6142-43b8-9186-4197e6da7669.html?chunk=0#story0>\nData: Google News Lab. Orig. Chart: Danielle Alberti/Axios"
  ) +
  theme_ipsum_es(grid="X", axis = "") +
  theme(strip.text.y = element_text(angle = 180, hjust = 1, vjust = 0)) +
  theme(panel.spacing.y = unit(-0.5, "lines")) +
  theme(axis.text.y = element_blank()) +
  theme(legend.position = "none")
To produce this finished product:
FIN
The chart could be tweaked a bit more to get even closer to the Axios finished product.
Intrepid readers can also try to use {plotly} to make an interactive version.
Somehow, I get the feeling 2020 will have an even more frenetic news cycle.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
 


