[This article was first published on DataScience+, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
Hello everyone! In this post, I will show you how you can use Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
rbokeh
to build interactive graphs and maps in R.
What is bokeh?
Bokeh is a popular python library used for building interactive plots and maps, and now it is also available in R, thanks to Ryan Hafen. It is a very powerful for creating good looking plots for the web easily, and it is fully compatible with shiny. Generally, plotting in bokeh is done by adding layers to a plot, similar toggplot2
. For creating a simple plot, there are two main steps involved:
figure()
– This will initialize the bokeh plot. It has a variety of parameters to set width, height, title, and axes parameters.ly_geom()
– This will specify the type of geom you want to use. There are a variety of options, includingly_points
,ly_lines
,ly_hist
,ly_boxplot
, etc. Each of these have parameters which allow for specifying size, color, what to show on hover, etc.
iris
dataset. Let’s recreate the visualization using rbokeh
:
clusters <- hclust(dist(iris[, 3:4]), method = 'average') clusterCut <- cutree(clusters, 3) p <- figure(title = 'Hierarchical Clustering of Iris Data') %>% ly_points(Petal.Length, Petal.Width, data = iris, color = Species, hover = c(Sepal.Length, Sepal.Width)) %>% ly_points(iris$Petal.Length, iris$Petal.Width, glyph = clusterCut, size = 13) pwhich gives us the following plot:
aapl <- read.csv('aapl.csv') aapl$Date <- as.Date(aapl$Date) p <- figure(title = 'Apple Stock Data') %>% ly_points(Date, Volume / (10 ^ 6), data = aapl, hover = c(Date, High, Open, Close)) %>% ly_abline(v = with(aapl, Date[which.max(Volume)])) %>% y_axis(label = 'Volume in millions', number_formatter = 'numeral', format = '0.00')which gives us the following plot (with a vertical line on the date with the highest amount of volume):
rbokeh
:
SFData <- read.csv('SFPD_Incidents_-_Previous_Year__2015_.csv') data <- subset(SFData, Category == 'BRIBERY' | Category == 'SUICIDE') p <- gmap(lat = 37.78, lng = -122.42, zoom = 13) %>% ly_points(Y, X, data = data, hover = c(Category, PdDistrict), col = 'red') %>% x_axis(visible = FALSE) %>% y_axis(visible = FALSE)which gives us the following plot:
facet_grid
feature from ggplot2
as follows. We will use the diamonds
dataset from ggplot2
.
diamonds <- ggplot2:: diamonds l <- levels(diamonds$color) plot_list <- vector(mode = 'list', 7) for (i in 1:length(l)) { data <- subset(diamonds, color == l[i]) plot_list[[i]] <- figure(width = 350, height = 350) %>% ly_points(carat, price, data = data, legend = l[i], hover = c(cut, clarity)) } grid_plot(plot_list, nrow = 2)which gives us this plot:
To leave a comment for the author, please follow the link and comment on their blog: DataScience+.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.