Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
The post Cumulative Sum calculation in R appeared first on Data Science Tutorials
Cumulative Sum calculation in R, using the dplyr package in R, you can calculate the cumulative sum of a column using the following methods.
Best online course for R programming – Data Science Tutorials
Approach 1: Calculate Cumulative Sum of One Column
df %>% mutate(cum_sum = cumsum(var1))
Approach 2: Calculate Cumulative Sum by Group
df %>% group_by(var1) %>% mutate(cum_sum = cumsum(var2))
The examples below demonstrate how to apply each strategy in practice.
One way ANOVA Example in R-Quick Guide – Data Science Tutorials
Example 1: Using dplyr, calculate the cumulative sum.
Let’s say we have the following R data frame:
Let’s make a dataset
df <- data.frame(day=c(1, 2, 3, 4, 5, 6, 7, 8), sales=c(57, 42, 50, 99, 59, 51, 58, 45))
Now we can view the dataset
df day sales 1 1 57 2 2 42 3 3 50 4 4 99 5 5 59 6 6 51 7 7 58 8 8 45
To create a new column that holds the cumulative sum of the values in the ‘sales’ column, use the following code.
How to Use the Multinomial Distribution in R? – Data Science Tutorials
library(dplyr)
Let’s calculate the cumulative sum of sales
df %>% mutate(cum_sales = cumsum(sales)) day sales cum_sales 1 1 57 57 2 2 42 99 3 3 50 149 4 4 99 248 5 5 59 307 6 6 51 358 7 7 58 416 8 8 45 461
Example 2: Using dplyr, calculate the Cumulative Sum by Group.
Let’s say we have the following R data frame.
Dealing With Missing values in R – Data Science Tutorials
Make a dataset
df <- data.frame(store=c('X', 'X', 'X', 'X', 'Y', 'Y', 'Y', 'Y'), day=c(1, 2, 3, 4, 1, 2, 3, 4), sales=c(87, 82, 80, 98, 98, 81, 88, 83))
View the dataset now
df X 1 87 2 X 2 82 3 X 3 80 4 X 4 98 5 Y 1 98 6 Y 2 81 7 Y 3 88 8 Y 4 83
To construct a new column that holds the cumulative sum of the values in the ‘sales’ column, grouped by the ‘store’ column, we can use the following code:
library(dplyr)
Now we can calculate the cumulative sum of sales by store.
Methods for Integrating R and Hadoop complete Guide – Data Science Tutorials
df %>% group_by(store) %>% mutate(cum_sales = cumsum(sales)) store day sales cum_sales <chr> <dbl> <dbl> <dbl> 1 X 1 87 87 2 X 2 82 169 3 X 3 80 249 4 X 4 98 347 5 Y 1 98 98 6 Y 2 81 179 7 Y 3 88 267 8 Y 4 83 350
The post Cumulative Sum calculation in R appeared first on Data Science Tutorials
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.