Just Quickly: What I usually want from stringr

Blog on Credibly Curious

3 years ago

[This article was first published on Blog on Credibly Curious, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)

Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

TL;DR str_subset(string, pattern) returns the strings that match a pattern.

I don’t often need to work with string data, but when I do, I usually jump to two tools:

grepl, and
stringr.

What I usually want to do is return strings that match some pattern.

For example, say there are 5 items:

items <- c("thing1",
           "thing2",
           "sacvy",
           "item.csv",
           "wat.csv")

Then I can return those items by writing something like this.

items[grepl(".csv$", items)]

## [1] "item.csv" "wat.csv"

Let’s break that down:

This reads as:

Does this end with csv in the object items?

grepl(".csv$", items)

## [1] FALSE FALSE FALSE  TRUE  TRUE

And we can then put this inside the square braces [ of items, to return those that match this pattern:

items[grepl(".csv$", items)]

## [1] "item.csv" "wat.csv"

stringr makes this somewhat more straightforward. First, you can use str_detect instead of grepl. This is nice because it takes the strings first, then the pattern. This makes it more consistent with other tidy tools, which have the data first, then the options of the function:

library(stringr)

str_detect(items, ".csv$")

## [1] FALSE FALSE FALSE  TRUE  TRUE

But what if I just want things returned that match that?

You want str_subset. Think of it like this – subset is another word for filter, which you might be more familiar with:

str_subset(items, ".csv$")

## [1] "item.csv" "wat.csv"

And that’s it, that’s all I have to say.

To leave a comment for the author, please follow the link and comment on their blog: Blog on Credibly Curious.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.

Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.