Site icon R-bloggers

Plot ranges of data in R

[This article was first published on Burns Statistics » R language, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.

How to control the limits of data values in R plots.

R has multiple graphics engines.  Here we will talk about the base graphics and the ggplot2 package.

We’ll create a bit of data to use in the examples:

one2ten <- 1:10

ggplot2 demands that you have a data frame:

ggdat <- data.frame(first=one2ten, second=one2ten)

Seriously exciting data, yes?

Default behavior

The default is — not surprisingly — to create limits so that the data comfortably fit.

Base

Figure 1 shows the default limits.  The command to do this is (almost):

plot(one2ten, one2ten)

The actual commands to create this figure and others are below in Appendix R.

Figure 1: Default base graphics plot.

ggplot2

The default here is basically the same, though the resulting picture looks rather different.  Figure 2 is created with the commands:

require(ggplot2)
print(qplot(first, second, data=ggdat))

The require command makes sure that the package is loaded into the current R session.  In this setting graphs are objects, and they are rendered only when they are printed — hence the call to print.

Figure 2: Default ggplot2 plot.

Typical limit

If the default isn’t what you want, you can change it.  Here we want — for some reason — more room on the left of the plot.

Base

The simplified version of the command for Figure 3 is:

plot(one2ten, one2ten, xlim=c(-2,10))

Figure 3: Typical use of the xlim graphics parameter.

The examples here are on the x-axis.  To control the y-axis, just substitute “y” for “x” — ylim rather than xlim.

ggplot2

In ggplot2 modifications or additions to a plot object are usually done by adding new terms:

print(qplot(first, second, data=ggdat) + xlim(-2, 10))

Figure 4: Typical ggplot2 specification of x limits.

Reverse direction

The first limit need not be the smallest — if not, then the axis is reversed.

Base

Figure 5: Base graph with the x-axis reversed.

ggplot2

The command for Figure 6 is:

print(qplot(first, second, data=ggdat) + xlim(10, 1))

Figure 6: ggplot2 with the x-axis reversed.

Expansion

The sharp-eyed will have noticed that the actual limits in the plots above are not what is specified.  The specifications are strictly inside the plots.  This makes it easy to make sure that no data is plotted on the boundary of the plot.

It is possible to change this behavior as well.

Base

We can see what the real range is by looking at the usr graphics parameter:

> plot(one2ten, one2ten, xlim=c(0,10))
> par("usr")
[1] -0.40 10.40  0.64 10.36

You can set some graphics parameters, and some are read-only.  The usr parameter can be set, but you almost always want to just use it as is.  The first two elements of usr are the x-axis limits, the last two are the y-axis limits.

The x-axis was asked to have limits 10 apart, and we can see that there is an extra 0.4 on each side.

You can force the limits to be taken literally by specifying xaxs (or yaxs for the y-axis):

> plot(one2ten, one2ten, xlim=c(0,10), xaxs="i")
> par("usr")
[1]  0.00 10.00  0.64 10.36
> plot(one2ten, one2ten, xlim=c(0,10), xaxs="i", yaxs="i")
> par("usr")
[1]  0 10  1 10

The "i" stands for “internal”, the other (default) choice is "r" as in “regular”.

ggplot2

Figure 7 is produced with:

print(qplot(first, second, data=ggdat) +
   scale_x_continuous(expand=c(0,0)))

Figure 7: ggplot2 with no x-axis expansion.

I wasn’t able to both modify the expansion and xlim — I’m not sure if that is a bug or solely due to my ignorance of ggplot2.

Appendix R

The actual commands that created the base graphics are:

Figure 1:

plot(one2ten, one2ten, col="royalblue", lwd=2)
text(7, 1.7, " default xlim (and ylim)")

Figure 3:

plot(one2ten, one2ten, col="royalblue", lwd=2, xlim=c(-2,10))
text(7, 1.7, "xlim = c(-2, 10)")

Figure 5:

plot(one2ten, one2ten, col="royalblue", lwd=2, xlim=c(10,1))
text(7, 1.7, "xlim = c(10, 1)")

The post Plot ranges of data in R appeared first on Burns Statistics.

To leave a comment for the author, please follow the link and comment on their blog: Burns Statistics » R language.

R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.