Circular or spherical data, and density estimation
[This article was first published on Freakonometrics - Tag - R-english, and kindly contributed to R-bloggers]. (You can report issue about the content on this page here)
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
I few years ago, while I was working on kernel based density estimation
on compact support distribution (like copulas) I went through a series
of papers on circular distributions. By that time, I thought it was
something for mathematicians working on weird spaces…. but during the
past weeks, I saw several potential applications of those estimators.Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
- circular data density estimation
data:image/s3,"s3://crabby-images/37e69/37e69351b091fc6f58fd4bdcf85294c0d4e53892" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-01.gif"
data:image/s3,"s3://crabby-images/b039e/b039eb8d24b78de28bc910898b036a86261c0d82" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-02.gif"
data:image/s3,"s3://crabby-images/9eaa3/9eaa3a57849d6227550d7058041f126186bedd64" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-03.gif"
von Mises proposed a parametric model in 1918 (see here or there), assuming that
data:image/s3,"s3://crabby-images/c933c/c933cd9e1c9c193445204f70bb42711d72afa707" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-04.gif"
data:image/s3,"s3://crabby-images/1d39c/1d39c3795c2ea808ae7712a47be280f2f46e2372" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-05.gif"
data:image/s3,"s3://crabby-images/eb930/eb930e6b98790e970c606a0113e695d23ff70a4e" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-06.gif"
data:image/s3,"s3://crabby-images/7d152/7d15268b408c51dd4ebdecd0a408b23710b92eb1" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-07.gif"
From a series of observed angles
data:image/s3,"s3://crabby-images/3f1e3/3f1e3679951301291ddc4aa31990e94d368e786a" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-08.gif"
data:image/s3,"s3://crabby-images/95dca/95dcaf7bb1d7442631c01d731c6ef3f8f7a3c236" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-09.gif"
data:image/s3,"s3://crabby-images/c55e0/c55e00b2d3bb475a6e92239e98e37921d99b7b03" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-10.gif"
data:image/s3,"s3://crabby-images/11916/11916fb757dc56b660dad09f63e93e8067839b81" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-11.gif"
data:image/s3,"s3://crabby-images/8a7f2/8a7f2313a791443ecaba7090178f409a31133a6e" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-12.gif"
- density estimation for hours on simulated data
data:image/s3,"s3://crabby-images/1e582/1e58233685c8fed1679fa4f5b5b7885034a04fad" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-13.gif"
data:image/s3,"s3://crabby-images/b27aa/b27aaf484c94953294f9a7f68696fae9ebbbffe5" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-14.gif"
data:image/s3,"s3://crabby-images/b27aa/b27aaf484c94953294f9a7f68696fae9ebbbffe5" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-14.gif"
data:image/s3,"s3://crabby-images/ce4b4/ce4b4137260a3c01e03c28cdd54f975baaab7a6d" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-15.gif"
set.seed(1) library(circular) X=rbeta(100,shape1=2,shape2=4)*24 Omega=2*pi*X/24 Omegat=2*pi*trunc(X)/24 H=circular(Omega,type="angle",units="radians",rotation="clock") Ht=circular(Omegat,type="angle",units="radians",rotation="clock") plot(Ht, stack=FALSE, shrink=1.3, cex=1.03, axes=FALSE,tol=0.8,zero=c(rad(90)),bins=24,ylim=c(0,1)) points(Ht, rotation = "clock", zero =c(rad(90)), col = "1", cex=1.03, stack=TRUE ) rose.diag(Ht-pi/2,bins=24,shrink=0.33,xlim=c(-2,2),ylim=c(-2,2), axes=FALSE,prop=1.5)
circ.dens = density(Ht+3*pi/2,bw=20) plot(Ht, stack=TRUE, shrink=.35, cex=0, sep=0.0, axes=FALSE,tol=.8,zero=c(0),bins=24, xlim=c(-2,2),ylim=c(-2,2), ticks=TRUE, tcl=.075) lines(circ.dens, col="darkgrey", lwd=3) text(0,0.8,"24", cex=2); text(0,-0.8,"12",cex=2); text(0.8,0,"6",cex=2); text(-0.8,0,"18",cex=2)The c
data:image/s3,"s3://crabby-images/87627/87627e7c98282ecd177e06f2a88348e3c7abd4e3" alt="dens-circ-est5.jpeg, mar. 2011"
data:image/s3,"s3://crabby-images/07534/075344c84f4a6730f3421717838220fb5811be32" alt="http://freakonometrics.blog.free.fr/public/perso2/Xi.gif"
data:image/s3,"s3://crabby-images/5a594/5a594039e815a79b089b7f52835564bf58f56ee3" alt="http://freakonometrics.blog.free.fr/public/perso2/circular-density-3.gif"
data:image/s3,"s3://crabby-images/0569e/0569e8a626f16d536eafc59597a7e92b6437dd2e" alt="http://freakonometrics.blog.free.fr/public/perso2/0-24.gif"
data:image/s3,"s3://crabby-images/07534/075344c84f4a6730f3421717838220fb5811be32" alt="http://freakonometrics.blog.free.fr/public/perso2/Xi.gif"
The code is simply the following
U=seq(0,1,by=1/250) O=U*2*pi U12=seq(0,1,by=1/24) O12=U12*2*pi X=rbeta(100,shape1=2,shape2=4)*24 OM=2*pi*X/24 XL=c(X-24,X,X+24) d=density(X) d=density(XL,bw=d$bw,n=1500) I=which((d$x>=6)&(d$x<=30)) Od=d$x[I]/24*2*pi-pi/2 Dd=d$y[I]/max(d$y)+1 plot(cos(O),-sin(O),xlim=c(-2,2),ylim=c(-2,2), type="l",axes=FALSE,xlab="",ylab="") for(i in pi/12*(0:12)){ abline(a=0,b=tan(i),lty=1,col="light yellow")} segments(.9*cos(O12),.9*sin(O12),1.1*cos(O12),1.1*sin(O12)) lines(Dd*cos(Od),-Dd*sin(Od),col="red",lwd=1.5) text(.7,0,"6"); text(-.7,0,"18") text(0,-.7,"12"); text(0,.7,"24") R=1/24/max(d$y)/3+1 lines(R*cos(O),R*sin(O),lty=2)
Note that it is possible to stress more (visually) on hours having few phone calls, or a lot (compared with an homogeneous Poisson process), e.g.
plot(cos(O),-sin(O),xlim=c(-2,2),ylim=c(-2,2), type="l",axes=FALSE,xlab="",ylab="") for(i in pi/12*(0:12)){ abline(a=0,b=tan(i),lty=1,col="light yellow")} segments(2*cos(O12),2*sin(O12),1.1*cos(O12),1.1*sin(O12), col="light grey") segments(.9*cos(O12),.9*sin(O12),1.1*cos(O12),1.1*sin(O12)) text(.7,0,"6") text(-.7,0,"18") text(0,-.7,"12") text(0,.7,"24") R=1/24/max(d$y)/3+1 lines(R*cos(O),R*sin(O),lty=2) AX=R*cos(Od);AY=-R*sin(Od) BX=Dd*cos(Od);BY=-Dd*sin(Od) COUL=rep("blue",length(AX)) COUL[R<Dd]="red" CM=cm.colors(200) a=trunc(100*Dd/R) COUL=CM[a] segments(AX,AY,BX,BY,col=COUL,lwd=2) lines(Dd*cos(Od),-Dd*sin(Od),lwd=2)
We get here those two graphs,
- density of wind direction
data:image/s3,"s3://crabby-images/9b22f/9b22f9ee9164bcfd775aa358e079966f90c86d84" alt="boussole-2.jpg, mar. 2011"
- density of 911 phone calls
|
|
|
|
- density of earth temperatures, or earthquakes
data:image/s3,"s3://crabby-images/16d3d/16d3d5c1d0811bd56b53adb5499e5d7b890af910" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-16.gif"
data:image/s3,"s3://crabby-images/23941/239410638f3eae3d47ba3130e4aa3eabb1ed79ee" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-18.gif"
data:image/s3,"s3://crabby-images/05345/05345a7e2178f86f5ce3f3471f09b6c772451cb3" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-17.gif"
data:image/s3,"s3://crabby-images/0ffe3/0ffe329c5a6f1e669f717f553d48b599b3ab2a1d" alt="http://freakonometrics.blog.free.fr/public/perso2/circ-19.gif"
library(ks) X=cbind(EQ$Longitude,EQ$Latitude) Hpi1 = Hpi(x = X) DX=kde(x = X, H = Hpi1) library(maps) map("world") plot(DX,add=TRUE,col="red") points(X,cex=.2,col="blue") Y=rbind(cbind(X[,1],X[,2]),cbind(X[,1]+360,X[,2]), cbind(X[,1]-360,X[,2]),cbind(X[,1],X[,2]+180), cbind(X[,1]+360,X[,2]+180),cbind(X[,1]-360,X[,2]+180), cbind(X[,1],X[,2]-180),cbind(X[,1]+360, X[,2]-180),cbind(X[,1]-360,X[,2]-180)) DY=kde(x = Y, H = Hpi1) library(maps) plot(DY,add=TRUE,col="purple")Without any correction, we get the red level curves. The pink one integrates correction.
To leave a comment for the author, please follow the link and comment on their blog: Freakonometrics - Tag - R-english.
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.