Conditional Distributions from some Elliptical Vectors
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.
This winter, in my ACT8595 course, I asked my students (that was some homework) to prove that it was possible to derive the conditional distribution when we have a Student-t random vector (and to get the analytical expression of the later). But before, let us recall a standard result about the Gaussian vector. If is a Gaussian random vector, i.e.
then has a Gaussian distribution. More precisely, it is a distribution, with
and is the Schur complement of the block of the matrix ,
Observe that is also related to well known quantity: in the bivariate case, where and are univariate Gaussian variables,
which is the slope in the linear regression of on .
In the case of the Student-t distribution, the conditional distrubution will not be a Student-t distribution anymore, but it will still be an elliptical distribution, and some interpretations of various quantities can actually be obtained.
The density of the multivariate centred Student-t distribution, with unit variance, and parameters and is
If we consider the following blocks,
then we can get that marginal distributions have a centred Student-t distribution, with unit variance, and parameters and ,
Then, to derive the conditional density, we can use Bayes formula,
One can write (as in Section 9.1 in Tong, 1990, The Multivariate Normal Distribution, but other expressions can be found in Section 2.5 in Fang, Ng and Kotz, 1989, Symmetric multivariate and related distributions, or in Section 1.11 in Kotz and Nadarajah, 2004, Multivariate t distributions and their applications) this conditional density as
with
and
This conditional distribution is elliptical, but it is not a Student-t distribution, except in the case where , or when the correlation matrix is the identity.
Now, if we look at the components of this densiy, we can observe that we have
which was mentioned previously, in the Gaussian case: the term on the right is the conditional mean,
and the bloc that appears at several places is the conditional variance,
Now, if we want to visualize that conditional density, let us plot it. The code below is based on Bayes formula
> library(mnormt) > r=.6 > R=matrix(c(1,r,r,1),2,2) > nu=4 > f2=function(x2) dt(x2,df=nu) > f =function(x) dmt(x,S=R,df=nu) > f1.2=function(x1,x2) f(c(x1,x2))/f2(x2)
In order to compare that conditional density with a Student-t one, let us define the density of a non-centred Student-t random variable,
> dstd=function(x,mu,s,nu) gamma((nu+1)/2)/ + (gamma(nu/2)*s*sqrt(pi*nu))* + (1+1/nu*(x-mu)^2/(s^2))^(-(nu+1)/2)
Here is the function we can use to plot those two densities,
> graphdensity=function(x2=-1.5){ + vectx1=seq(-3,3,length=251) + y=Vectorize(function(x) f1.2(x,x2))(vectx1) + plot(vectx1,y,type="l",col="red",ylim=c(0,.5), + xlab="",ylab="") + abline(v=r*x2,lty=2) + lines(vectx1,dstd(vectx1,x2*r,sqrt(1-r^2),nu),col="blue",lty=2)} > graphdensity(-1.5)
In the case where , the two lines are rather close (the difference migth come from computational issues)
> graphdensity(-1)
and just to conclude, a last one
> graphdensity(0)
R-bloggers.com offers daily e-mail updates about R news and tutorials about learning R and many other topics. Click here if you're looking to post or find an R/data-science job.
Want to share your content on R-bloggers? click here if you have a blog, or here if you don't.