Étant donné trois vecteurs , b et c , est-il possible que les corrélations entre a et b , a et c et b et c soient toutes négatives? Est-ce possible?
correlation
correlation-matrix
Antti A
la source
la source
Réponses:
Cela est possible si la taille du vecteur est de 3 ou plus. Par exemple
Les corrélations sont
On peut prouver que pour des vecteurs de taille 2 ce n'est pas possible:
Similarly for correlations between (a,c) and (b,c) we get
Clearly, all of these three formulas can not hold in the same time.
la source
Yes, they can.
Suppose you have a multivariate normal distributionX∈R3,X∼N(0,Σ) .
The only restriction on Σ is that it has to be positive semi-definite.
So take the following exampleΣ=⎛⎝⎜1−0.2−0.2−0.21−0.2−0.2−0.21⎞⎠⎟
Its eigenvalues are all positive (1.2, 1.2, 0.6), and you can create vectors with negative correlation.
la source
let's start with a correlation matrix for 3 variables
non-negative definiteness creates constraints for pairwise correlationsp,q,r which can be written as
For example, ifp=q=−1 , the values of r is restricted by 2r≥r2+1 , which forces r=1 . On the other hand if p=q=−12 , r can be within 2±3√4 range.
Answering the interesting follow up question by @amoeba: "what is the lowest possible correlation that all three pairs can simultaneously have?"
Letp=q=r=x<0 , Find the smallest root of 2x3−3x2+1 , which will give you −12 . Perhaps not surprising for some.
A stronger argument can be made if one of the correlations, sayr=−1 . From the same equation −2pq≥p2+q2 , we can deduce that p=−q . Therefore if two correlations are −1 , third one should be 1 .
la source
A simple R function to explore this:
As a function of
n
,f(n)
starts at 0, becomes nonzero atn = 3
(with typical values around 0.06), then increases to around 0.11 byn = 15
, after which it seems to stabilize:So, not only is it possible to have all three correlations negative, it doesn't seem to be terribly uncommon (at least for uniform distributions).
la source