Forum Discussion
How to compare two variables with unequal data samples with CORREL function?
Hi
I have two variables denoted A and B.
A has 5000 data points comprising data from the last 5000 days, whereas B only has 1000 data points comprising the last 1000 days.
When I use CORREL function between all 5000 data points from A and 1000 data points from B, I get a correlation value of -0,711.
However, when I only take the last 1000 data points from variable A (comprising the last 1000 days) and compare these with the 1000 data points from variable B by using CORREL function, I get a correlation value of 0,981.
I don't understand how there can be such a big difference. Should I assume that the correlation value of 0,981 is most correct since there exists a data point for each of the 1000 data points from variable A to be compared to by variable B (and vice versa), or I am missing something here?
Thanks for any help π
5 Replies
- PReaganBronze Contributor
You are manipulating giving a misinterpretation of your data by using the arrays "'A'!E:E" and "'B'!E:E". While these arrays are the same size, the arrays actually have different amounts of data points. You should only use the range in which there are data points in the CORREL() function.