Significance in R

2023-02-03 05:51 问答作者：

Ok, this is quite hard to explain, but I'm at a complete loss what to do. I'm a relative newcomer to R and although I can completely admire how powerful it is, I'm not too good at actually using it....

Basically, I have some very contrived data that I need to analyse (it wasn't me who chose this, I can assure you!). I have the right and left hand lengths of lots of people, as well as some numeric data that shows their sociability.

Now I would like to know if people who have significantly different lengths of hand are more or less sociable than those who have the same (leading into the research that 'symmetrical' people are more sociable and intelligent, etc.

I have got as far as loading the data into R, then I have no idea where to go from there. How on Earth do I start to separate those who are close to symmetrical to those who aren't to then start to do the analysis?

Ok, using Sacha's great advice, I did the cor.test and got the following:

P开发者_StackOverflow社区earson's product-moment correlation

data:  measurements$l.hand - measurements$r.hand and measurements$sociable 
t = 0.2148, df = 150, p-value = 0.8302
alternative hypothesis: true correlation is not equal to 0 
95 percent confidence interval:
 -0.1420623  0.1762437 
sample estimates:
       cor 
0.01753501

I have never used this test before, so am unsure how to intepret it...you wouldn't think I was on my fourth Scientific degree would you?! :(

A significance test is used to see if the aquired effect in the sample can be due to random error. It is not really usable for individual scores, but what you can do is standardize the differences.

If 'dat' is the name of your dataframe, containing the columns 'hand.left', 'hand.right' and 'social' then you can standardize the differences in this way:

stand=scale(dat$hand.left-dat$hand.right)

Then you can create a logical dummy vector indicating if a person has a likelihood of this difference given the sample:

dum=abs(stand)<1.96

Which you can then use in a t-test:

t.test(dat$social[dum],dat$social[!dum])

However, the problem with this is that due to the nature of standardizing the sample size in the equal hand group will be about 20 times larger then the inequal hand group, so you need a lot of observations. An alternative is to look at the correlation between sociability and the absolute hand differences:

cor.test(abs(dat$hand.left-dat$hand.right), dat$social)

Looks like a regression problem:

set.seed(1)
DF <- data.frame(lhand = rnorm(100, 15, sd = 2), rhand = rnorm(100, 15, sd = 2),
                 social = runif(100))
DF <- within(DF, handedness <- lhand - rhand)

mod <- lm(social ~ handedness, data = DF)
summary(mod)

Here this is no relationship (not suprising as these are just random data):

> summary(mod)

Call:
lm(formula = social ~ handedness, data = DF)

Residuals:
     Min       1Q   Median       3Q      Max 
-0.51113 -0.25170 -0.02336  0.26161  0.52042 

Coefficients:
             Estimate Std. Error t value Pr(>|t|)    
(Intercept)  0.507240   0.030700  16.522   <2e-16 ***
handedness  -0.007896   0.011670  -0.677      0.5    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1 

Residual standard error: 0.3051 on 98 degrees of freedom
Multiple R-squared: 0.00465,    Adjusted R-squared: -0.005506 
F-statistic: 0.4579 on 1 and 98 DF,  p-value: 0.5002

For your real problem, ask yourself if a linear approximation between social and handedness might be appropriate. So plot the data:

plot(social ~ handedness, data = DF)

Does it look bivariate Gaussian?

If the linear relationship looks OK, fit the linear model and then check the residuals for problems (curved patterns, non-constant variance, strong deviation from normality). Then move on from there. If a linear approximation doesn't seem right, can you linearise the relationship with a transformation? If yes, try that, if not how about trying a GLM? etc. etc...

You should examine your data...

hist( measurements$social )

How does it look?... normal?

You might try regression

stand = scale( measurements$l.hand - measurements$r.hand )
m <- lm( measurements$social ~ stand )
m
summary(m)
anova(m)

You can also plot the model and it will give you diagnostics

plot(m)

Please type the phrase ?command (e.g. ?lm) for all of the commands that you try here. It will help you understand what is going on.

继续阅读：correlation r statistics

Significance in R

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？