R语言Fisher检验探究地区间公寓价格的关系-阿里云开发者社区

R语言Fisher检验探究地区间公寓价格的关系

2024-04-17 132

版权

本文内容由阿里云实名注册用户自发贡献，版权归原作者所有，阿里云开发者社区不拥有其著作权，亦不承担相应法律责任。具体规则请查看《阿里云开发者社区用户服务协议》和《阿里云开发者社区知识产权保护指引》。如果您发现本社区中有涉嫌抄袭的内容，填写侵权投诉表单进行举报，一经查实，本社区将立刻删除涉嫌侵权内容。

简介： R语言Fisher检验探究地区间公寓价格的关系

本文使用波兰公寓价格数据说明Fisher检验。



with(data = apart , boxplot(price ~ dis ))

我们在这里对公寓进行分组（这也可以通过简单的回归，这里5个解释变量并不重要）。我们可以重新排列


A = A[order(A$x),]

我们以这里最便宜的地区为参考，




Coefficients:Estimate Std. Error t value Pr(&gt;|t|)(Intercept) 2968.36 58.02 51.160 &lt;2e-16 ***districtBielany 17.38 84.16 0.207 0.836districtPraga 26.45 85.12 0.311 0.756districtUrsynow 42.01 82.65 0.508 0.611districtBemowo 80.10 83.71 0.957 0.339districtUrsus 102.01 82.25 1.240 0.215districtZoliborz 829.59 83.94 9.884 &lt;2e-16 ***districtMokotow 887.10 81.86 10.837 &lt;2e-16 ***districtOchota 987.93 84.16 11.738 &lt;2e-16 ***districtSrodmiescie 2214.39 83.28 26.591 &lt;2e-16 ***---Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
Residual standard error: 597.4 on 990 degrees of freedomMultiple R-squared: 0.5698,  Adjusted R-squared: 0.5659F-statistic: 145.7 on 9 and 990 DF, p-value: &lt; 2.2e-16

我们可以检验前5个地区价格，这是一个多重检验，我们将使用Fisher检验：



linHypo(reg, c("districtBielany = 0""districtPraga = 0""districtUrsynow = 0""districtBemowo = 0""districtUrsus = 0")Linear hypothesis test
Model 1: restricted modelModel 2: m2.price ~ district
Res.Df RSS Df Sum of Sq F Pr(&gt;F)1 995 3540517152 990 353269202 5 782513 0.4386 0.8217

Fisher的统计数据很低， p值为82％。



Linear hypothesis test
Model 1: restricted modelModel 2: m2.price ~ district
Res.Df RSS Df Sum of Sq F Pr(&gt;F)1 996 4054554092 990 353269202 6 52186207 24.374 &lt; 2.2e-16 ***---Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

我们将对前6种地区进行重组（并称A为地区重组）。如果我们看平均价格，按地区，我们得到



with(data = apar , boxplot( price ~ distr ))

我们再次开始，以最便宜的地区作为参考，我们想检验线性回归中接下来的两个地区的系数是否为零。






Linear hypothesis test
Model 1: restricted modelModel 2: m2.price ~ district
Res.Df RSS Df Sum of Sq F Pr(<F)1 997 3552925242 995 354051715 2 1240809 1.7435 0.1754

P为0.17，我们可以接受原假设。然后，我们有三组地区，名称分别为A，B和C。我们获得以下框线图




with(data = apart , boxplot( price ~ dist ))

因此，最终我们可以分类成三个不同的地区，如果目标是预测价格，则无需使用10类分类，而3类分类就足够了！

最受欢迎的见解

R语言Fisher检验探究地区间公寓价格的关系

热门文章

最新文章

相关课程

相关电子书

探索云世界

热门

云计算

大数据

云原生

人工智能

数据库

开发与运维

活动广场

任务中心

训练营

直播

乘风者计划

下载

镜像站

技术资料

R语言Fisher检验探究地区间公寓价格的关系

热门文章

最新文章

相关课程

相关电子书