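I have a statistics problem that I would like to solve with R. Suppose I have two indices: Index1 describes the average price level over time, and Index2 describes the average rent level over time. I want to calculate the ratio of the two indices and its average in R.
Here is my data (frame):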
Year Index1 Index2
1995 100 77.0033
1996 106.63 79.3342
1997 110.45 81.8608
1998 114.4 84.0633
1999 121.75 86.1133
2000 130.59 88.7758
2001 148.85 91.4483
2002 161.43 93.9042
2003 179.39 95.57
2004 204.59 97.1075
2005 227.58 99.9995
2006 253.17 102.2792
2007 277.45 104.0525
2008 276.42 107.1633
2009 261.26 109.8667
2010 280.81 111.9058
2011 295.91 114.0408
2012 306.63 115.56
2013 NA 117.2691
2014 NA 118.2967
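Edit: I want to calculate the average price-to-rent ratio, in other words the long-run average of Index1 / Index2. After that I want to calculate, for each year, the percentage difference between that year's ratio and the average. How can I do this?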
Best regards, Giles
Edit: here is dput(DF)
structure(list(Year = c(1995, 1996, 1997, 1998, 1999, 2000, 2001,
2002, 2003, 2004, 2005, 2006, 2007, 2008, 2009, 2010, 2011, 2012,
2013, 2014), Price = c("100", "106.63", "110.45", "114.4", "121.75",
"130.59", "148.85", "161.43", "179.39", "204.59", "227.58", "253.17",
"277.45", "276.42", "261.26", "280.81", "295.91", "306.63", "NA",
"NA"), Rent = c(77.0033, 79.3342, 81.8608, 84.0633, 86.1133,
88.7758, 91.4483, 93.9042, 95.57, 97.1075, 99.9995, 102.2792,
104.0525, 107.1633, 109.8667, 111.9058, 114.0408, 115.56, 117.2691,
118.2967)), .Names = c("Year", "Price", "Rent"), row.names = c(NA,
-20L), class = "data.frame")
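The calculation described in the edit could be sketched as follows. This is only one possible reading of the question: it assumes that "long-run average" means the plain arithmetic mean of the yearly Price / Rent ratio over the years where both values are present, and that Price and Rent in the dput output correspond to Index1 and Index2 in the table. The column names Ratio and PctDiff are made up for illustration. Note that Price is stored as character in the dput output, so it has to be converted to numeric first.

```r
# Rebuild the data frame with the same columns as the dput() output above.
DF <- data.frame(
  Year  = 1995:2014,
  Price = c("100", "106.63", "110.45", "114.4", "121.75", "130.59", "148.85",
            "161.43", "179.39", "204.59", "227.58", "253.17", "277.45",
            "276.42", "261.26", "280.81", "295.91", "306.63", "NA", "NA"),
  Rent  = c(77.0033, 79.3342, 81.8608, 84.0633, 86.1133, 88.7758, 91.4483,
            93.9042, 95.57, 97.1075, 99.9995, 102.2792, 104.0525, 107.1633,
            109.8667, 111.9058, 114.0408, 115.56, 117.2691, 118.2967),
  stringsAsFactors = FALSE
)

# Price is character: turn the literal string "NA" into a real missing
# value, then convert the column to numeric.
DF$Price <- as.numeric(replace(DF$Price, DF$Price == "NA", NA))

# Yearly price-to-rent ratio; years with a missing price stay NA.
DF$Ratio <- DF$Price / DF$Rent

# Assumed definition of the long-run average: mean over all non-missing years.
long_run_mean <- mean(DF$Ratio, na.rm = TRUE)

# Percentage difference of each year's ratio from the long-run average.
DF$PctDiff <- (DF$Ratio - long_run_mean) / long_run_mean * 100

print(DF[, c("Year", "Ratio", "PctDiff")])
```

With na.rm = TRUE the two years without a price (2013 and 2014) are simply left out of the average, and their Ratio and PctDiff stay NA.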
If you want a new column with Index2 rebased to start at 100, you can do df['index2_base100'] <- df$index2 / df$index2[1] * 100 – agenis 2015-02-10 15:35:02
@agenis Wow! Thanks for the tip. I always do this in a more complicated way :) But what if there are NA values? – 2015-02-10 15:36:38
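On the NA question, a rough sketch: arithmetic in R propagates NA element by element, so dividing by the first value only becomes a problem when the first value itself is missing (then every result is NA). One way around that, assuming you want the first non-missing observation as the base, is shown below. The helper name rebase100 is hypothetical, and the short price vector is just a few values from the table with NAs placed at the ends for illustration.

```r
# Hypothetical helper: rebase a series so that its first non-missing
# value equals 100. Missing entries stay NA in the result.
rebase100 <- function(x) {
  base <- x[which(!is.na(x))[1]]  # first non-missing value
  x / base * 100
}

# Illustrative vector: values taken from the table, NAs added at both ends.
price <- c(NA, 106.63, 110.45, NA)
rebased_price <- rebase100(price)
print(rebased_price)

# A series without missing values behaves like the one-liner above.
rent <- c(77.0033, 79.3342, 81.8608)
rebased_rent <- rebase100(rent)
print(rebased_rent)
```

If the whole series is NA, the helper returns all NA rather than raising an error.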
What do you mean by the "long-run" ratio? It would help if you posted the expected result so we can understand better – agenis 2015-02-10 16:04:42