2016-10-19 125 views
1

我想製作美國的熱圖,代碼「有效」,但數據不會填充到美國地圖中 - 只有州的名稱顯示 - 我也想填充,但美國的實際地圖是最重要的。ggplot不會填充美國地圖

這裏是我的代碼: #閱讀在我的數據 rawdata_path < - C( 「〜/ R/heatdata.xlsx」)

# Import the data into RStudio: 
rawdata <- readxl::read_excel(rawdata_path, sheet = 1, col_names = TRUE) 

#clean up the data 
rawdata$Lattitude <- as.numeric(rawdata$Lattitude) 
rawdata$Longitude <- as.numeric(rawdata$Longitude) 

#plot the data 
library(ggplot2) 
library(maps) 
ggplot(rawdata, aes(x=Longitude, y = Lattitude, group = rawdata$State))+ 
    geom_polygon(aes(fill=FinalCount))+ 
    geom_path()+ 
    geom_text(data = rawdata, aes(x=Longitude, y = Lattitude, label = State))+ 
    scale_fill_gradientn(colours = rev(heat.colors(10)), na.value = "grey90")+ 
    coord_map() 

這是數據幀的頭叫什麼, RAWDATA的樣子,我想狀態由Count柱填充 - 1爲最輕,10爲暗紅色:

State  Count Group Lattitude Longitude  Rev 
<chr>  <dbl> <dbl>  <dbl>  <dbl>  <dbl> 
arizona  1.0  1 33.50000 -112.05000  0.0 
arkansas  1.0  2 36.36000 -94.20000  0.0 
georgia  1.0  3 33.82000 -84.32000  0.0 
hawaii   1.0  4 21.30000 -157.85000  0.0 
kansas   1.0  5 38.97167 -95.23525  0.0 
maryland  1.0  6 38.98000 -77.08000  0.0 
missouri  1.0  7 39.09000 -94.58000  0.0 
oregon   1.0  8 45.51000 -122.68000  0.0 
pennsylvania 1.0  9 40.43000 -79.97000  0.0 
rhode island 1.0 10 41.82000 -71.41000  0.0 
tennessee  1.0 11 35.10000 -90.00000  0.0 
texas   1.0 12 29.76043 -95.36980  0.0 
louisiana  2.1 13 30.44000 -91.12000 209250.0 
indiana  2.3 14 38.30000 -85.72000 231605.9 
oklahoma  2.7 15 35.22000 -97.34000 274377.9 
michigan  3.8 16 42.73000 -84.48000 381528.5 
florida  4.9 17 30.43826 -84.28073 498338.5 
california  5.1 18 34.06000 -118.24000 511472.0 
illinois  5.3 19 41.83000 -87.68000 537913.5 
kentucky  5.7 20 38.22000 -85.74000 562077.0 
new york  6.3 21 40.75000 -73.99000 630642.8 
massachusetts 9.5 22 42.33038 -71.16619 908952.0 
north carolina 10.0 23 36.07000 -79.82000 1571923.8 

誰能幫助我在這?

回答

1

下面的代碼將加載並繪製地圖,美國從「RAWDATA」的座標和文本:

# Load the data 
rawdata_path <- 'C:/data.xlsx' 
rawdata <- readxl::read_excel(rawdata_path, sheet = 1, col_names = TRUE) 

#clean up the data 
rawdata$Lattitude <- as.numeric(rawdata$Lattitude) 
rawdata$Longitude <- as.numeric(rawdata$Longitude) 

library(ggplot2) 
library(maps) 
# Load the map of the United State 
all_states <- map_data("state") 

ggplot() + 
    geom_polygon(data=all_states, aes(x=long, y=lat, group = group), 
      colour="white", fill="blue") + 
    geom_point(data=rawdata, 
     aes(x=Longitude, y=Lattitude, colour='red', size=Count), 
     alpha=I(0.5)) + 
    geom_text(data = rawdata, aes(x=Longitude, y = Lattitude, label = State)) + 
    scale_fill_gradientn(colours = rev(heat.colors(10)), na.value = "grey90") 

運行它給出了這樣的地圖:

enter image description here

+0

謝謝!我使用的數據是上面的數據框。這是我正在閱讀的名爲heatdata的電子表格。 –

+0

我複製了你的Excel文件並更新了答案。 –

2

如果你正試圖用這些數據生成一張等值線地圖,這很容易與choroplethr。首先,你需要你的狀態列重命名爲「區域」和計數列作爲「價值」

所以輸入數據的樣子:

df <- structure(list(region = structure(c(1L, 2L, 5L, 6L, 9L, 12L, 
15L, 19L, 20L, 21L, 22L, 23L, 11L, 8L, 18L, 14L, 4L, 3L, 7L, 
10L, 16L, 13L, 17L), .Label = c("arizona", "arkansas", "california", 
"florida", "georgia", "hawaii", "illinois", "indiana", "kansas", 
"kentucky", "louisiana", "maryland", "massachusetts", "michigan", 
"missouri", "new york", "north carolina", "oklahoma", "oregon", 
"pennsylvania", "rhode island", "tennessee", "texas"), class = "factor"), 
value = c(1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 2.1, 2.3, 2.7, 
3.8, 4.9, 5.1, 5.3, 5.7, 6.3, 9.5, 10), Group = 1:23), .Names = c("region", 
"value", "Group"), class = "data.frame", row.names = c(NA, -23L 
)) 

該腳本很簡單:

library(ggplot2) 
library(choroplethr) 

choro <- state_choropleth(df) + scale_fill_brewer(palette = "Reds") 
choro 

enter image description here

+0

這真是太好了,正是我在找的東西!非常感謝!我一定會利用更多的這個包。 –

+0

有一個問題 - 我不明白「L」來自數據幀的結構部分。你是如何得到這些數字的,他們爲什麼會這樣呢? –

+0

@DanaOrinick這是'dput'函數的輸出,我試圖複製您發佈的數據的重要部分。 L後綴表示一個整數。爲了腳本的目的,它們並不真正相關。您只需更改列名稱並確保狀態爲小寫。請參閱:https://cran.r-project.org/web/packages/choroplethr/vignettes/b-state-choropleth.html – mpschramm