Read.table into R

我想將一個文本文件讀入R，但是我遇到了第一列與列名和第一列數混合的問題。Read.table into R

數據文本文件

revenues  4118000000.0, 4315000000.0, 4512000000.0, 4709000000.0, 4906000000.0, 5103000000.0 
cost_of_revenue-1595852945.4985902, -1651829192.2662954, -1705945706.6237037, -1758202488.5708148, -1808599538.1076286, -1857136855.234145 
gross_profit 2522147054.5014095, 2663170807.7337046, 2806054293.376296, 2950797511.429185, 3097400461.892371, 3245863144.765855

R代碼裏面： data.predicted_values函數read.table =（「predicted_values.txt」，月= 「」）

輸出：

        V1   V2   V3   V4   V5   V6 
1  revenues  4118000000.0 4315000000 4512000000 4709000000 4906000000 5103000000 
2 cost_of_revenue-1595852945.4985902 -1651829192 -1705945707 -1758202489 -1808599538 -1857136855 
3 gross_profit 2522147054.5014095 2663170808 2806054293 2950797511 3097400462 3245863145

如何將第一列分爲兩部分？我的意思是我想要第一列V1是收入，cost_of_revenue，gross_profit。 V2是4118000000.0，-1595852945.4985902,2522147054.5014095。等等等等。

來源

2013-11-21 user3015546

您的意思是說，你的列是行，反之亦然？ –

你是說有一些空間分隔符而不是其他的？例如第2行。 – Maiasaura

這與@Dinin's的思路是一樣的，但是在第二行佔了負值。

TEXT <- readLines("predicted_values.txt") 
A <- gregexpr("[A-Za-z_]+", TEXT) 
B <- read.table(text = regmatches(TEXT, A, invert = TRUE)[[1]], sep = ",") 
C <- cbind(FirstCol = regmatches(TEXT, A)[[1]], B) 
C 
#   FirstCol   V1   V2   V3   V4   V5   V6 
# 1  revenues 4118000000 4315000000 4512000000 4709000000 4906000000 5103000000 
# 2 cost_of_revenue -1595852945 -1651829192 -1705945707 -1758202489 -1808599538 -1857136855 
# 3 gross_profit 2522147055 2663170808 2806054293 2950797511 3097400462 3245863145

來源

2013-11-21 04:37:34 A5C1D2H2I1M1N2O1R2T1

既然你沒有逗號btwn的rownames和你需要的值，將其添加回去：

txt <- "revenues  4118000000.0, 4315000000.0, 4512000000.0, 4709000000.0, 4906000000.0, 5103000000.0 
cost_of_revenue-1595852945.4985902, -1651829192.2662954, -1705945706.6237037, -1758202488.5708148, -1808599538.1076286, -1857136855.234145 
gross_profit 2522147054.5014095, 2663170807.7337046, 2806054293.376296, 2950797511.429185, 3097400461.892371, 3245863144.765855" 

Lines <- readLines(textConnection(txt)) 
    # replace textConnection(.) with `file = "predicted_values.txt"` 
res <- read.csv(text=sub("(^[[:alpha:][:punct:]]+)(\\s|-)" , 
               "\\1,", Lines) , 
      header=FALSE, row.names=1) 
res

的小數可能無法打印，但它們的存在。

來源

2013-11-21 03:27:10

您需要row.names參數read.table。然後你可以簡單地轉置你的數據：

data.predicted_values = read.table("predicted_values.txt", sep=",", row.names=1) 
data.predicted_values <- t(data.predicted_values)

來源

2013-11-21 03:27:33

我嘗試了兩種方法，但沒有奏效。我想將行名和數字數據分隔成兩個單獨的列。 – user3015546

您需要使用'sub'作爲我的例子。 –

啊我以爲數值是純粹因爲你沒有告訴它使用第一列作爲'row.names' –

回答

相關問題