我有一個矩陣「mat」,其中012編碼SNPs爲列,人爲行。例如:根據第二個數據集和拆分列重新編碼
> mat<-matrix(c("0","1","0","1","2","0","1","1","2"),3,byrow=T)
> rownames(mat)<-c("ID1","ID2","ID3")
> colnames(mat)<-c("rs123","rs333","rs9000")
> mat
rs123 rs333 rs9000
ID1 "0" "1" "0"
ID2 "1" "2" "0"
ID3 "1" "1" "2"
在一個不同的矩陣「MAT2」我有兩列(即主要和次要等位基因)和的SNP爲行相應的等位基因。
> mat2<-matrix(c("A","T","C","T","T","G"),3,byrow=T)
> rownames(mat2)<-c("rs123","rs333","rs9000")
> colnames(mat2)<-c("Allele_A","Allele_B")
> mat2
Allele_A Allele_B
rs123 "A" "T"
rs333 "C" "T"
rs9000 "T" "G"
現在我要重新編碼從第一矩陣012編碼的單核苷酸多態性是在兩列:他們應該是各自等位基因A有兩個新列,如果他們的代碼是零,A/B,如果它是一個和B/B如果是兩個。在我的例子中,我想獲得以下內容:
> mat3<-matrix(c("A","C","T","A","T","T","A","T","T","T","T","T","A","C","G","T","T","G"),3,byrow=T)
> rownames(mat3)<-c("ID1","ID2","ID3")
> colnames(mat3)<-c("rs123_1","rs333_1","rs9000_1","rs123_2","rs333_2","rs9000_2")
> mat3
rs123_1 rs333_1 rs9000_1 rs123_2 rs333_2 rs9000_2
ID1 "A" "C" "T" "A" "T" "T"
ID2 "A" "T" "T" "T" "T" "T"
ID3 "A" "C" "G" "T" "T" "G"
你能幫我實現嗎?先謝謝你!
完美,這爲我做了這份工作! – VGaertner