2010-09-15 127 views
1

我正在收集所有用戶的usb使用情況詳情,並將其轉換爲CSV文件,以便我可以將其導出到某個數據庫中。 輸入文本文件如下: -如何使用AWK將文本文件轉換爲csv

USB History Dump 
by nabiy (c)2008 
(1) --- Kingston DataTraveler 130 USB Device 
instanceID: 0018F3D974B4A9C0E1760896&0 
ParentIdPrefix: 7&b62e00e&2 
Last Mounted As: \DosDevices\I: 
Driver:{4D36E967-E325-11CE-BFC1-08002BE10318}\0033 
Disk Stamp: 09/07/2010 15:07 
Volume Stamp: 09/07/2010 15:07 
(2) --- Kingston DataTraveler 2.0 USB Device 
instanceID: 001D0F1E35B25B8C1201011B&0 
ParentIdPrefix: 7&1f5848f3&0 
Driver:{4D36E967-E325-11CE-BFC1-08002BE10318}\0035 
Disk Stamp: 09/06/2010 15:18 
Volume Stamp: 09/06/2010 15:18 
(3) --- Maxtor OneTouch III USB Device 
instanceID: 044303E5&0 
ParentIdPrefix: 
Driver:{4D36E967-E325-11CE-BFC1-08002BE10318}\0032 
Disk Stamp: 09/10/2010 10:09 
Volume Stamp: 03/12/2010 10:42 

我如何解析這個文件,這樣就可以在下面的格式獲得:

hostname Devic_name instanceID ParentPrefix LastMountedAs Driver 
pcname kingston xxxx xxxxxxxxx xxxxxxxxxx xxxxxxxx 
pcname maxtor 0440xxx 4 d 367 08/07/2010 xxxxxxxx 
pcname kingston xxxxxxx xxxxxxx xxxxxxxxx xxxxxxxx 

計算機名稱將從hostname命令服用。

所需的輸出是CSV格式的數據庫與一些批處理或awk腳本。 任何建議,非常感謝。

+2

告訴你的輸出格式不CSV。 – 2010-09-15 05:26:49

+0

它必須awk嗎? Perl會好嗎? – 2010-09-15 05:29:53

回答

0

使用Perl,這可以這樣處理,產生真正的CSV數據:

:從給定的數據文件輸出

use strict; 
use warnings; 

my @keys = ("Device_Name", "instanceID", "ParentIdPrefix", 
      "Last Mounted As", "Driver"); 

my %values =(); 
my $host = qx/hostname/; 
chomp $host; 

while (<>) 
{ 
    chomp; 
    next unless m/^\(\d+\) ---/ || m/^[\w ]+:/; 
    if (m/\(\d+\) --- (\w+)/) 
    { 
     dump_entry(\%values); 
     %values =(); 
     $values{Device_Name} = $1; 
    } 
    else 
    { 
     my($key,$value) = split /:/; 
     $value =~ s/^\s+//; 
     $value =~ s/\s+$//; 
     $values{$key} = $value if $value ne ""; 
    } 
} 
dump_entry(\%values); 

sub dump_entry 
{ 
    my($ref) = @_; 
    my(%values) = %$ref; 
    return if (scalar(keys %values) == 0); 
    print qq%"$host"%; 
    foreach my $key (@keys) 
    { 
     my $value = $values{$key} // "--none--"; 
     print qq%,"$value"%; 
    } 
    print "\n"; 
} 

"yourpcname","Kingston","0018F3D974B4A9C0E1760896&0","7&b62e00e&2","\DosDevices\I","{4D36E967-E325-11CE-BFC1-08002BE10318}\0033" 
"yourpcname","Kingston","001D0F1E35B25B8C1201011B&0","7&1f5848f3&0","--none--","{4D36E967-E325-11CE-BFC1-08002BE10318}\0035" 
"yourpcname","Maxtor","044303E5&0","--none--","--none--","{4D36E967-E325-11CE-BFC1-08002BE10318}\0032" 

請注意,與問題中的輸出數據不同,數據以讀取順序呈現。

0

Follwing老傳統,這裏是喬納森的代碼的awk版本:-)

cat tess |awk ' 
function cmd(E, A, this,v){ A[0]=0;while((E |getline v)>0)A[A[0]+=1]=v;A["RETURN_CODE"]=close(E);} 
# whatever cvs format you perfer. Here we used a traditional type, with 
# escape sequence when 0x22 or , is present. 
function cvs(s){gsub(",","\\,",s);gsub("\"","\\\"",s);return ((s)?"\""s"\"":"\"--none--\"");}; 
BEGIN{ 
    cmd("hostname",A);host=A[1]; 
    f=0; 
    n=0; 
    print "hostname Devic_name instanceID ParentPrefix LastMountedAs Driver ";# Header 
    while(1){ 
     while((getline r)>0){ 
      if(r~"^[(][0-9]*[)]"){n=1;break;} 
      if(r!~":")continue; 
      key = substr(r,match(r,"^[^:]*"),RLENGTH);sub("^:[ \t]*","",key); 
      match(r,"^[^:]*[:][ \t]*"); 
      value = substr(r,RSTART+RLENGTH);sub("[\t ]*$","",value); 
      A[key]=value; 
     } 
     if(f){ 
      print cvs(host)","cvs(A["devic_name"])","cvs(A["instanceID"])","cvs(A["ParentIdPrefix"])","cvs(A["Last Mounted As"])","cvs(A["Driver"]); 
      delete A; 
     } 
     if(!n)break; 
     if(n)n=0; 
     f=1; 
     sub("^[(][0-9]*[)][ \t]*---[ \t]*","",r); 
     sub("[ ]*USB Device[ ]*$","",r); 
     A["devic_name"] = r; 
     continue; 
    } 
}' 

例輸出follwing

hostname Devic_name instanceID ParentPrefix LastMountedAs Driver 
"host","Kingston DataTraveler 130","0018F3D974B4A9C0E1760896&0","7&b62e00e&2","\DosDevices\I:","{4D36E967-E325-11CE-BFC1-08002BE10318}\0033" 
"host","Kingston DataTraveler 2.0","001D0F1E35B25B8C1201011B&0","7&1f5848f3&0","--none--","{4D36E967-E325-11CE-BFC1-08002BE10318}\0035" 
"host","Maxtor OneTouch III","044303E5&0","--none--","--none--","{4D36E967-E325-11CE-BFC1-08002BE10318}\0032" 
相關問題