2013-09-30 27 views
1

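How do I use a regular expression in an awk script?

Hi, I'm trying to write a script that gives me a summary of newsgroup activity. Most of it works so far, except when I try to use the match operator to check whether field $6 matches an expression. I want to group all the ring hosts under a single section. This is what my script looks like: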

newsread.awk:

BEGIN{ 
print "\t\t\tNews Reader Summary\n\n" 
printf("    %-15s%-15s%-15s%-15s\n\n","lonestar","runner","ringer","rings"); 
articles[4]; 
groups[4]; 
times[4]; 
cs2413[4];cs2413d[4]; 
} 

NR == 1 {date1 = $1 " " $2 " " $3} 

$6 == "lonestar.jpl.utsa.edu"{ 
    if ($7=="group"){ 
     articles[1]+=$9; 
     if ($8=="utsa.cs.2413"){ 
      cs2413[1]+=$9; 
     } 
     if ($8=="utsa.cs.2413.d"){ 
      cs2413d[1]+=$9; 
     } 
    }else if ($7 == "exit"){ 
     articles[1]+=$9; 
     groups[1]+=$11; 
    }else { 
     times[1]+=$13; 
    } 
} 

$6 == "runner.jpl.utsa.edu"{ 
    if ($7=="group"){ 
       articles[2]+=$9; 
     if ($8=="utsa.cs.2413"){ 
         cs2413[2]+=$9; 
       } 
       if ($8=="utsa.cs.2413.d"){ 
         cs2413d[2]+=$9; 
       } 

     }else if ($7 == "exit"){ 
       articles[2]+=$9; 
       groups[2]+=$11; 
     }else { 
       times[2]+=$13; 
     } 

} 

$6 == "ringer.cs.utsa.edu"{ 
    if ($7=="group"){ 
       articles[3]+=$9; 
     if ($8=="utsa.cs.2413"){ 
         cs2413[3]+=$9; 
       } 
       if ($8=="utsa.cs.2413.d"){ 
         cs2413d[3]+=$9; 
       } 

     }else if ($7 == "exit"){ 
       articles[3]+=$9; 
       groups[3]+=$11; 
     }else { 
       times[3]+=$13; 
     } 

} 

$6 ~ "/ring??.cs.utsa.edu/"{ 
    if ($7=="group"){ 
       articles[4]+=$9; 
     if ($8=="utsa.cs.2413"){ 
         cs2413[4]+=$9; 
       } 
       if ($8=="utsa.cs.2413.d"){ 
         cs2413d[4]+=$9; 
       } 

     }else if ($7 == "exit"){ 
       articles[4]+=$9; 
       groups[4]+=$11; 
     }else { 
       times[4]+=$13; 
     } 

} 
END{ 
    date2 = $1 " " $2 " " $3 
    printf("Articles:  %-15d%-15d%-15d%-15d\n",articles[1],articles[2],articles[3],articles[4]); 
    printf("Groups:  %-15d%-15d%-15d%-15d\n",groups[1],groups[2],groups[3],groups[4]); 
    printf("Cs2413:  %-15d%-15d%-15d%-15d\n",cs2413[1],cs2413[2],cs2413[3],cs2413[4]); 
    printf("Cs2413.d:  %-15d%-15d%-15d%-15d\n",cs2413d[1],cs2413d[2],cs2413d[3],cs2413d[4]); 
    printf("User Time:  %-15d%-15d%-15d%-15d\n",times[1],times[2],times[3],times[4]); 
    printf("\nStart Time = %s\tEnd Time = %s\n",date1,date2); 

} 

This is what a snippet of news.notice looks like:

Feb 13 21:27:14 ringer nnrpd[11474]: lonestar.jpl.utsa.edu group alt.education.distance 19 
Feb 13 21:27:14 ringer nnrpd[11474]: lonestar.jpl.utsa.edu exit articles 19 groups 1 
Feb 13 21:27:14 ringer nnrpd[11474]: lonestar.jpl.utsa.edu times user 0.470 system 0.930 elapsed 4.766 
Feb 13 21:27:49 ringer nnrpd[11462]: ring42.cs.utsa.edu exit articles 0 groups 2 
Feb 13 21:27:49 ringer nnrpd[11462]: ring42.cs.utsa.edu times user 2.020 system 1.430 elapsed 45.114 
Feb 13 21:28:00 ringer nnrpd[11482]: lonestar.jpl.utsa.edu group utsa.lonestar 7 
Feb 13 21:28:00 ringer nnrpd[11482]: lonestar.jpl.utsa.edu exit articles 7 groups 1 
Feb 13 21:28:00 ringer nnrpd[11482]: lonestar.jpl.utsa.edu times user 0.520 system 0.890 elapsed 48.286 
Feb 13 21:28:38 ringer innd: ME running 
Feb 13 21:28:43 ringer nnrpd[11344]: lonestar.jpl.utsa.edu unrecognized NOOP 
Feb 13 21:29:01 ringer nnrpd[11601]: lonestar.jpl.utsa.edu connect 
Feb 13 21:29:01 ringer nnrpd[11601]: lonestar.jpl.utsa.edu exit articles 0 groups 0 
Feb 13 21:29:01 ringer nnrpd[11601]: lonestar.jpl.utsa.edu times user 0.470 system 0.770 elapsed 1.456 
Feb 13 21:29:03 ringer nnrpd[11602]: lonestar.jpl.utsa.edu connect 
Feb 13 21:29:03 ringer nnrpd[11472]: ring29.cs.utsa.edu exit articles 0 groups 0 
Feb 13 21:29:03 ringer nnrpd[11472]: ring29.cs.utsa.edu times user 1.360 system 0.790 elapsed 114.771 
Feb 13 21:29:03 ringer nnrpd[11602]: lonestar.jpl.utsa.edu exit articles 0 groups 0 
Feb 13 21:29:03 ringer nnrpd[11602]: lonestar.jpl.utsa.edu times user 0.530 system 0.650 elapsed 1.524 
Feb 13 21:29:25 ringer nnrpd[11615]: lonestar.jpl.utsa.edu connect 

I run it with this command:

awk -f newsread.awk news.notice > newsread.summary 

Here is newsread.summary:

  News Reader Summary 


    lonestar       runner         ringer         rings

Articles:  144686         25066          2              0
Groups:  5282           8344           19             0
Cs2413:  0              0              0              0
Cs2413.d:  40             25             0              0
User Time:  266197         83377          128            0

Start Time = Feb 13 21:27:14 End Time = Feb 14 20:56:49 

It has to be an awk script.

+0

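The "declarations" `articles[4]; groups[4]; times[4]; cs2413[4]; cs2413d[4];` are unnecessary and probably don't do what you think they do. In all variants of awk there is no need to declare arrays, nor is there any mechanism for doing so. Arrays spring into existence automatically when they are indexed (or, in some awk implementations, when the script is parsed). – rici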

+0

That's good to know, thanks! I'll get rid of them. – MeesterMarcus
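To illustrate the point about "declarations", here is a minimal sketch (not part of the original script; the value 5 is made up). It suggests that a bare reference such as `articles[4]` does not reserve four slots; it just creates one empty element with index 4. It also shows that an array needs no declaration before its first use.

```shell
out=$(awk 'BEGIN {
    # Referencing an element, even as a bare statement, creates it.
    articles[4]
    n = 0
    for (k in articles) n++
    print "elements after articles[4]:", n

    # No declaration needed: the array appears on first assignment.
    groups[1] += 5
    print "groups[1] =", groups[1]
}')
printf '%s\n' "$out"
```

Uninitialized elements behave as 0 in numeric context, so the `+=` accumulators in the script work without any setup in BEGIN.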

Answers

2

First, get rid of the quotes, i.e. not this:

$6 ~ "/ring??.cs.utsa.edu/" 

but this:

$6 ~ /ring??.cs.utsa.edu/ 

Quotes delimit strings; slashes delimit constant REs.
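A small sketch of that difference, using two made-up input lines: when a quoted string appears on the right-hand side of `~`, awk treats its contents as the regular expression, so the slashes inside the quotes become literal characters that must appear in the field.

```shell
out=$(printf '%s\n' 'ring42.cs.utsa.edu' '/ring/' |
awk '
    # Regex constant: matches any field containing "ring".
    $1 ~ /ring/   { print "regex constant matched: " $1 }
    # Quoted string: the regex is /ring/ including both slashes.
    $1 ~ "/ring/" { print "quoted string matched: " $1 }
')
printf '%s\n' "$out"
```

The host name is only reported by the first rule. The quoted form matches only the line that literally contains `/ring/`, which is why the "rings" column in the question stayed at 0.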

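Now, I suspect your RE is wrong, because ?? means zero or one repetition of the preceding character, followed by either the same again or a literal question mark (I'm not sure which; neither makes sense here), while . means "any single character". This is a regular expression, not shell globbing: different metacharacters with different meanings.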
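As a rough illustration of the "any single character" point, the sketch below compares an unescaped dot with an escaped one. The second and third host names are invented for the demonstration and do not come from the log.

```shell
out=$(printf '%s\n' 'ring42.cs.utsa.edu' 'ring42xcs.utsa.edu' 'ring4.cs.utsa.edu' |
awk '{
    # Unescaped dots: each "." accepts any character.
    loose  = ($1 ~ /^ring...cs.utsa.edu$/)    ? "yes" : "no"
    # Escaped dots: "\." accepts only a literal dot.
    strict = ($1 ~ /^ring..\.cs\.utsa\.edu$/) ? "yes" : "no"
    print $1, loose, strict
}')
printf '%s\n' "$out"
```

The columns are host, unescaped result, escaped result. The invented `ring42xcs.utsa.edu` passes the unescaped pattern but not the escaped one, and `ring4.cs.utsa.edu` fails both because exactly two characters are required after `ring`.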

You probably want this:

$6 ~ /^ring..\.cs\.utsa\.edu$/ 
+1

Done! And you're right, the regex you provided works just the way I wanted. Thanks! – MeesterMarcus
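One thing that may be worth checking: `ring..` also matches `ringer`, because `er` counts as two arbitrary characters. Records from ringer.cs.utsa.edu would then be counted in both the "ringer" and the "rings" columns. The sketch below compares the pattern above with a digit-only variant. That variant assumes the ring hosts are always named `ring` plus two digits, which is a guess based on the `ring42` and `ring29` entries in the log snippet.

```shell
out=$(printf '%s\n' 'ring42.cs.utsa.edu' 'ring29.cs.utsa.edu' \
                    'ringer.cs.utsa.edu' 'lonestar.jpl.utsa.edu' |
awk '{
    # Any two characters after "ring" (the pattern from the answer).
    any_two = ($1 ~ /^ring..\.cs\.utsa\.edu$/)           ? "yes" : "no"
    # Exactly two digits after "ring" (assumed naming scheme).
    digits  = ($1 ~ /^ring[0-9][0-9]\.cs\.utsa\.edu$/)   ? "yes" : "no"
    print $1, any_two, digits
}')
printf '%s\n' "$out"
```

If the ring hosts can have other suffixes, the digit class would need adjusting. The alternative is to keep `ring..` and accept the overlap with ringer.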

1

Lose the double quotes.

$6 ~ /regex/

not

$6 ~ "/regex/"