2011-10-02 103 views
0

所以我正在騎自行車瀏覽41段文件。對於每個段落,我試圖[1]首先將字符串分解成數組,然後獲取段落的詞頻。然後,我想結合所有段落的數據並獲取整個文檔的詞頻。合併數組和字頻

我能夠獲得數組,它給出了給定段落的「單詞」及其「頻率」,但我無法合併每段的結果以獲得「整個詞頻」 。文檔以下是我有:

function sectionWordFrequency($sectionFS) 
{ 
$section_frequency = array(); 
$filename = $sectionFS . ".xml"; 
$xmldoc = simplexml_load_file('../../editedtranscriptions/' . $filename); 
$xmldoc->registerXPathNamespace("tei", "http://www.tei-c.org/ns/1.0"); 
$paraArray = $xmldoc->xpath("//tei:p"); 

foreach ($paraArray as $p) 
{ 
$para_frequency = (array_count_values(str_word_count(strtolower($p), 1))); 
$section_frequency[] = $para_frequency; 
} 


return array_merge($section_frequency); 
} 

/// now I call the function, sort it, and try to display it 
$section_frequency = sectionWordFrequency($fs); 
ksort($section_frequency); 

foreach ($section_frequency as $word=>$frequency) 
{ 
echo $word . ": " . $frequency . "</br>"; 
} 

現在的結果我得到的是:

1:陣列 2:陣列 3:數組 4:數組

任何幫助是很大的欣賞d。

回答

0

嘗試用這種

$section_frequency = array_merge($section_frequency, $para_frequency); 

替換該行

$section_frequency[] = $para_frequency; 

然後

return $section_frequency 
+0

這是很好的和聰明。我認爲它工作。謝謝 – Jeff

+0

很高興爲您提供幫助。 –