Google索引：_escaped_fragment_不適用於主頁

我確實設置了我的網站（GWT）以供Google抓取。在使用上的谷歌Google網站站長工具的「取像谷歌」頁面上，我看到了以下的模式：Google索引：_escaped_fragment_不適用於主頁

訪問「http://www.mysite.com/#!AJAX_URL」是否正確重定向快照
但是，雖然我沒有設置在web.xml中

==>兩個與此相關的問題，谷歌不要求快照「http://www.mysite.com」：

是因爲Google網站管理員工具不夠聰明，但真正的機器人會正確地請求快照
我應該在web.xml或其他地方添加什麼東西嗎？

感謝，

雨果

來源

2012-09-03 Hugues

搜索了很多後，我找到了答案。這只是Googlebot抓取功能，它不會檢查元標記，而只是返回原始內容。當Google抓取並索引這些網頁時，他們會注意到這個元標記並採取相應的行動。

鏈接到答案就在這裏（見JohnMu評論）：

來源

2012-09-04 10:05:17 Hugues

確保您的 'robots.txt' 允許抓取器訪問：

User-agent: * 
Allow:/

此外，您可能希望提交一個Sitemap to Webmaster Tools。

這聽起來像快照正在正確服務。以防萬一，我張貼工作'index.php'的相關部分。靜態頁面位於'靜態/ $ {TOKEN} .html'

<!doctype html> 
<?php 

function static_url ($token) { return 'static/' . $token . '.html'; } 

$escaped_fragment = $_GET['_escaped_fragment_']; 

if (isset($escaped_fragment)) { 
    $fragment = preg_replace('/\//', '', $escaped_fragment); 
    $file = static_url($fragment); 

    if($escaped_fragment == '' || $escaped_fragment == '/' 
     || (! file_exists($file))) { 
    $fragment = '${DEFAULT_PLACE}:${DEFAULT_STATE}'; // your default place 
    $file = static_url($fragment); 
    } 
    $re = '/(^<[^>]*>)|(\n|\r\n|\t|\s{2,4})*/'; 

    $handle = fopen($file, 'r'); 
    if ($handle != false) { 
    $content = preg_replace($re, '', fread($handle, filesize($file))); 
    fclose($handle); 
    } 
    else { 
    $content = 'Page not found!'; 
    header(php_sapi_name() == 'cgi' ? 'Status: 404' : 'HTTP/1.1 404'); 
    } 
    echo $content; 
} else { ?> 

<html> ... Your GWT host page ... </html> 

<? } ?>

來源

2012-09-04 00:18:54

費利克斯您好，感謝您的反饋意見。試過你提出的建議，但沒有奏效。正如您在我自己的問題的答案中所看到的那樣，它是「不會檢查元標記」的「作爲googlebot獲取」功能。 – Hugues

Google索引：_escaped_fragment_不適用於主頁

回答

相關問題