2013-07-24 78 views
2

我試圖通過機械化登錄到一個網站,我填寫了所有的表格,然後提交,但每次我嘗試它,我最終都在同一頁面上,所以要麼我被重定向回來(沒有錯誤信息),或者表單沒有提交。這是爲什麼?機械化不提交表格

代碼

require 'mechanize' 
class Scraper 

    def initialize 
    @a = Mechanize.new { |agent| 
     agent.follow_meta_refresh = true 
    } 
    end 

    def login 


    @a.get("https://login.salesforce.com/") do |page| 
     red = page.forms.first do |form| 
     form.field_with(:type => "email").value = username 
     form.field_with(:type => "password").value = password 
     end.click_button 
     pp red 
     #puts main.title 
    end 
    end 
end 
s = Scraper.new 
s.login 

初始頁:

#<Mechanize::Page 
    {url #<URI::HTTPS:0x007f8f39d6fb30 URL:https://login.salesforce.com/>} 
    {meta_refresh} 
    {title "salesforce.com - Customer Secure Login Page"} 
    {iframes 
    #<Mechanize::Page::Frame 
    "marketing" 
    "https://www.salesforce.com/login-messages/messages.html?noroundedcorner">} 
    {frames} 
    {links 
    #<Mechanize::Page::Link "Salesforce" "http://www.salesforce.com"> 
    #<Mechanize::Page::Link 
    "Forgot your password?" 
    "/secur/forgotpassword.jsp?locale=us"> 
    #<Mechanize::Page::Link 
    "Sign up for free." 
    "https://www.salesforce.com/form/trial/freetrial.jsp?d=70130000000Enus">} 
    {forms 
    #<Mechanize::Form 
    {name "login"} 
    {method "POST"} 
    {action "https://login.salesforce.com/"} 
    {fields 
     [hidden:0x3fc79cec63ac type: hidden name: un value: ] 
     [hidden:0x3fc79cec6244 type: hidden name: width value: ] 
     [hidden:0x3fc79cec60c8 type: hidden name: height value: ] 
     [hidden:0x3fc79cec5efc type: hidden name: hasRememberUn value: true] 
     [hidden:0x3fc79cec5d58 type: hidden name: startURL value: ] 
     [hidden:0x3fc79cec5bc8 type: hidden name: loginURL value: ] 
     [hidden:0x3fc79cec5a38 type: hidden name: loginType value: ] 
     [hidden:0x3fc79cec987c type: hidden name: useSecure value: true] 
     [hidden:0x3fc79cec969c type: hidden name: local value: ] 
     [hidden:0x3fc79cec9520 type: hidden name: lt value: standard] 
     [hidden:0x3fc79cec9340 type: hidden name: qs value: ] 
     [hidden:0x3fc79cec9174 type: hidden name: locale value: ] 
     [hidden:0x3fc79cec8f80 type: hidden name: oauth_token value: ] 
     [hidden:0x3fc79cec8db4 type: hidden name: oauth_callback value: ] 
     [hidden:0x3fc79cec8be8 type: hidden name: login value: ] 
     [hidden:0x3fc79cec89cc type: hidden name: serverid value: ] 
     [hidden:0x3fc79cec8814 type: hidden name: display value: page] 
     [field:0x3fc79cec8670 type: email name: username value: ] 
     [field:0x3fc79cec84e0 type: password name: pw value: ]} 
    {radiobuttons} 
    {checkboxes 
     [checkbox:0x3fc79cec833c type: checkbox name: rememberUn value: ]} 
    {file_uploads} 
    {buttons [button:0x3fc79cecac2c type: name: Login value: ]}>}> 

最後一頁:

#<Mechanize::Page 
{url #<URI::HTTPS:0x007f9d1d250960 URL:https://login.salesforce.com/>} 


{meta_refresh} 
    {title "salesforce.com - Customer Secure Login Page"} 
    {iframes 
    #<Mechanize::Page::Frame 
    "marketing" 
    "https://www.salesforce.com/login-messages/messages.html? r=https%3A%2F%2Flogin.salesforce.com%2F&noroundedcorner">} 
    {frames} 
    {links 
    #<Mechanize::Page::Link "Salesforce" "http://www.salesforce.com"> 
    #<Mechanize::Page::Link 
    "Forgot your password?" 
    "/secur/forgotpassword.jsp?locale=us"> 
    #<Mechanize::Page::Link 
    "Sign up for free." 
    "https://www.salesforce.com/form/trial/freetrial.jsp?d=70130000000Enus">} 
    {forms 
    #<Mechanize::Form 
    {name "login"} 
    {method "POST"} 
    {action "https://login.salesforce.com/"} 
    {fields 
     [hidden:0x3fce8e93aaac type: hidden name: un value: ] 
     [hidden:0x3fce8e93a8a4 type: hidden name: width value: ] 
     [hidden:0x3fce8e93a638 type: hidden name: height value: ] 
     [hidden:0x3fce8e93a390 type: hidden name: hasRememberUn value: true] 
     [hidden:0x3fce8e93a19c type: hidden name: startURL value: null] 
     [hidden:0x3fce8e939f58 type: hidden name: loginURL value: null] 
     [hidden:0x3fce8e939cc4 type: hidden name: loginType value: ] 
     [hidden:0x3fce8e9399a4 type: hidden name: useSecure value: true] 
     [hidden:0x3fce8e93979c type: hidden name: local value: ] 
     [hidden:0x3fce8e939648 type: hidden name: lt value: standard] 
     [hidden:0x3fce8e93d414 type: hidden name: qs value:r=https%3A%2F%2Flogin.salesforce.com%2F] 
     [hidden:0x3fce8e93d284 type: hidden name: locale value: ] 
     [hidden:0x3fce8e93d0cc type: hidden name: oauth_token value: ] 
     [hidden:0x3fce8e93cf50 type: hidden name: oauth_callback value: ] 
     [hidden:0x3fce8e93cd98 type: hidden name: login value: ] 
     [hidden:0x3fce8e93cc44 type: hidden name: serverid value: ] 
     [hidden:0x3fce8e93cab4 type: hidden name: display value: page] 
     [field:0x3fce8e93c780 type: email name: username value: ] 
     [field:0x3fce8e93c4c4 type: password name: pw value: ]} 
    {radiobuttons} 
    {checkboxes 
     [checkbox:0x3fce8e93c334 type: checkbox name: rememberUn value: ]} 
    {file_uploads} 
    {buttons [button:0x3fce8e93b9d4 type: name: Login value: ]}>}> 

什麼是錯我的代碼?

回答

1

該網站使用Javascript來處理登錄,機械化無法處理。您可以使用類似Selenium的東西訪問該網站。

+1

[watir](http://watir.com/)和[watir-webdriver](http://watirwebdriver.com/)是其他選擇。 – orde

+1

或者你可以弄清楚如何做到這一點。 – pguardiario

+0

具體來說,在javascript進行修改之後,找出POST請求中發送的內容。如果它不是動態的,那麼javascript不需要執行。機械化應該在這種情況下正常工作。 –

0

在有隱藏字段unwidthheight還需要包含用戶名和一些數字,除了包含用戶名的username場形式。

[hidden:0x3fce8e93aaac type: hidden name: un value: ] 
    [hidden:0x3fce8e93a8a4 type: hidden name: width value: ] 
    [hidden:0x3fce8e93a638 type: hidden name: height value: ] 

您可以使用Chrome的督察網絡選項卡下的監控,實際上被髮送到服務器的請求的內容(啓用了「保留日誌」選項),它就會由JavaScript修改後。