我在想如何區分無用信息與鏈接jsoup
。 一串代碼,我應該在這裏解析:用Jsoup獲取具體網址
view-source:https://vk.com/search?c%5Bq%5D=%D0%BA%D0%BE%D1%82&c%5Bsection%5D=communities
public class TestSoup {
public static void main (String[] args) throws Exception {
Document doc = Jsoup.connect("https://vk.com/smcat").get();
Elements links;
//links = doc.select("div > a > img ");
links = doc.select("[data-src_big]");
System.out.println(links);
}
}
我現在輸出:
<img src="https://pp.vk.me/c636126/v636126727/35e1b/ludjlj7T4i8.jpg" class="ph_img" data-id="-23530818_436648332" data-src_big="https://pp.vk.me/c636126/v636126727/35e1c/a1IyGrtjzUQ.jpg|600|448">
有人能解釋我如何從我的輸出中提取第二個鏈接?非常感謝。
你需要提取這個'data-src_big =「https://pp.vk.me/c636126/v636126727/35e1c/a1IyGrtjzUQ.jpg|600|448」' – Joe