nekohtml 用法
生活随笔
收集整理的這篇文章主要介紹了
nekohtml 用法
小編覺得挺不錯的,現在分享給大家,幫大家做個參考.
//nekohtml結合xpath用法
DOMParser parser = new DOMParser();
try {
//設置網頁的默認編碼
parser.setProperty("http://cyberneko.org/html/properties/default-encoding","gb2312");
/*The Xerces HTML DOM implementation does not support namespaces
and cannot represent XHTML documents with namespace information.
Therefore, in order to use the default HTML DOM implementation with NekoHTML's
DOMParser to parse XHTML documents, you must turn off namespace processing.*/
parser.setFeature("http://xml.org/sax/features/namespaces", false);
String strURL = "http://product.dangdang.com/product.aspx?product_id=9317290";
BufferedReader in = new BufferedReader(
new InputStreamReader(
new URL(strURL).openStream()));
parser.parse(new InputSource(in));
in.close();
} catch (Exception e) {
e.printStackTrace();
}
Document doc = parser.getDocument();
// tags should be in upper case
String productsXpath = "/HTML/BODY/DIV[2]/DIV[4]/DIV[2]/DIV/DIV[3]/UL[@class]/LI[9]";
NodeList products;
try {
products = XPathAPI.selectNodeList(doc, productsXpath);
System.out.println("found: " + products.getLength());
Node node = null;
for(int i=0; i< products.getLength();i++)
{
node = products.item(i);
System.out.println( i + ":\n" + node.getTextContent());
}
}catch (TransformerException e) {
e.printStackTrace();
}
總結
以上是生活随笔為你收集整理的nekohtml 用法的全部內容,希望文章能夠幫你解決所遇到的問題。
- 上一篇: [html] 能否做到禁止打印页面?如
- 下一篇: 狼叔直播 Reaction《学习指北:N