Parsing using HTMLParser_问答_开发者_运维开发者技术经验分享

开发者 https://www.devze.com 2023-04-02 00:06 出处：网络

Parser parser = new Parser(); parser.setInputHTML(\"d:/index.html\"); parser.setEncoding(\"UTF-8\"); NodeList nl = parser.parse(null);

相关专题：

Parser parser = new Parser();
    parser.setInputHTML("d:/index.html");
    parser.setEncoding("UTF-8");
    NodeList nl = parser.parse(null); 
    /*
    SimpleNodeIterator sNI=list.elements();
    while(sNI.hasMoreNodes()){
    System.out.println(sNI.nextNode().getText());}
    */
    NodeList trs = nl.extractAllNodesThatMatch(new TagNameFilter("tr"),true);
    for(int i=0;i<trs.size();i++) {
        NodeList nodes = trs.elementAt(i).getChildren();
        NodeList tds  = nodes.extractAllNodesThatMatch(new TagNameFilter("td"),true);
    System.out.println(tds.toString());

I am not getting any output, e开发者_Python百科clipse shows javaw.exe terminated.

Pass the path to the resource into the constructor.

Parser parser = new Parser("index.html");

Parse and print all the divs on this page:

Parser parser = new Parser("http://stackoverflow.com/questions/7293729/parsing-using-htmlparser/");
parser.setEncoding("UTF-8");
NodeList nl = parser.parse(null);
NodeList div = nl.extractAllNodesThatMatch(new TagNameFilter("div"),true);
System.out.println(div.toString());

parser.setInputHtml(String inputHtml) doesn't do what you think it does. It treats inputHtml as the html input to the parser. You use the constructor to point the parser at an html resource (file or URL).

Example:

Parser parser = new Parser();
parser.setInputHTML("<div>Foo</div><div>Bar</div>");

Parsing using HTMLParser

精彩评论

关注公众号

热门标签

图文推荐

Parsing using HTMLParser

更多 问答 相关资讯：

精彩评论

关注公众号

热门标签

图文推荐

更多问答相关资讯：