开发者

How to get a div via PHP?

开发者 https://www.devze.com 2022-12-25 17:49 出处:网络
I get a page using file_get_contents from a remote server, but I want to filter that page and get a DIV from it that has class 开发者_开发百科\"text\" using PHP. I started with DOMDocument but I\'m lo

I get a page using file_get_contents from a remote server, but I want to filter that page and get a DIV from it that has class 开发者_开发百科"text" using PHP. I started with DOMDocument but I'm lost now.

Any help?

$file = file_get_contents("xx");
$elements = new DOMDocument();
$elements->loadHTML($file);
foreach ($elements as $element) {
    if( !is_null($element->attributes)) {
        foreach ($element->attributes as $attrName => $attrNode) {
            if( $attrName == "class" && $attrNode== "text") {
                echo $element;
            }
        }
    }
}


Once you have loaded the document to a DOMDocument instance, you can use XPath queries on it -- which might be easier than going yourself through the DOM.

For that, you can use the DOMXpath class.


For example, you should be able to do something like this :

$dom = new DOMDocument();
$dom->loadHTML($html);

$xpath = new DOMXPath($dom);
$tags = $xpath->query('//div[@class="text"]');
foreach ($tags as $tag) {
    var_dump($tag->textContent);
}


(Not tested, so you might need to adapt the XPath query a bit...)


Personally, I like Simple HTML Dom Parser.

include "lib.simple_html_dom.php"

$html = file_get_html('http://scrapeyoursite.com');
$html->find('div.text')->plaintext;

Pretty simple, huh? It accommodates selectors like jQuery :)


you can use simple_html_dom like here simple_html_dom doc

or use my code like here :

include "simple_html_dom.php";
$html = new simple_html_dom();
$html->load_file('www.yoursite.com');
$con_div = $html->find('div',0);//get value plaintext each html

echo the $con_div in plaintext..

$con_div->plaintext;

it's mean you will find the first div in array ('div',0) and show it in plaintext.. i hope it help you :cheer

0

精彩评论

暂无评论...
验证码 换一张
取 消

关注公众号