-
Notifications
You must be signed in to change notification settings - Fork 166
Jsoup Extension
Dmitriy Zayceff edited this page Apr 17, 2015
·
13 revisions
API: http://jphp-docs.readthedocs.org/en/latest/api_en/php/jsoup/
Jsoup is a library for working with real-world HTML. It provides a very convenient API for extracting and manipulating data, using the best of DOM, CSS, and jquery-like methods. It based on the jsoup java library jsoup.org
Getting and parsing text of url:
use php\jsoup\Jsoup;
$doc = Jsoup::connect("http://en.wikipedia.org/")->get();
$newsHeadlines = $doc->select("#mp-itn b a");
foreach ($newsHeadlines as $element) {
echo "- {$element->text()}\n";
}Use the static Jsoup::parse(string $html) method, or Jsoup::parse(string $html, string $baseUri) if the page came from the web.
$html = "<html><head><title>First parse</title></head>
<body><p>Parsed HTML into a doc.</p></body></html>";
$doc = Jsoup::parse($html);
echo $doc->title();JPHP Group 2015