A simple PHP DOM scraper based on the DOMDocument class and preg_match()
functions
- html
- css
$html_contents = file_get_contents($url);
$dom_contents = parse_dom_contents($html_contents,'html');
$dom_contents['html:head'] = '';
$dom_contents['html:links'] = '';
$dom_contents['html:scripts'] = '';
$dom_contents['html:styles'] = '';
$dom_contents['html:body'] = '';
$dom_contents['css'][$selector] = $value;
- html -> (a,meta)
- css -> @(media|import|local)
- xml
- rss
- atom