PHP Readability Readability Examples

Programming language: PHP

Namespace/package name: Readability

Class/type: Readability

Examples at hotexamples.com: 2

PHP Readability Readability - 2 examples found. These are the top-rated real-world PHP examples of Readability\Readability, extracted from open source projects.

Frequently used methods

init(2)

getTitle(2)

getContent(1)

Differences between the PHP port and the original
-------------------------------------------------

Arc90's Readability is designed to run in the browser. It works on the DOM tree (the parsed HTML) after the page's CSS styles have been applied and JavaScript code executed. This PHP port does not run inside a browser. We use PHP's ability to parse HTML to build our DOM tree, but we cannot rely on CSS or JavaScript support. As such, the results will not always match Arc90's Readability. (For example, if a web page contains CSS style rules or JavaScript code which hide certain HTML elements from display, Arc90's Readability will dismiss those from consideration but our PHP port, unable to understand CSS or JavaScript, will not know any better.)

Another significant difference is that the aim of Arc90's Readability is to re-present the main content block of a given web page so users can read it more easily in their browsers. Correct identification, clean up, and separation of the content block is only a part of this process. This PHP port is only concerned with this part; it does not include code that relates to presentation in the browser. Arc90 already do that extremely well, and for PDF output there's FiveFilters.org's PDF Newspaper: http://fivefilters.org/pdf-newspaper/.

Finally, this class contains methods that might be useful for developers working on HTML document fragments. So without deviating too much from the original code (which I don't want to do because it makes debugging and updating more difficult), I've tried to make it a little more developer friendly. You should be able to use the methods here on existing DOMElement objects without passing an entire HTML document to be parsed.
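The CSS limitation described above can be seen with PHP's DOM extension alone. The following is a minimal sketch, not code from the port: the sample markup and the variable names are made up for illustration, and it only shows that a parser without CSS support keeps an element that a browser would hide.

```php
<?php
// Hypothetical sample page: the second paragraph is hidden by an inline style.
$html = '<html><body>'
      . '<p id="shown">Visible paragraph.</p>'
      . '<p id="hidden" style="display:none">Hidden paragraph.</p>'
      . '</body></html>';

// Parse the markup with PHP's DOM extension; no CSS or JavaScript is applied.
libxml_use_internal_errors(true); // collect parser warnings instead of printing them
$dom = new DOMDocument();
$dom->loadHTML($html);
libxml_clear_errors();

// Collect the text of every <p> element the parser produced.
$paragraphs = array();
foreach ($dom->getElementsByTagName('p') as $p) {
    $paragraphs[] = $p->textContent;
}

// Both paragraphs are listed: the parser has no notion that one is hidden.
print_r($paragraphs);
```

A browser-based tool would see only the first paragraph as displayed content, which is one reason results from the PHP port can differ from Arc90's Readability.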

Example #1

File: Teaser.php Project: zach-adams/php-teaser

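/** Extract the article title and content from a page using php-readability. */
function getArticle($url)
{
    // file_get_contents() returns false when the page cannot be fetched.
    $html = file_get_contents($url);
    if ($html === false) {
        return null;
    }
    $readability = new Readability($html, $url);
    // init() runs the extraction; it is expected to return false when no content is found.
    if (!$readability->init()) {
        return null;
    }
    return array(
        'title' => $readability->getTitle()->textContent,
        'content' => $readability->getContent()->textContent,
    );
}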

Example #2

File: functions.php Project: Gl0dGroup/CSICON-Publication-Tool

function getTitle($url)
{
    // Try Google's cached copy first, then fall back to the live page.
    // Only http:// URLs are rewritten; any other URL is fetched as-is both times.
    $cachedURL = str_replace('http://', 'http://webcache.googleusercontent.com/search?q=cache:', $url);
    foreach (array($cachedURL, $url) as $source) {
        // Suppress the warning on a failed fetch and move on to the next source.
        $html = @file_get_contents($source);
        if ($html === false) {
            continue;
        }
        $readability = new Readability($html, $url, 'libxml', false);
        $readability->init();
        $title = $readability->getTitle()->textContent;
        if ($title != "") {
            return $title;
        }
    }
    return 'This link has no title';
}