/**
 * Adds a rule to the list of rules that decide which pages or files - regarding their content-type - should be received.
 *
 * After receiving the HTTP header of a followed URL, the crawler checks - based on the given rules - whether the content
 * of that URL should be received.
 * If no rule matches the content-type of the document, the content won't be received.
 *
 * Example:
 * <code>
 * $crawler->addContentTypeReceiveRule("#text/html#");
 * $crawler->addContentTypeReceiveRule("#text/css#");
 * </code>
 * These rules let the crawler receive the content/source of pages with the content-type "text/html" AND "text/css".
 * Pages or files with other content-types (e.g. "image/gif") won't be received (if these are the only rules added to the list).
 *
 * <b>IMPORTANT:</b> By default, if no rule was added to the list, the crawler receives every content.
 *
 * Note: To reduce the traffic the crawler causes, you should only add content-types of pages/files you really want to receive.
 * At the very least you should add the content-type "text/html" to this list, otherwise the crawler can't find any links.
 *
 * @param string $regex The rule as a regular expression
 * @return bool TRUE if the rule was added to the list.
 *              FALSE if the given regex is not valid.
 * @section 2 Filter-settings
 */
public function addContentTypeReceiveRule($regex)
{
  return $this->PageRequest->addReceiveContentType($regex);
}