/**
 * Verifies that breadth-first traversal honours the configured maximum
 * queue size: with maxQueueSize = 3, exactly the first three discovered
 * URIs (hrefA, hrefB, hrefC) are enqueued, in BFS order.
 *
 * Relies on fixture state set up on $this: the spider under test and the
 * expected href strings.
 */
public function testCrawlBFSMaxQueueSize()
{
    $this->spider->setTraversalAlgorithm(Spider::ALGORITHM_BREADTH_FIRST);
    // Depth is effectively unlimited here so that only the queue-size
    // cap (not the depth cap) stops the crawl.
    $this->spider->setMaxDepth(1000);
    $this->spider->setMaxQueueSize(3);

    $this->spider->crawl();

    $expected = array($this->hrefA, $this->hrefB, $this->hrefC);

    $stats = $this->spider->getStatsHandler();
    $queued = $stats->getQueued();

    // Guard against a vacuous pass: the original loop would succeed
    // silently if the queue were empty or shorter than expected.
    $this->assertCount(count($expected), $queued);

    foreach ($queued as $index => $uri) {
        $this->assertEquals($expected[$index], $uri->toString());
    }
}
<?php

use VDB\Spider\Discoverer\XPathExpressionDiscoverer;
use VDB\Spider\Spider;

require_once __DIR__ . '/../vendor/autoload.php';

// Build the spider for the example start URI.
$spider = new Spider('http://www.dmoz.org');

// Register a URI discoverer; without one the spider has nothing to follow.
// Here we pick up <a> tags inside the catalogs <div>.
$spider->addDiscoverer(new XPathExpressionDiscoverer("//div[@id='catalogs']//a"));

// Keep the example small: one level deep, at most 10 queued items
// from the start page.
$spider->setMaxDepth(1);
$spider->setMaxQueueSize(10);

// Run the crawl.
$spider->crawl();

// Report crawl statistics.
$statsHandler = $spider->getStatsHandler();
echo "\nSPIDER ID: " . $statsHandler->getSpiderId();
echo "\n ENQUEUED: " . count($statsHandler->getQueued());
echo "\n SKIPPED: " . count($statsHandler->getFiltered());
echo "\n FAILED: " . count($statsHandler->getFailed());

// Finally, process the downloaded resources: echo each page's <title>.
echo "\n\nDOWNLOADED RESOURCES: ";
foreach ($spider->getPersistenceHandler() as $downloadedResource) {
    echo "\n - " . $downloadedResource->getCrawler()->filterXpath('//title')->text();
}