class documentation

class Crawler: (source)

View In Hierarchy

Undocumented

Method __init__ Undocumented
Method crawl Undocumented
Method stop Starts a graceful stop of the crawler and returns a deferred that is fired when the crawler is stopped.
Instance Variable crawling Undocumented
Instance Variable engine Undocumented
Instance Variable extensions Undocumented
Instance Variable logformatter Undocumented
Instance Variable request_fingerprinter Undocumented
Instance Variable settings Undocumented
Instance Variable signals Undocumented
Instance Variable spider Undocumented
Instance Variable spidercls Undocumented
Instance Variable stats Undocumented
Method _create_engine Undocumented
Method _create_spider Undocumented
Instance Variable __remove_handler Undocumented
def __init__(self, spidercls, settings=None, init_reactor: bool = False): (source)

Undocumented

@defer.inlineCallbacks
def crawl(self, *args, **kwargs): (source)

Undocumented

Starts a graceful stop of the crawler and returns a deferred that is fired when the crawler is stopped.

crawling: bool = (source)

Undocumented

Undocumented

extensions = (source)

Undocumented

logformatter = (source)

Undocumented

request_fingerprinter: RequestFingerprinter = (source)

Undocumented

settings = (source)

Undocumented

Undocumented

Undocumented

spidercls = (source)

Undocumented

Undocumented

def _create_engine(self): (source)

Undocumented

def _create_spider(self, *args, **kwargs): (source)

Undocumented

__remove_handler = (source)

Undocumented