class documentation
class ExecutionEngine: (source)
Undocumented
Method | __init__ |
Undocumented |
Method | close |
Gracefully close the execution engine. If it has already been started, stop it. In all cases, close the spider and the downloader. |
Method | close |
Close (cancel) spider and clear all its outstanding requests |
Method | crawl |
Inject the request into the spider <-> downloader pipeline |
Method | download |
Return a Deferred which fires with a Response as result, only downloader middlewares are applied |
Method | has |
Undocumented |
Method | open |
Undocumented |
Method | pause |
Undocumented |
Method | schedule |
Undocumented |
Method | spider |
Undocumented |
Method | start |
Undocumented |
Method | stop |
Gracefully stop the execution engine |
Method | unpause |
Undocumented |
Instance Variable | crawler |
Undocumented |
Instance Variable | downloader |
Undocumented |
Instance Variable | logformatter |
Undocumented |
Instance Variable | paused |
Undocumented |
Instance Variable | running |
Undocumented |
Instance Variable | scheduler |
Undocumented |
Instance Variable | scraper |
Undocumented |
Instance Variable | settings |
Undocumented |
Instance Variable | signals |
Undocumented |
Instance Variable | slot |
Undocumented |
Instance Variable | spider |
Undocumented |
Instance Variable | start |
Undocumented |
Property | open |
Undocumented |
Method | _download |
Undocumented |
Method | _downloaded |
Undocumented |
Method | _get |
Undocumented |
Method | _handle |
Undocumented |
Method | _needs |
Undocumented |
Method | _next |
Undocumented |
Method | _next |
Undocumented |
Method | _schedule |
Undocumented |
Method | _spider |
Called when a spider gets idle, i.e. when there are no remaining requests to download or schedule. It can be called multiple times. If a handler for the spider_idle signal raises a DontCloseSpider exception, the spider is not closed until the next loop and this function is guaranteed to be called (at least) once again... |
Instance Variable | _closewait |
Undocumented |
Instance Variable | _spider |
Undocumented |
Gracefully close the execution engine. If it has already been started, stop it. In all cases, close the spider and the downloader.
@inlineCallbacks
def open_spider(self, spider:
def open_spider(self, spider:
Spider
, start_requests: Iterable
= (), close_if_idle: bool
= True):
(source)
¶
Undocumented
def _downloaded(self, result:
Union[ Response, Request]
, request: Request
, spider: Spider
) -> Union[ Deferred, Response]
:
(source)
¶
Undocumented
def _handle_downloader_output(self, result:
Union[ Request, Response, Failure]
, request: Request
) -> Optional[ Deferred]
:
(source)
¶
Undocumented
Called when a spider gets idle, i.e. when there are no remaining requests to download or schedule. It can be called multiple times. If a handler for the spider_idle signal raises a DontCloseSpider exception, the spider is not closed until the next loop and this function is guaranteed to be called (at least) once again. A handler can raise CloseSpider to provide a custom closing reason.