abc.ABCMeta
scrapy.item.ItemMeta
- Metaclass_ of :class:`Item` that handles field definitions.

argparse.ArgumentParser
scrapy.cmdline.ScrapyArgumentParser
- Undocumented
scrapy.utils.curl.CurlParser
- Undocumented

argparse.HelpFormatter
scrapy.commands.ScrapyHelpFormatter
- Help Formatter for scrapy command line help messages.

AssertionError
scrapy.exceptions.ContractFail
- Error raised in case of a failing contract

collections.abc.MutableMapping
scrapy.Item
- Base class for scraped items.
scrapy.settings.BaseSettings
- Instances of this class behave like dictionaries, but store priorities along with their ``(key, value)`` pairs, and can be frozen (i.e. marked immutable).
scrapy.settings.Settings
- This object stores Scrapy settings for the configuration of internal components, and can be used for any further customization.

collections.OrderedDict
scrapy.utils.datatypes.LocalCache
- Dictionary with a finite number of keys.

dict
scrapy.Field
- Container of field metadata
scrapy.utils.datatypes.CaselessDict
- No class docstring; 0/1 class variable, 2/12 methods, 0/1 class method documented
scrapy.http.headers.Headers
- Case-insensitive HTTP headers dictionary

enum.Enum
scrapy.core.http2.stream.StreamCloseReason
- Undocumented

Exception
scrapy.core.downloader.handlers.http11.TunnelError
- An HTTP CONNECT tunnel could not be established by the proxy.
scrapy.exceptions.CloseSpider
- Raise this from callbacks to request the spider to be closed
scrapy.exceptions.DontCloseSpider
- Request the spider not to be closed yet
scrapy.exceptions.DropItem
- Drop item from the item pipeline
scrapy.pipelines.images.NoimagesDrop
- Product with no images exception
scrapy.exceptions.IgnoreRequest
- Indicates a decision was made not to process a request
scrapy.spidermiddlewares.httperror.HttpError
- A non-200 response was filtered
scrapy.exceptions.NotConfigured
- Indicates a missing configuration situation
scrapy.exceptions.NotSupported
- Indicates a feature or method is not supported
scrapy.exceptions.StopDownload
- Stop the download of the body for a given response. The 'fail' boolean parameter indicates whether or not the resulting partial response should be handled by the request errback. Note that 'fail' is a keyword-only argument.
scrapy.exceptions.UsageError
- To indicate a command-line usage error
scrapy.pipelines.files.FileException
- General media error exception
scrapy.pipelines.images.ImageException
- General image error exception

h2.exceptions.H2Error
scrapy.core.http2.protocol.InvalidNegotiatedProtocol
- Undocumented
scrapy.core.http2.protocol.MethodNotAllowed405
- Undocumented
scrapy.core.http2.protocol.RemoteTerminatedConnection
- Undocumented
scrapy.core.http2.stream.InvalidHostname
- Undocumented

io.IOBase
scrapy.extensions.postprocessing.PostProcessingManager
- This will manage and use declared plugins to process data in a pipeline-ish way. :param plugins: all the declared plugins for the feed :type plugins: list :param file: final target file where the processed data will be written :type file: file like object...

itemloaders.ItemLoader
scrapy.loader.ItemLoader
- A user-friendly abstraction to populate an :ref:`item <topics-items>` with data by applying :ref:`field processors <topics-loaders-processors>` to scraped data. When instantiated with a ``selector`` or a ``response`` it supports data extraction from web pages using :ref:`selectors <topics-selectors>`.

json.JSONDecoder
scrapy.utils.serialize.ScrapyJSONDecoder
- Undocumented

json.JSONEncoder
scrapy.utils.serialize.ScrapyJSONEncoder
- Undocumented

logging.Filter
scrapy.utils.log.TopLevelFormatter
- Keep only top-level logger names (direct children of the root logger) in records.

logging.Handler
scrapy.utils.log.LogCounterHandler
- Record log-level counts into crawler stats

parsel.Selector
scrapy.Selector
- An instance of :class:`Selector` is a wrapper over response to select certain parts of its content.

parsel.Selector.selectorlist_cls
scrapy.selector.unified.SelectorList
- The :class:`SelectorList` class is a subclass of the builtin ``list`` class, which provides a few additional methods.

scrapy.commands.bench._BenchServer
- Undocumented
scrapy.commands.ScrapyCommand
- No class docstring; 0/2 instance variable, 0/4 class variable, 6/9 methods documented
scrapy.commands.BaseRunSpiderCommand
- Common class used to share functionality between the crawl, parse and runspider commands
scrapy.commands.crawl.Command
- Undocumented
scrapy.commands.parse.Command
- Undocumented
scrapy.commands.runspider.Command
- Undocumented
scrapy.commands.bench.Command
- Undocumented
scrapy.commands.check.Command
- Undocumented
scrapy.commands.edit.Command
- Undocumented
scrapy.commands.fetch.Command
- Undocumented
scrapy.commands.view.Command
- Undocumented
scrapy.commands.genspider.Command
- No class docstring; 0/1 property, 0/1 instance variable, 0/2 class variable, 1/8 method documented
scrapy.commands.list.Command
- Undocumented
scrapy.commands.settings.Command
- Undocumented
scrapy.commands.shell.Command
- No class docstring; 0/2 class variable, 1/7 method documented
scrapy.commands.startproject.Command
- No class docstring; 0/1 property, 0/1 instance variable, 0/2 class variable, 1/5 method documented
scrapy.commands.version.Command
- Undocumented
scrapy.contracts.Contract
- Abstract class for contracts
scrapy.contracts.default.CallbackKeywordArgumentsContract
- Contract to set the keyword arguments for the request. The value should be a JSON-encoded dictionary, e.g.:
scrapy.contracts.default.ReturnsContract
- Contract to check the output of a callback
scrapy.contracts.default.ScrapesContract
- Contract to check presence of fields in scraped items: @scrapes page_name page_body
scrapy.contracts.default.UrlContract
- Contract to set the url of the request (mandatory): @url http://scrapy.org
scrapy.contracts.ContractsManager
- No class docstring; 0/1 class variable, 1/6 method documented
scrapy.core.downloader.contextfactory.AcceptableProtocolsContextFactory
- Context factory used to override the acceptable protocols to set up the [OpenSSL.SSL.Context] for doing NPN and/or ALPN negotiation.
scrapy.core.downloader.Downloader
- Undocumented
scrapy.core.downloader.handlers.datauri.DataURIDownloadHandler
- Undocumented
scrapy.core.downloader.handlers.DownloadHandlers
- No class docstring; 0/4 instance variable, 1/5 method documented
scrapy.core.downloader.handlers.file.FileDownloadHandler
- Undocumented
scrapy.core.downloader.handlers.ftp.FTPDownloadHandler
- Undocumented
scrapy.core.downloader.handlers.http10.HTTP10DownloadHandler
- No class docstring; 0/4 instance variable, 0/1 class variable, 1/3 method, 0/1 class method documented
scrapy.core.downloader.handlers.http11._RequestBodyProducer
- Undocumented
scrapy.core.downloader.handlers.http11.HTTP11DownloadHandler
- No class docstring; 0/7 instance variable, 0/1 class variable, 1/3 method, 0/1 class method documented
scrapy.core.downloader.handlers.http11.ScrapyAgent
- Undocumented
scrapy.core.downloader.handlers.http2.H2DownloadHandler
- Undocumented
scrapy.core.downloader.handlers.http2.ScrapyH2Agent
- Undocumented
scrapy.core.downloader.handlers.s3.S3DownloadHandler
- Undocumented
scrapy.core.downloader.Slot
- Downloader slot
scrapy.core.engine.ExecutionEngine
- No class docstring; 0/1 property, 0/14 instance variable, 6/22 methods documented
scrapy.core.engine.Slot
- Undocumented
scrapy.core.http2.agent.H2Agent
- No class docstring; 0/4 instance variable, 1/4 method documented
scrapy.core.http2.agent.ScrapyProxyH2Agent
- No class docstring; 0/1 instance variable, 1/3 method documented
scrapy.core.http2.agent.H2ConnectionPool
- No class docstring; 0/4 instance variable, 1/6 method documented
scrapy.core.http2.stream.Stream
- Represents a single HTTP/2 Stream.
scrapy.core.scheduler.BaseScheduler
- The scheduler component is responsible for storing requests received from the engine, and feeding them back upon request (also to the engine).
scrapy.core.scheduler.Scheduler
- Default Scrapy scheduler. This implementation also handles duplication filtering via the :setting:`dupefilter <DUPEFILTER_CLASS>`.
scrapy.core.scraper.Scraper
- No class docstring; 0/7 instance variable, 8/15 methods documented
scrapy.core.scraper.Slot
- Scraper slot (one per running spider)
scrapy.crawler.Crawler
- No class docstring; 0/11 instance variable, 1/5 method documented
scrapy.crawler.CrawlerRunner
- This is a convenient helper class that keeps track of, manages and runs crawlers inside an already set up :mod:`~twisted.internet.reactor`.
scrapy.crawler.CrawlerProcess
- A class to run multiple scrapy crawlers in a process simultaneously.
scrapy.downloadermiddlewares.ajaxcrawl.AjaxCrawlMiddleware
- Handle 'AJAX crawlable' pages marked as crawlable via meta tag. For more info see https://developers.google.com/webmasters/ajax-crawling/docs/getting-started.
scrapy.downloadermiddlewares.cookies.CookiesMiddleware
- This middleware enables working with sites that need cookies
scrapy.downloadermiddlewares.decompression.DecompressionMiddleware
- This middleware tries to recognise and extract the possibly compressed responses that may arrive.
scrapy.downloadermiddlewares.defaultheaders.DefaultHeadersMiddleware
- Undocumented
scrapy.downloadermiddlewares.downloadtimeout.DownloadTimeoutMiddleware
- Undocumented
scrapy.downloadermiddlewares.httpauth.HttpAuthMiddleware
- Set Basic HTTP Authorization header (http_user and http_pass spider class attributes)
scrapy.downloadermiddlewares.httpcache.HttpCacheMiddleware
- Undocumented
scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware
- This middleware allows compressed (gzip, deflate) traffic to be sent/received from web sites
scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddleware
- Undocumented
scrapy.downloadermiddlewares.redirect.BaseRedirectMiddleware
- Undocumented
scrapy.downloadermiddlewares.redirect.MetaRefreshMiddleware
- Undocumented
scrapy.downloadermiddlewares.redirect.RedirectMiddleware
- Handle redirection of requests based on response status and meta-refresh html tag.
scrapy.downloadermiddlewares.retry.RetryMiddleware
- Undocumented
scrapy.downloadermiddlewares.robotstxt.RobotsTxtMiddleware
- Undocumented
scrapy.downloadermiddlewares.stats.DownloaderStats
- Undocumented
scrapy.downloadermiddlewares.useragent.UserAgentMiddleware
- This middleware allows spiders to override the user_agent
scrapy.dupefilters.BaseDupeFilter
- No class docstring; 1/4 method, 0/1 class method documented
scrapy.dupefilters.RFPDupeFilter
- Request Fingerprint duplicates filter
scrapy.exporters.BaseItemExporter
- No class docstring; 0/5 instance variable, 2/7 methods documented
scrapy.exporters.CsvItemExporter
- Undocumented
scrapy.exporters.JsonItemExporter
- Undocumented
scrapy.exporters.JsonLinesItemExporter
- Undocumented
scrapy.exporters.MarshalItemExporter
- Exports items in a Python-specific binary format (see :mod:`marshal`).
scrapy.exporters.PickleItemExporter
- Undocumented
scrapy.exporters.PprintItemExporter
- Undocumented
scrapy.exporters.PythonItemExporter
- This is a base class for item exporters that extends :class:`BaseItemExporter` with support for nested items.
scrapy.exporters.XmlItemExporter
- Undocumented
scrapy.extensions.closespider.CloseSpider
- Undocumented
scrapy.extensions.corestats.CoreStats
- Undocumented
scrapy.extensions.debug.Debugger
- Undocumented
scrapy.extensions.debug.StackTraceDump
- Undocumented
scrapy.extensions.feedexport._FeedSlot
- Undocumented
scrapy.extensions.feedexport.BlockingFeedStorage
- Undocumented
scrapy.extensions.feedexport.FTPFeedStorage
- Undocumented
scrapy.extensions.feedexport.GCSFeedStorage
- Undocumented
scrapy.extensions.feedexport.S3FeedStorage
- Undocumented
scrapy.extensions.feedexport.FeedExporter
- No class docstring; 0/7 instance variable, 3/17 methods, 0/1 class method documented
scrapy.extensions.feedexport.FileFeedStorage
- Undocumented
scrapy.extensions.feedexport.ItemFilter
- This will be used by FeedExporter to decide if an item should be allowed to be exported to a particular feed.
scrapy.extensions.feedexport.StdoutFeedStorage
- Undocumented
scrapy.extensions.httpcache.DbmCacheStorage
- Undocumented
scrapy.extensions.httpcache.DummyPolicy
- Undocumented
scrapy.extensions.httpcache.FilesystemCacheStorage
- No class docstring; 0/5 instance variable, 2/7 methods documented
scrapy.extensions.httpcache.RFC2616Policy
- Undocumented
scrapy.extensions.logstats.LogStats
- Log basic scraping stats periodically
scrapy.extensions.memdebug.MemoryDebugger
- Undocumented
scrapy.extensions.memusage.MemoryUsage
- No class docstring; 0/9 instance variable, 1/8 method, 0/1 class method documented
scrapy.extensions.postprocessing.Bz2Plugin
- Compresses received data using `bz2 <https://en.wikipedia.org/wiki/Bzip2>`_.
scrapy.extensions.postprocessing.GzipPlugin
- Compresses received data using `gzip <https://en.wikipedia.org/wiki/Gzip>`_.
scrapy.extensions.postprocessing.LZMAPlugin
- Compresses received data using `lzma <https://en.wikipedia.org/wiki/Lempel–Ziv–Markov_chain_algorithm>`_.
scrapy.extensions.spiderstate.SpiderState
- Store and load spider state during a scraping job
scrapy.extensions.statsmailer.StatsMailer
- Undocumented
scrapy.extensions.throttle.AutoThrottle
- No class docstring; 0/5 instance variable, 1/8 method, 0/1 class method documented
scrapy.http.cookies._DummyLock
- Undocumented
scrapy.http.cookies.CookieJar
- Undocumented
scrapy.http.cookies.WrappedRequest
- Wraps a scrapy Request with the methods defined by the urllib2.Request class, to interact with the CookieJar class
scrapy.http.cookies.WrappedResponse
- Undocumented
scrapy.link.Link
- Link objects represent a link extracted by the LinkExtractor.
scrapy.linkextractors.lxmlhtml.LxmlLinkExtractor
- No class docstring; 0/9 instance variable, 0/1 class variable, 1/6 method documented
scrapy.linkextractors.lxmlhtml.LxmlParserLinkExtractor
- No class docstring; 0/6 instance variable, 1/6 method documented
scrapy.logformatter.LogFormatter
- Class for generating log messages for different actions.
scrapy.mail.MailSender
- Undocumented
scrapy.middleware.MiddlewareManager
- Base class for implementing middleware managers
scrapy.core.downloader.middleware.DownloaderMiddlewareManager
- Undocumented
scrapy.core.spidermw.SpiderMiddlewareManager
- Undocumented
scrapy.extension.ExtensionManager
- Undocumented
scrapy.pipelines.ItemPipelineManager
- Undocumented
scrapy.pipelines.files.FSFilesStore
- Undocumented
scrapy.pipelines.files.FTPFilesStore
- Undocumented
scrapy.pipelines.files.GCSFilesStore
- Undocumented
scrapy.pipelines.files.S3FilesStore
- No class docstring; 0/3 instance variable, 0/9 constant, 2/5 methods documented
scrapy.pipelines.media.MediaPipeline
- No class docstring; 0/5 instance variable, 0/1 constant, 9/18 methods, 0/1 class method, 0/1 class documented
scrapy.pipelines.files.FilesPipeline
- Abstract pipeline that implements file downloading
scrapy.pipelines.images.ImagesPipeline
- Abstract pipeline that implements the image thumbnail generation logic
scrapy.pipelines.media.MediaPipeline.SpiderInfo
- Undocumented
scrapy.pqueues.DownloaderAwarePriorityQueue
- PriorityQueue which takes Downloader activity into account: domains (slots) with the least amount of active downloads are dequeued first.
scrapy.pqueues.DownloaderInterface
- No class docstring; 0/1 instance variable, 1/4 method documented
scrapy.pqueues.ScrapyPriorityQueue
- A priority queue implemented using multiple internal queues (typically, FIFO queues). It uses one internal queue for each priority value. The internal queue must implement the following methods:
scrapy.resolver._CachingResolutionReceiver
- Undocumented
scrapy.resolver.CachingHostnameResolver
- Experimental caching resolver. Resolves IPv4 and IPv6 addresses, does not support setting a timeout value for DNS requests.
scrapy.resolver.HostResolution
- Undocumented
scrapy.responsetypes.ResponseTypes
- No class docstring; 0/2 instance variable, 0/1 constant, 6/8 methods documented
scrapy.robotstxt.RobotParser
- No class docstring; 1/1 method, 1/1 class method documented
scrapy.robotstxt.ProtegoRobotParser
- Undocumented
scrapy.robotstxt.PythonRobotParser
- Undocumented
scrapy.robotstxt.ReppyRobotParser
- Undocumented
scrapy.robotstxt.RerpRobotParser
- Undocumented
scrapy.settings.SettingsAttribute
- Class for storing data related to settings attributes.
scrapy.shell.Shell
- Undocumented
scrapy.signalmanager.SignalManager
- No class docstring; 0/1 instance variable, 5/6 methods documented
scrapy.spiderloader.SpiderLoader
- SpiderLoader is a class which locates and loads spiders in a Scrapy project.
scrapy.spidermiddlewares.depth.DepthMiddleware
- Undocumented
scrapy.spidermiddlewares.httperror.HttpErrorMiddleware
- Undocumented
scrapy.spidermiddlewares.offsite.OffsiteMiddleware
- No class docstring; 0/3 instance variable, 1/7 method, 0/1 class method documented
scrapy.spidermiddlewares.referer.RefererMiddleware
- No class docstring; 0/1 instance variable, 1/6 method, 0/1 class method documented
scrapy.spidermiddlewares.referer.ReferrerPolicy
- No class docstring; 0/1 class variable, 2/7 methods documented
scrapy.spidermiddlewares.referer.NoReferrerPolicy
- https://www.w3.org/TR/referrer-policy/#referrer-policy-no-referrer
scrapy.spidermiddlewares.referer.NoReferrerWhenDowngradePolicy
- https://www.w3.org/TR/referrer-policy/#referrer-policy-no-referrer-when-downgrade
scrapy.spidermiddlewares.referer.DefaultReferrerPolicy
- A variant of "no-referrer-when-downgrade", with the addition that "Referer" is not sent if the parent request was using ``file://`` or ``s3://`` scheme.
scrapy.spidermiddlewares.referer.OriginPolicy
- https://www.w3.org/TR/referrer-policy/#referrer-policy-origin
scrapy.spidermiddlewares.referer.OriginWhenCrossOriginPolicy
- https://www.w3.org/TR/referrer-policy/#referrer-policy-origin-when-cross-origin
scrapy.spidermiddlewares.referer.SameOriginPolicy
- https://www.w3.org/TR/referrer-policy/#referrer-policy-same-origin
scrapy.spidermiddlewares.referer.StrictOriginPolicy
- https://www.w3.org/TR/referrer-policy/#referrer-policy-strict-origin
scrapy.spidermiddlewares.referer.StrictOriginWhenCrossOriginPolicy
- https://www.w3.org/TR/referrer-policy/#referrer-policy-strict-origin-when-cross-origin
scrapy.spidermiddlewares.referer.UnsafeUrlPolicy
- https://www.w3.org/TR/referrer-policy/#referrer-policy-unsafe-url
scrapy.spidermiddlewares.urllength.UrlLengthMiddleware
- Undocumented
scrapy.spiders.crawl.Rule
- Undocumented

scrapy.spiders.Spider
scrapy.spiders.init.InitSpider
- Base Spider with initialization facilities
scrapy.utils.spider.DefaultSpider
- Undocumented
scrapy.statscollectors.StatsCollector
- Undocumented
scrapy.statscollectors.DummyStatsCollector
- Undocumented
scrapy.statscollectors.MemoryStatsCollector
- Undocumented
scrapy.utils.datatypes.SequenceExclude
- Object to test if an item is NOT within some sequence.
scrapy.utils.iterators._StreamReader
- Undocumented
scrapy.utils.log.StreamLogger
- Fake file-like stream object that redirects writes to a logger instance
scrapy.utils.reactor.CallLaterOnce
- Schedule a function to be called in the next reactor loop, but only if it hasn't been already scheduled since the last time it ran.
scrapy.utils.request.RequestFingerprinter
- Default fingerprinter.
scrapy.utils.sitemap.Sitemap
- Class to parse Sitemap (type=urlset) and Sitemap Index (type=sitemapindex) files
scrapy.utils.testproc.ProcessTest
- Undocumented
scrapy.utils.testsite.SiteTest
- Undocumented
scrapy.utils.trackref.object_ref
- Inherit from this class to keep a record of live instances
scrapy.http.response.Response
- An object that represents an HTTP response, which is usually downloaded (by the Downloader) and fed to the Spiders for processing.
scrapy.http.response.text.TextResponse
- No class docstring; 1/3 property, 0/7 instance variable, 0/1 class variable, 0/1 constant, 4/15 methods documented
scrapy.http.response.html.HtmlResponse
- Undocumented
scrapy.http.response.xml.XmlResponse
- Undocumented
scrapy.Item
- Base class for scraped items.
scrapy.Request
- Represents an HTTP request, which is usually generated in a Spider and executed by the Downloader, thus generating a :class:`Response`.
scrapy.FormRequest
- Undocumented
scrapy.http.request.json_request.JsonRequest
- No class docstring; 0/1 property, 0/1 instance variable, 0/1 class variable, 1/3 method documented
scrapy.http.request.rpc.XmlRpcRequest
- Undocumented
scrapy.Selector
- An instance of :class:`Selector` is a wrapper over response to select certain parts of its content.
scrapy.selector.unified.SelectorList
- The :class:`SelectorList` class is a subclass of the builtin ``list`` class, which provides a few additional methods.
scrapy.Spider
- Base class for scrapy spiders. All spiders must inherit from this class.
scrapy.commands.bench._BenchSpider
- A spider that follows all links
scrapy.spiders.crawl.CrawlSpider
- Undocumented
scrapy.spiders.feed.CSVFeedSpider
- Spider for parsing CSV feeds. It receives a CSV file in a response; iterates through each of its rows, and calls parse_row with a dict containing each field's data.
scrapy.spiders.feed.XMLFeedSpider
- This class intends to be the base class for spiders that scrape from XML feeds.
scrapy.spiders.sitemap.SitemapSpider
- No class docstring; 0/2 instance variable, 0/4 class variable, 2/5 methods documented

twisted.internet._sslverify.ClientTLSOptions
scrapy.core.downloader.tls.ScrapyClientTLSOptions
- SSL Client connection creator ignoring certificate verification errors (for genuinely invalid certificates or bugs in verification code).

twisted.internet.base.ThreadedResolver
scrapy.resolver.CachingThreadedResolver
- Default caching resolver. IPv4 only, supports setting a timeout value for DNS requests.

twisted.internet.endpoints.TCP4ClientEndpoint
scrapy.core.downloader.handlers.http11.TunnelingTCP4ClientEndpoint
- An endpoint that tunnels through proxies to allow HTTPS downloads. To accomplish that, this endpoint sends an HTTP CONNECT to the proxy. The HTTP CONNECT is always sent when using this endpoint; this could be improved, as the CONNECT is redundant if the connection associated with this endpoint comes from the pool and a CONNECT has already been issued for it.

twisted.internet.error.ConnectionClosed
scrapy.core.http2.stream.InactiveStreamClosed
- Connection was closed without sending request headers of the stream. This happens when a stream is waiting for other streams to close and the connection is lost.

twisted.internet.protocol.ClientFactory
scrapy.core.downloader.webclient.ScrapyHTTPClientFactory
- No class docstring; 0/20 instance variable, 0/3 class variable, 2/11 methods documented

twisted.internet.protocol.Factory
scrapy.core.http2.protocol.H2ClientFactory
- Undocumented

twisted.internet.protocol.ProcessProtocol
scrapy.utils.testproc.TestProcessProtocol
- Undocumented

twisted.internet.protocol.Protocol
scrapy.core.downloader.handlers.ftp.ReceivedDataProtocol
- Undocumented
scrapy.core.downloader.handlers.http11._ResponseReader
- Undocumented
scrapy.core.http2.protocol.H2ClientProtocol
- No class docstring; 2/2 properties, 0/7 instance variable, 0/1 constant, 12/21 methods documented

twisted.internet.protocol.ServerFactory
scrapy.extensions.telnet.TelnetConsole
- Undocumented

twisted.protocols.policies.TimeoutMixin
scrapy.core.http2.protocol.H2ClientProtocol
- No class docstring; 2/2 properties, 0/7 instance variable, 0/1 constant, 12/21 methods documented

twisted.web.client.Agent
scrapy.core.downloader.handlers.http11.ScrapyProxyAgent
- No class docstring; 0/1 instance variable, 1/2 method documented
scrapy.core.downloader.handlers.http11.TunnelingAgent
- An agent that uses a L{TunnelingTCP4ClientEndpoint} to make HTTPS downloads. It may look strange that we have chosen to subclass Agent and not ProxyAgent, but consider that after the tunnel is opened the proxy is transparent to the client; thus the agent should behave as if no proxy were involved.

twisted.web.client.BrowserLikePolicyForHTTPS
scrapy.core.downloader.contextfactory.ScrapyClientContextFactory
- Non-peer-certificate-verifying HTTPS context factory
scrapy.core.downloader.contextfactory.BrowserLikeContextFactory
- Twisted-recommended context factory for web clients.

twisted.web.http.HTTPClient
scrapy.core.downloader.webclient.ScrapyHTTPPageGetter
- Undocumented

twisted.web.resource.Resource
scrapy.utils.benchserver.Root
- Undocumented

twisted.web.util.Redirect
scrapy.utils.testsite.NoMetaRefreshRedirect
- Undocumented

type
scrapy.core.scheduler.BaseSchedulerMeta
- Metaclass to check scheduler classes against the necessary interface

TypeError
scrapy.exceptions._InvalidOutput
- Indicates an invalid value has been returned by a middleware's processing method. Internal and undocumented, it should not be raised or caught by user code.

typing.AsyncIterable
scrapy.utils.python.MutableAsyncChain
- Similar to MutableChain but for async iterables

typing.Iterable
scrapy.utils.python.MutableChain
- Thin wrapper around itertools.chain, allowing iterables to be added "in place"

typing.Iterator
scrapy.utils.defer._AsyncCooperatorAdapter
- A class that wraps an async iterable into a normal iterator suitable for using in Cooperator.coiterate(). As it's only needed for parallel_async(), it calls the callable directly in the callback, instead of providing a more generic interface.

unittest.TextTestResult
scrapy.commands.check.TextTestResult
- Undocumented

ValueError
scrapy.http.response.text._InvalidSelector
- Raised when a URL cannot be obtained from a Selector

Warning
scrapy.exceptions.ScrapyDeprecationWarning
- Warning category for deprecated features, since the default DeprecationWarning is silenced on Python 2.7+
scrapy.spidermiddlewares.offsite.PortWarning
- Undocumented
scrapy.spidermiddlewares.offsite.URLWarning
- Undocumented

weakref.WeakKeyDictionary
scrapy.utils.datatypes.LocalWeakReferencedCache
- A weakref.WeakKeyDictionary implementation that uses LocalCache as its underlying data structure, making it ordered and capable of being size-limited.

zope.interface.Interface
scrapy.extensions.feedexport.IFeedStorage
- Interface that all Feed Storages must implement
scrapy.interfaces.ISpiderLoader
- No interface docstring; 4/4 methods documented