scrapy.linkextractors

package documentation

This package contains a collection of Link Extractors. For more information, see docs/topics/link-extractors.rst.

Module lxmlhtml: Link extractor based on lxml.html

From __init__.py:

IGNORED_EXTENSIONS: list[str]

Undocumented

Value

['7z',
 '7zip',
 'bz2',
 'rar',
 'tar',
 'tar.gz',
 'xz',
...
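
The constant is undocumented here, so the following is only a minimal sketch of how a list of extensions like this could be used to filter URLs. It uses the standard library only and does not call any Scrapy API. The names `SHOWN_EXTENSIONS` and `has_ignored_extension` are hypothetical, and the list contains only the entries visible in the truncated value above, not the full constant.

```python
from urllib.parse import urlparse

# Assumption: only the entries visible in the truncated listing above.
# The real IGNORED_EXTENSIONS list is longer.
SHOWN_EXTENSIONS = ["7z", "7zip", "bz2", "rar", "tar", "tar.gz", "xz"]


def has_ignored_extension(url, extensions=SHOWN_EXTENSIONS):
    """Return True if the URL path ends with '.' followed by a listed extension.

    Hypothetical helper for illustration; not part of scrapy.linkextractors.
    """
    # Compare against the path only, so query strings and fragments are ignored.
    path = urlparse(url).path.lower()
    # A suffix match (rather than os.path.splitext) lets multi-part
    # entries such as 'tar.gz' match as a whole.
    return any(path.endswith("." + ext) for ext in extensions)


urls = [
    "https://example.com/index.html",
    "https://example.com/files/archive.tar.gz",
    "https://example.com/a.RAR?x=1",
]

# Keep only the URLs whose path does not end in a listed extension.
kept = [u for u in urls if not has_ignored_extension(u)]
print(kept)
```

The suffix comparison is a design choice of this sketch: splitting on the last dot would reduce `archive.tar.gz` to `.gz`, which is not among the entries shown.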
