class documentation
class LxmlLinkExtractor: (source)
Undocumented
Method | __init__ |
Undocumented |
Method | extract |
Returns a list of :class:`~scrapy.link.Link` objects from the specified :class:`response <scrapy.http.Response>`. |
Method | matches |
Undocumented |
Instance Variable | allow |
Undocumented |
Instance Variable | allow |
Undocumented |
Instance Variable | canonicalize |
Undocumented |
Instance Variable | deny |
Undocumented |
Instance Variable | deny |
Undocumented |
Instance Variable | deny |
Undocumented |
Instance Variable | link |
Undocumented |
Instance Variable | restrict |
Undocumented |
Instance Variable | restrict |
Undocumented |
Method | _extract |
Undocumented |
Method | _link |
Undocumented |
Method | _process |
Undocumented |
Class Variable | _csstranslator |
Undocumented |
def __init__(self, allow=(), deny=(), allow_domains=(), deny_domains=(), restrict_xpaths=(), tags=( 'a', 'area'), attrs=( 'href'), canonicalize=False, unique=True, process_value=None, deny_extensions=None, restrict_css=(), strip=True, restrict_text=None):
(source)
¶
Undocumented
Returns a list of :class:`~scrapy.link.Link` objects from the specified :class:`response <scrapy.http.Response>`. Only links that match the settings passed to the ``__init__`` method of the link extractor are returned. Duplicate links are omitted if the ``unique`` attribute is set to ``True``, otherwise they are returned.