scrapy-grpc is an extension to control a running Scrapy web crawler via gRPC. The service provides access to the main Crawler object.
Install scrapy-grpc using pip:
$ pip install scrapy-grpc
First, you need to include the entension to your EXTENSIONS dict in
settings.py, for example:
EXTENSIONS = {
'scrapy_grpc.webservice.WebService': 500,
}
Then, you need to enable the extension with the GRPC_ENABLED setting,
set to True.
The web server will listen on a port specified in GRPC_PORT (by default, it will try to listen on port 6080)
The endpoint for accessing the crawler object is:
http://localhost:6080/crawler
A Python client is included:
from scrapy_grpc import client ...
These are the settings that control the web service behaviour:
Default: False
A boolean which specifies if the web service will be enabled (provided its extension is also enabled).
Default: 6080
The port range to use for the web service. If set to None or 0, a
dynamically assigned port is used.
Default: '127.0.0.1'
The interface the web service should listen on.