You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/source/topics/frontera-settings.rst
+34-33Lines changed: 34 additions & 33 deletions
Original file line number
Diff line number
Diff line change
@@ -492,63 +492,56 @@ documents scheduled after the change. All previously queued documents will be cr
492
492
Cassandra
493
493
---------
494
494
495
+
.. setting:: CASSANDRABACKEND_CACHE_SIZE
495
496
496
-
.. setting:: CASSANDRABACKEND_DROP_ALL_TABLES
497
+
CASSANDRABACKEND_CACHE_SIZE
498
+
^^^^^^^^^^^^^^^^^^^^^^^^^^^
497
499
498
-
CASSANDRABACKEND_DROP_ALL_TABLES
499
-
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
500
+
Default:: ``10000``
500
501
501
-
Default: ``False``
502
+
Cassandra Metadata LRU Cache size. It's used for caching objects, which are requested from DB every time already known,
503
+
documents are crawled. This is mainly saves DB throughput, increase it if you're experiencing problems with too high
504
+
volume of SELECT's to Metadata table, or decrease if you need to save memory.
502
505
503
-
Set to ``True`` if you need to drop of all DB tables on backend instantiation (e.g. every Scrapy spider run).
504
506
505
-
.. setting:: SQLALCHEMYBACKEND_ENGINE
507
+
.. setting:: CASSANDRABACKEND_CLUSTER_HOSTS
506
508
507
-
CASSANDRABACKEND_CLUSTER_IPS
508
-
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
509
+
CASSANDRABACKEND_CLUSTER_HOSTS
510
+
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
509
511
510
512
Default:: ``['127.0.0.1']``
511
513
512
-
Set IPs from Cassandra Cluster. Default is localhost. To assign more than one IP use this Syntax: ``['192.168.0.1', '192.168.0.2']``
514
+
The list of contact points to try connecting for cluster discovery. All contact points are not required, the driver
515
+
discovers the rest.
516
+
517
+
.. setting:: CASSANDRABACKEND_CLUSTER_PORT
513
518
514
519
CASSANDRABACKEND_CLUSTER_PORT
515
520
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
516
521
517
522
Default:: ``9042``
518
523
519
-
Set port from Cassandra Cluster / Nodes
524
+
The server-side port to open connections to Cassandra.
520
525
526
+
.. setting:: CASSANDRABACKEND_DROP_ALL_TABLES
521
527
522
-
CASSANDRABACKEND_GENERATE_STATS
523
-
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
528
+
CASSANDRABACKEND_DROP_ALL_TABLES
529
+
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
524
530
525
-
Default:: ``False``
531
+
Default: ``False``
526
532
527
-
Set this to true if you want to create an extra Table for stats collection. In this table there will be pages crawled, links queued etv. counted up.
533
+
Set to ``True`` to drop and create all DB tables on backend instantiation.
528
534
535
+
.. setting:: CASSANDRABACKEND_KEYSPACE
529
536
530
537
CASSANDRABACKEND_KEYSPACE
531
538
^^^^^^^^^^^^^^^^^^^^^^^^^
532
539
533
-
Default:: ``frontera``
534
-
535
-
Set cassandra Keyspace
536
-
537
-
CASSANDRABACKEND_CREATE_KEYSPACE_IF_NOT_EXISTS
538
-
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
539
-
540
-
Default:: ``True``
540
+
Default:: ``crawler``
541
541
542
-
Creates Keyspace if it not exist. Set to false if you frontera shouldn't check on every startup.
543
-
544
-
545
-
CASSANDRABACKEND_CRAWL_ID
546
-
^^^^^^^^^^^^^^^^^^^^^^^^^
547
-
548
-
Default:: ``default``
549
-
550
-
Sets an ID in each table for the actual crawl. If you want to run another crawl from begining in same Table set to another Crawl ID. Its an Text field.
0 commit comments