@@ -34,6 +35,7 @@ The environmental variables for the cdcrawler-dev App Service include:
 * CRAWLER_GITHUB_TOKEN
 * CRAWLER_HOST
 * CRAWLER_INSIGHTS_KEY
+* CRAWLER_LICENSEE_PARALLELISM
 * CRAWLER_NAME
 * CRAWLER_QUEUE_AZURE_CONNECTION_STRING
 * CRAWLER_QUEUE_PREFIX
@@ -87,6 +89,12 @@ Note that we only use this in the development environment, not in the production

 We use [Azure Application Insights](https://docs.microsoft.com/en-us/azure/azure-monitor/app/app-insights-overview) to monitor the crawler application. This requires a key and this is where it is kept.

+### CRAWLER_LICENSEE_PARALLELISM
+
+This is the maximum number of `licensee` processes to run in parallel. `licensee` is a tool to collect license
+information. The default value is `10` and setting it to a smaller value can reduce CPU spikes and lead to the crawler
+having a more uniform CPU usage.
+
 ### CRAWLER_NAME

 This is a name to refer to the crawler with. Note that we set it in the App Service in the development environment and in [the Docker file](https://github.com/clearlydefined/crawler/blob/32a0d6b59edfda5d3226c50680e4a8338af395cd/Dockerfile) for the Prod environment.
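
For readers unfamiliar with this kind of setting, here is a minimal sketch of how a value like `CRAWLER_LICENSEE_PARALLELISM` could cap concurrent work. It is not taken from the crawler's source: `parseParallelism` and `mapWithLimit` are hypothetical helpers, and the only detail drawn from the documentation above is the default of `10`.

```typescript
// Hypothetical helper, not the crawler's actual code.
// Parses a positive integer from an environment-style string and
// falls back to the documented default of 10 when the value is missing or invalid.
function parseParallelism(raw: string | undefined, fallback: number = 10): number {
  const parsed = parseInt(raw === undefined ? "" : raw, 10);
  return !isNaN(parsed) && parsed > 0 ? parsed : fallback;
}

// Hypothetical helper: applies `worker` to every item while keeping
// at most `limit` calls in flight. Results keep the order of `items`.
async function mapWithLimit<T, R>(
  items: readonly T[],
  limit: number,
  worker: (item: T) => Promise<R>
): Promise<R[]> {
  const results: R[] = new Array(items.length);
  let next = 0;

  // Each runner repeatedly claims the next unprocessed index until none remain.
  async function runner(): Promise<void> {
    while (next < items.length) {
      const index = next++;
      results[index] = await worker(items[index]);
    }
  }

  const runners: Promise<void>[] = [];
  const count = Math.min(limit, items.length);
  for (let i = 0; i < count; i++) {
    runners.push(runner());
  }
  await Promise.all(runners);
  return results;
}
```

In a Node.js process the limit would presumably be read as `parseParallelism(process.env.CRAWLER_LICENSEE_PARALLELISM)` and passed to whatever code spawns `licensee`. A smaller limit means fewer child processes compete for CPU at once, which matches the smoothing effect described above.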