We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
1 parent 422a3e0 commit 050384cCopy full SHA for 050384c
src/script/hostgraph/hostgraph_config.sh
@@ -12,7 +12,7 @@
12
### saved as tuples <from_host, to_host>
13
14
# crawls to be processed
15
-CRAWLS=("CC-MAIN-2025-43" "CC-MAIN-2025-47" "CC-MAIN-2025-51")
+CRAWLS=("CC-MAIN-2025-47" "CC-MAIN-2025-51" "CC-MAIN-2026-04")
16
17
INPUT_BASE_URL="s3://commoncrawl/"
18
@@ -42,7 +42,7 @@ S3A_OUTPUT_PREFIX=s3a://commoncrawl-webgraph
42
################################################################################
43
# construct a merged graph of multiple monthly crawls
44
45
-MERGE_NAME=cc-main-2025-oct-nov-dec
+MERGE_NAME=cc-main-2025-26-nov-dec-jan
46
47
# Naming convention should be the three months' crawls that are
48
# used to generate this graph release. In the event of multiple months
0 commit comments