Skip to content

Commit 6d50654

Browse files
committed
Refine author count bucketing to individual buckets for 1-4 authors and 5+ grouping
1 parent faa2b27 commit 6d50654

File tree

1 file changed

+10
-8
lines changed

1 file changed

+10
-8
lines changed

scripts/1-fetch/arxiv_fetch.py

Lines changed: 10 additions & 8 deletions
Original file line numberDiff line numberDiff line change
@@ -426,20 +426,22 @@ def bucket_author_count(n):
426426
"""
427427
Convert author count to predefined buckets for analysis.
428428
429-
Buckets: "1", "2-3", "4-6", "7-10", "11+", "Unknown"
429+
Buckets: "1", "2", "3", "4", "5+", "Unknown"
430430
Reduces granularity for better statistical analysis.
431431
"""
432432
if n is None:
433433
return "Unknown"
434434
if n == 1:
435435
return "1"
436-
if 2 <= n <= 3:
437-
return "2-3"
438-
if 4 <= n <= 6:
439-
return "4-6"
440-
if 7 <= n <= 10:
441-
return "7-10"
442-
return "11+"
436+
if n == 2:
437+
return "2"
438+
if n == 3:
439+
return "3"
440+
if n == 4:
441+
return "4"
442+
if n >= 5:
443+
return "5+"
444+
return "Unknown"
443445

444446

445447
def save_count_data(

0 commit comments

Comments
 (0)