Commit dc4191f

make search query more specific
1 parent 0d8df03 commit dc4191f

1 file changed, +1 -1 lines changed

scripts/1-fetch/internetarchive_fetch.py

Lines changed: 1 addition & 1 deletion
@@ -303,7 +303,7 @@ def query_internet_archive(args):
     unmapped_language_counter = Counter()
 
     fields = ["licenseurl", "language"]
-    query = "creativecommons.org"
+    query = "licenseurl:*creativecommons.org*"
     license_mapping = load_license_mapping()
 
     rows = 1000000
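
For illustration only, here is a minimal sketch of how the old and new query strings would differ when sent to a search endpoint. It assumes the public Internet Archive advanced-search URL and its `q`, `fl[]`, `rows`, and `output` parameters; the script itself may use a client library instead. `build_search_url` and `SEARCH_ENDPOINT` are hypothetical names, and the small `rows` value is chosen only to keep the example short. No request is sent.

```python
from urllib.parse import urlencode, urlparse, parse_qs

# Assumption: the public advanced-search endpoint. The fetch script may
# reach the Internet Archive through a client library rather than this URL.
SEARCH_ENDPOINT = "https://archive.org/advancedsearch.php"


def build_search_url(query, fields, rows):
    """Hypothetical helper: assemble a search URL without sending it."""
    params = [("q", query)]
    # Each requested field is passed as a repeated fl[] parameter.
    params += [("fl[]", field) for field in fields]
    params += [("rows", rows), ("output", "json")]
    return f"{SEARCH_ENDPOINT}?{urlencode(params)}"


fields = ["licenseurl", "language"]

# Before the commit: a bare term, not tied to any particular field.
old_url = build_search_url("creativecommons.org", fields, 50)

# After the commit: the term is scoped to the licenseurl field, with
# wildcards on both sides so any URL containing the domain matches.
new_url = build_search_url("licenseurl:*creativecommons.org*", fields, 50)

print(old_url)
print(new_url)
```

The field-scoped form should exclude items that merely mention the domain elsewhere in their metadata (for example in a description), which is presumably the intent behind "more specific" in the commit message.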
