Skip to content
This repository was archived by the owner on Sep 10, 2025. It is now read-only.

Commit b5e2a1b

Browse files
authored
Update links to multi30k dataset since original servers are down (#2194)
* Update multi30k.py * Update test dataset hash
1 parent 60bea66 commit b5e2a1b

File tree

1 file changed

+5
-4
lines changed

1 file changed

+5
-4
lines changed

torchtext/datasets/multi30k.py

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -11,16 +11,17 @@
1111
_create_dataset_directory,
1212
)
1313

14+
# TODO: Update URL to original once the server is back up (see https://github.com/pytorch/text/issues/1756)
1415
URL = {
15-
"train": "http://www.quest.dcs.shef.ac.uk/wmt16_files_mmt/training.tar.gz",
16-
"valid": "http://www.quest.dcs.shef.ac.uk/wmt16_files_mmt/validation.tar.gz",
17-
"test": "http://www.quest.dcs.shef.ac.uk/wmt16_files_mmt/mmt16_task1_test.tar.gz",
16+
"train": r"https://raw.githubusercontent.com/neychev/small_DL_repo/master/datasets/Multi30k/training.tar.gz",
17+
"valid": r"https://raw.githubusercontent.com/neychev/small_DL_repo/master/datasets/Multi30k/validation.tar.gz",
18+
"test": r"https://raw.githubusercontent.com/neychev/small_DL_repo/master/datasets/Multi30k/mmt16_task1_test.tar.gz",
1819
}
1920

2021
MD5 = {
2122
"train": "20140d013d05dd9a72dfde46478663ba05737ce983f478f960c1123c6671be5e",
2223
"valid": "a7aa20e9ebd5ba5adce7909498b94410996040857154dab029851af3a866da8c",
23-
"test": "0681be16a532912288a91ddd573594fbdd57c0fbb81486eff7c55247e35326c2",
24+
"test": "6d1ca1dba99e2c5dd54cae1226ff11c2551e6ce63527ebb072a1f70f72a5cd36",
2425
}
2526

2627
_PREFIX = {

0 commit comments

Comments
 (0)