Skip to content

Commit 4336304

Browse files
committed
A quick addition to the native api guide clarifying the limitations of the dirindex/wget recipe.
1 parent 039b3b7 commit 4336304

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

doc/sphinx-guides/source/api/native-api.rst

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1955,6 +1955,8 @@ An example of a ``wget`` command line for crawling ("recursive downloading") of
19551955
19561956
.. note:: In addition to the files and folders in the dataset, the command line above will also save the directory index of each folder, in a separate folder "dirindex".
19571957

1958+
.. note:: The recipe above does NOT work if the Dataverse instance serves the files via direct download (i.e., when instead of streaming the content, the download API issues redirect links to the files stored on S3). Unfortunately, this describes **every** file served by the Dataverse instance at Harvard, among others. With direct download, saving an entire dataset while preserving its folders structure is possible, but requires some extra scripting.
1959+
19581960
List All Metadata Blocks for a Dataset
19591961
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
19601962

0 commit comments

Comments
 (0)