diff --git a/docs/hub/datasets-adding.md b/docs/hub/datasets-adding.md index ae55c09a7..5f918ac4c 100644 --- a/docs/hub/datasets-adding.md +++ b/docs/hub/datasets-adding.md @@ -85,6 +85,7 @@ The Hub natively supports multiple file formats: - Text (.txt) - Images (.png, .jpg, etc.) - Audio (.wav, .mp3, etc.) +- PDF (.pdf) - [WebDataset](https://github.com/webdataset/webdataset) (.tar) It supports files compressed using ZIP (.zip), GZIP (.gz), ZSTD (.zst), BZ2 (.bz2), LZ4 (.lz4) and LZMA (.xz).