Clarify JSON annotation processing in fs.py #30

Open
yrrIsHere wants to merge 1 commit into facebookresearch:main from yrrIsHere:fix_fs
yrrIsHere commented Nov 29, 2025

Updated the comment to clarify that annotation files are multi-line JSON (JSONL, one JSON document per line) and adjusted the JSON loading to process them line by line.

Why?

Please refer to my issue: #28.

The annotation files claim to be JSON but are actually JSONL, which makes downloading them from S3 a little tricky.

How?

I treat the annotation files on S3 directly as JSONL files.
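For readers unfamiliar with the distinction, here is a minimal sketch of line-by-line JSONL parsing. It is not the code from this PR: the function name `parse_jsonl` is hypothetical, and it assumes each non-empty line holds one complete JSON document.

```python
import json
from typing import Any, Iterable, List


def parse_jsonl(lines: Iterable[str]) -> List[Any]:
    """Parse JSON Lines input: one JSON document per non-empty line.

    `parse_jsonl` is a hypothetical helper for illustration only.
    """
    records = []
    for line_number, line in enumerate(lines, start=1):
        line = line.strip()
        if not line:
            continue  # tolerate blank lines and a trailing newline
        try:
            records.append(json.loads(line))
        except json.JSONDecodeError as exc:
            # Report which line was malformed instead of failing opaquely.
            raise ValueError(f"invalid JSON on line {line_number}: {exc}") from exc
    return records
```

Calling `json.loads` on the whole file content fails with an "Extra data" error as soon as a second document follows the first, which is why a file in this format has to be read one line at a time. An open text file can be passed directly, since iterating over it yields lines.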

Test plan

After this change, download_s3.py at least works, including the functions download_single_example, download_interaction_pair, download_samples_1gb, and download_session_exploration.
