-
Notifications
You must be signed in to change notification settings - Fork 10
Open
Description
Hi, I downloaded the chunked PMC dataset with link http://nlp.dmis.korea.edu/projects/selfbiorag-jeong-et-al-2024/data/retriever/PMC_128.tar.gz
I found that there are files
PMC_128_Abs_Articles.json PMC_128_Main_Articles.json
PMC_128_Abs_Embeds.npy PMC_128_Main_Embeds.npy
PMC_128_idx_array.npy
I assume that contain's everything?
For the other small files under PMC_128_temporary, they should be the same as the above merged file?
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels