Thanks for open-sourcing such a great dataset.
But I encountered two issues while working with it:
Android dataset (20250328) — I’m unable to reconstruct and use the dataset. In particular, android.tar.gz.part-000 is only 209 Bytes, which seems unusually small.