Skip to content

OSError: [Errno 22] Invalid argument forbidden character #7388

@langflogit

Description

@langflogit

Describe the bug

I'm on Windows and i'm trying to load a datasets but i'm having title error because files in the repository are named with charactere like < >which can't be in a name file. Could it be possible to load this datasets but removing those charactere ?

Steps to reproduce the bug

load_dataset("CATMuS/medieval") on Windows

Expected behavior

Making the function to erase the forbidden character to allow loading the datasets who have those characters.

Environment info

  • datasets version: 3.2.0
  • Platform: Windows-10-10.0.19045-SP0
  • Python version: 3.12.2
  • huggingface_hub version: 0.28.1
  • PyArrow version: 19.0.0
  • Pandas version: 2.2.3
  • fsspec version: 2024.9.0

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions