Conversation

TheLazzziest
Contributor

@TheLazzziest TheLazzziest commented Aug 24, 2025

Hi there!

I am currently working on a feature in dlt that leverages fsspec extensively. The library's filesystem source lacks native support for standard HTTP endpoints. Many public data sources, such as those hosted on CDNs, are only accessible via HTTP, as demonstrated by this example: NYC trip data: Cloudfront.

To support such public resources, dlt must know a resource's last modification date, which a CDN can provide in a standardized format, e.g. RFC 1123. fsspec can then parse that value and expose the date in turn.
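
For illustration, here is a minimal sketch of how an RFC 1123 date string could be turned into a timezone-aware datetime using only Python's standard library. The helper name `parse_last_modified` and the sample value are assumptions made up for this example; this is not the code from this pull request.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime


def parse_last_modified(value: str) -> datetime:
    """Parse an RFC 1123 date string into an aware UTC datetime.

    Hypothetical helper for illustration only.
    """
    dt = parsedate_to_datetime(value)
    # Some inputs (e.g. a "-0000" offset) yield a naive datetime;
    # treat those as UTC so callers always get an aware value.
    if dt.tzinfo is None:
        dt = dt.replace(tzinfo=timezone.utc)
    return dt.astimezone(timezone.utc)


# Sample value in the RFC 1123 format (assumed, not taken from a real response).
example = "Wed, 21 Oct 2015 07:28:00 GMT"
parsed = parse_last_modified(example)
print(parsed.isoformat())
```

Normalizing to UTC keeps comparisons between resources simple, since HTTP dates in this format are expressed in GMT.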

@martindurant
Member

I am happy to include this. Please post a link to the RFC for posterity (I was not aware of it).

@TheLazzziest
Contributor Author

TheLazzziest commented Aug 26, 2025

Hi @martindurant! Thank you for your quick feedback. I can, but it seems that the original document is quite confusing.

Would you mind if I reference the document this way: RFC 1123 -> RFC 822, Section 5?

That should make it more obvious where the actual definition comes from.

In addition to that, I used this resource for a more distilled explanation. Let me know if that works.

@martindurant
Member

Thanks for the links, that will do nicely. I'll try to see why the CI test is reporting filenotfound for a URL that clearly does exist.

This is also my first time finding out about dlt, which I have only briefly glanced at so far. It seems to have some ideas in common with intake.

@martindurant
Member

It turns out the failure was temporary.

@martindurant martindurant merged commit 4e5f768 into fsspec:master Aug 29, 2025
28 of 30 checks passed
2 participants