EIA-861 charting - existing? or preferred data retrieval? #4584
Hi, I'm interested in a simple web app to select and chart EIA-861 sales & revenue data. I haven't found one, are y'all aware of any? If so, I may just use that and the rest of this question is moot.

If there isn't one, I'd like to code one. I'd use Is that fine? Do you want me to cache for durations beyond what I wrote above? Do you prefer that I use the CSV or Parquet?

Or, do you want me to host my own copies of the PUDL data? This is less preferable to me, because then I need my own backend process to update my copy. I'd rather just code all this as a zero-maintenance web app.
Replies: 1 comment 3 replies
Hi @Lalo-ATX,

EIA has a dashboard interface for working with a bunch of their electricity data, if you're looking for something that already exists. You can select the EIA-861 sales data from one of the dropdowns.

If you want to build something more flexible, all of the PUDL data is available as a collection of Parquet files for programmatic use by the public through the AWS Open Data Registry, which provides us with a free S3 bucket at s3://pudl.catalyst.coop. For programmatic use, that's the best place to pull the data from. You can query it dynamically and efficiently pull only the subset of data you need with tools like pandas, polars, or DuckDB. Or read whole tables into memory all a…

Note that there are multiple versions of the data in that bucket. We have nightly builds that put fresh data under s3://pudl.catalyst.coop/nightly/, which you could query with DuckDB, for example:

SELECT * FROM
's3://pudl.catalyst.coop/nightly/core_eia861__yearly_sales.parquet'
WHERE YEAR(report_date) >= 2023
LIMIT 10

The CSVs that are available through the PUDL Viewer aren't meant for programmatic use; we primarily offer them so that spreadsheet users can still work with the data if they need to. You can see other code snippets in the PUDL data dictionary. See also the more extensive data access documentation.
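For anyone who wants to run the DuckDB query from the reply inside Python, here is a minimal sketch. It assumes the third-party `duckdb` package is installed and that its `httpfs` extension can read the bucket anonymously. The helper names `build_sales_query` and `fetch_rows` are made up for illustration and are not part of PUDL. Depending on your DuckDB version you may also need to set the S3 region.

```python
import re

# Bucket, release prefix, and table name are taken from the reply above.
PUDL_BUCKET = "s3://pudl.catalyst.coop"


def build_sales_query(min_year, limit=10, release="nightly",
                      table="core_eia861__yearly_sales"):
    """Build the DuckDB SQL from the reply for a given starting year.

    Hypothetical helper: only the query text itself comes from the thread.
    """
    # Restrict path components so user input cannot alter the SQL.
    if not re.fullmatch(r"[A-Za-z0-9_.\-]+", release):
        raise ValueError(f"unexpected release name: {release!r}")
    if not re.fullmatch(r"[a-z0-9_]+", table):
        raise ValueError(f"unexpected table name: {table!r}")
    url = f"{PUDL_BUCKET}/{release}/{table}.parquet"
    return (
        f"SELECT * FROM '{url}'\n"
        f"WHERE YEAR(report_date) >= {int(min_year)}\n"
        f"LIMIT {int(limit)}"
    )


def fetch_rows(query):
    """Run the query with DuckDB and return the rows as a list of tuples.

    Needs network access; duckdb is imported lazily so the query builder
    above still works without it.
    """
    import duckdb  # third-party dependency (assumption: installed)

    con = duckdb.connect()
    con.execute("INSTALL httpfs")  # extension that handles s3:// URLs
    con.execute("LOAD httpfs")
    return con.execute(query).fetchall()
```

Calling `fetch_rows(build_sales_query(2023))` should return up to ten rows of sales data from 2023 onward. Because Parquet is columnar, replacing `SELECT *` with only the columns a chart needs would reduce how much data is transferred.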