Skip to content

Comments

MBTA ridership data#287

Merged
annehaley merged 9 commits intomasterfrom
mbta-ridership
Feb 23, 2026
Merged

MBTA ridership data#287
annehaley merged 9 commits intomasterfrom
mbta-ridership

Conversation

@annehaley
Copy link
Collaborator

This PR leverages the ingest_mbta custom dataset conversion to add ridership data to the MBTA Rapid Transit dataset. Ridership data was provided by @augustposch. This PR also makes a slight change to the Legend panel so that the ridership range will appear on either side of the color bar.

Note: To apply these changes, the MBTA Rapid Transit dataset must be overwritten. Re-run the ingest task with "action": "replace" added to the JSON specification for that dataset.

@annehaley
Copy link
Collaborator Author

Currently, the ridership data is not one-to-one with the existing network topology:

  • Several rows in the ridership data have no matches to existing network node names ("Back of the Hill", "Museum of Fine Arts", "Northeastern University", "Packard's Corner", "Saint Mary's Street", "Massachusetts Avenue")
  • Many network nodes have no ridership data (in screenshot below, nodes missing ridership data are shown in red)
    @augustposch I will need your help to resolve these missing matches before we merge. I suspect some are just slight name variations, but we may need to look for more of the missing data.

Screenshot below shows the MBTA Rapid Transit dataset with points colored by "total_ridership" (with colormap "greens" and null color red) and lines colored by feature props.
image

Base automatically changed from vector-feat-relationships to master February 17, 2026 15:04
@augustposch
Copy link
Collaborator

@annehaley I've resolved most of the discrepancies. As you mentioned, many come down to small name variations.

Back of the Hill ... Back Of The Hill
Museum of Fine Arts ... Museum Of Fine Arts
Northeastern University ... Northeastern
Packard's Corner ... Packards Corner
Saint Mary's Street ... Saint Marys Street
Massachusetts Avenue ... Massachusetts Ave

Check your matching algorithm for these stations which we have all the data for: Chestnut Hill (not Chestnut Hill Avenue), Quincy Center (not Quincy Adams), BU East (not BU Central).

There are two more I see on your screenshot where we have all the data so I'm not sure what the issue is: Sutherland Road, Haymarket.

Aside from the above, it looks like all the missing stations come from the Red Line streetcar portion (riders actually change vehicles there) and the Silver Line (which is physically buses even though advertised as subway-equivalent).

Red Line streetcar: Cedar Grove, Butler, Milton, Central Avenue, Valley Road, Capen Street, Mattapan
Silver Line: Temple Pl @ Washington St, Tremont St @ Boylston Station, Nubian, Melnea Cass Blvd, Lenox St, Mass. Ave, Worcester Street, West Newton Street, Union Park Street, East Berkeley Street, Herald Street, Chinatown Gate, Essex St @ Atlantic Ave, 23 Dry Dock Ave, 27 Dry Dock Ave, Black Falcon Ave, 88 Black Falcon Ave, Design Center, Tide St, Harbor St, Northern Ave, World Trade Center, Courthouse, South Station, Airport, Airport Terminal E, Airport Terminal C, Airport Terminal B2, Airport Terminal B1, Airport Terminal A, Silver Line Way

I'll look for Red Line streetcar data and Silver Line data - hopefully there's a dataset we can integrate with what we've already done.

@annehaley
Copy link
Collaborator Author

@augustposch Thanks for identifying the name discrepancies. I should have anticipated that a case-insensitive match would be appropriate for this. I also removed apostrophes from the station names and applied some custom replacements before looking for matches.

Besides the known missing stations along the silver line and red line streetcar, the remaining discrepancies were actually due to a bug with the way we generate features for a network. Some node features were getting created more than once, so the duplicates were not getting the ridership data saved. I fixed this bug in faed8d5.

Here's what the ridership visualization looks like now:
image

If this looks good to you, we can merge this version and work on the other missing data later.

Copy link
Collaborator

@augustposch augustposch left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

New screenshot looks good. Glad you identified all the issues and fixed these! I'll look into the silver line and red line streetcar data this week.

@cloudflare-workers-and-pages
Copy link

cloudflare-workers-and-pages bot commented Feb 23, 2026

Deploying geoinsight with  Cloudflare Pages  Cloudflare Pages

Latest commit: 45991ab
Status: ✅  Deploy successful!
Preview URL: https://8c76ab6f.geoinsight.pages.dev
Branch Preview URL: https://mbta-ridership.geoinsight.pages.dev

View logs

@annehaley annehaley merged commit 7cf53c5 into master Feb 23, 2026
3 checks passed
@annehaley annehaley deleted the mbta-ridership branch February 23, 2026 19:35
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants