Overview
Work collaboratively with MBTA staff to:
- use MBTA data sets to explore mass transit use in Greater Boston and
- identify patterns, insights, and/or new uses for data that can support enhanced MBTA service delivery.
Specifically, you will
- ingest a data set containing real-time vehicle location data from the MBTA and
- analyze it to understand the impact of a recent change in vehicle routing on service delivery.
Problem
Use MBTA-provided data to generate a comparison of on-time bus performance before and after the implementation of significant MBTA service changes that went into effect over the course of spring and summer 2013 (specific dates, by route, are available).
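As a rough illustration of the comparison described above, the sketch below splits per-route observations at each route's service-change date and reports the share that were on time in each period. The record layout `(route, service_date, on_time)`, the function name `on_time_share`, and the idea that `on_time` has already been decided under some agreed definition are all assumptions made for illustration; they are not taken from the MBTA data set.

```python
from datetime import date


def on_time_share(observations, change_dates):
    """Return {route: {"before": share, "after": share}}.

    observations: iterable of (route, service_date, on_time) tuples, where
        on_time is a bool computed under whatever on-time definition is adopted.
    change_dates: {route: date the service change took effect} (hypothetical input).
    A share is None when a period has no observations.
    """
    counts = {}
    for route, service_date, on_time in observations:
        change = change_dates.get(route)
        if change is None:
            continue  # no service change recorded for this route
        period = "before" if service_date < change else "after"
        bucket = counts.setdefault(route, {"before": [0, 0], "after": [0, 0]})[period]
        bucket[0] += 1 if on_time else 0  # on-time observations
        bucket[1] += 1                    # all observations
    return {
        route: {p: (hits / total if total else None) for p, (hits, total) in periods.items()}
        for route, periods in counts.items()
    }
```

Observations dated on or after the change date are counted as "after"; whether the changeover day itself belongs in either period is a choice the team would need to settle with the MBTA.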
Project Tiers
The project will use a combination of SmartBusMart and other tools to read the data set, extract information relevant to the performance metrics used by the MBTA, and compute the metrics for the time periods surrounding the service change. (MBTA’s reporting database, SmartBusMart, can ingest and analyze time-series data.)
- Ingestion tier - pulls information from the data set into a data store for analysis
- Processing tier - reads data from the data store and computes performance metrics
- MBTA’s on-time performance standard is defined in a public policy document, the MBTA Service Delivery Policy
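The two tiers could be prototyped along the following lines. This is a minimal sketch, not the SmartBusMart workflow: it uses SQLite as a stand-in data store, assumes a hypothetical CSV layout (`route`, `stop`, `scheduled_s`, `actual_s`, with times in seconds), and uses a placeholder tolerance of 300 seconds. The real on-time definition must come from the MBTA Service Delivery Policy.

```python
import csv
import io
import sqlite3

# Placeholder tolerance in seconds (an assumption); the actual standard is
# defined in the MBTA Service Delivery Policy.
ON_TIME_TOLERANCE_S = 300


def ingest(csv_text, conn):
    """Ingestion tier: load rows from CSV text into a SQLite table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS arrivals "
        "(route TEXT, stop TEXT, scheduled_s INTEGER, actual_s INTEGER)"
    )
    rows = [
        (r["route"], r["stop"], int(r["scheduled_s"]), int(r["actual_s"]))
        for r in csv.DictReader(io.StringIO(csv_text))
    ]
    conn.executemany("INSERT INTO arrivals VALUES (?, ?, ?, ?)", rows)
    conn.commit()
    return len(rows)


def on_time_by_route(conn, tolerance_s=ON_TIME_TOLERANCE_S):
    """Processing tier: share of arrivals within the tolerance, per route."""
    cur = conn.execute(
        "SELECT route, AVG(ABS(actual_s - scheduled_s) <= ?) "
        "FROM arrivals GROUP BY route ORDER BY route",
        (tolerance_s,),
    )
    return dict(cur.fetchall())
```

Keeping the tolerance as a parameter means the same query can be rerun once the policy's actual thresholds are known, without touching the ingestion step.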
Dataset
Detailed Historical Bus Vehicle Positioning Information: Four years’ worth of historical bus location information from the MBTA’s SmartBusMart system.
The data sets are stored at the Massachusetts Green High Performance Computing Center (MGHPCC). Information on how to access them will be available at the beginning of the project.
Project logistics
- Mentors:
  - Dominick Tribone, Special Assistant for Strategic Initiatives at the MBTA (email: dtribone at MBTA dot com)
  - Dave Barker, Manager of Operations Technology at the MBTA (email: DBarker at MBTA dot com)
  - Christopher Scranton, Senior Manager for Big Data and Technology Initiatives at the Massachusetts Technology Collaborative (email: scranton at masstech dot org)
  - Ata Turk (email: ataturk at bu dot edu)
- Min-max team size: 4-6
- Expected project hours per week (per team member): 6-8
- Will the project be open source? Yes
Preferred past experience
- General Transit Feed Specification (GTFS)
- GIS systems
- Map/reduce
Background
Several years ago, the MBTA started a project to install the now-familiar signs that tell travelers when the next bus or train will arrive. During the early stages of the project, the MBTA also made the real-time vehicle information available to smartphone app developers, posting a set of public data feeds and running an app development contest. As a result, Boston became the first city in the nation with a set of smartphone apps providing an instant look at real-time bus and train arrival updates.
From this project, the MBTA has accumulated several years of data showing the minute-by-minute position of dozens of buses and trains, along with traffic congestion measures, service alerts, weather data, and other pertinent information. Project participants are confident that the data holds the answers to some important questions about how to make mass transit in Greater Boston more efficient and convenient. This project is your chance to help the MBTA find out.
During the course of the project, you will work with mentors from the MBTA and the Massachusetts Technology Collaborative to identify questions to answer using the data, assemble the information and tools needed to come up with answers, and perform the analysis.