Project logistics
- Mentors: Shreyas Dube
- Min-max team size: 3-5
- Expected project hours per week (per team member): 8 hours
- Will the project be open source? Yes - Apache like license
Preferred past experience
- Java
- Basic data visualization concepts, e.g.: picking the right representations. No 3D pie charts please.
- Familiarity with database management systems, e.g. MySQL and data modeling techniques.
- Familiarity with either AWS or Google Cloud (students can pick one platform).
Project Overview
This project was inspired by http://mbtaviz.github.io/ and https://mbta.meteor.com/. The goal of this project is to analyze and visualize, in real time, patterns that cause failures in the MBTA subway system. The ultimate goal is to provide riders with useful data to help them plan their commute. For example, how much time does it generally take to fix a “Switch problem” on the red line, or how does the system as a whole perform, or which is the best and the worst line. Requirements:
- The system shall use MBTA-realtime APIs as the primary data source.
- The system should be scalable and be able to ingest failure data incrementally, multiple times a day.
- The user interface should be interactive and encourage users to explore data in meaningful ways.
Some Technologies you will learn/use:
- Java
- Data visualization (perhaps using d3.js)
- Consuming and developing REST APIs.
- JSON, XML
- Either Amazon Web Services or Google Cloud Platform (App Engine, Bigtable, Endpoints).