This is data I received from the Chicago Transit Authority through an Illinois FOIA request. It covers bus stop and passenger load data for all buses in the CTA system. It does not include rail data. Contact me Steven Lucy slucy@parallactic.com.
Data is as-is: The CTA included a letter with this data emphasizing that they do not know if this data is any good. It is gathered from automated counters and there may be malfunctions or obviously spurious data. It has not been vetted or reviewed by the CTA, just provided as-is under FOIA.
Big files: do not open these in a spreadsheet program, they are way too big. You will need to load them into a database.
A note about dates: Dates and times in this data require some careful attention. As far as I can tell, "survey date" means the date the whole bus trip is scheduled for, even if the trip spans a midnight. So a bus may stop on 1/15/2015 at 11:59pm and then next stop at 12:01am would still be listed as 1/15/2015 even though the stop actually happened on 1/16/2015. Similarly a bus run scheduled for 1/16/2015 at 12:05am that starts 10 minutes early due to operational reasons may show a stop at 11:55pm but still show the 1/16/2015 date, even though that stop actually happened on 1/15/2015. This has to be dealt with in any analysis of the data.
I received two files from the CTA under different FOIA requests, in slightly different formats. Here they are:
I wrote some quick scripts at https://github.com/slucy/cta-passenger-counts to read it into a database. If you write tools to load or analyze the data, please submit a pull request.
Sample data:
"SERIAL_NUMBER","ROUTE_NUMBER","DIRECTION_NAME","VEHICLE_NUMBER","SURVEY_DATE","STOP_ID","TIME_ACTUAL_ARRIVE","MAIN_CROSS_STREET","PASSENGERS_ON","PASSENGERS_OFF","PASSENGERS_IN","LATITUDE_ACTUAL","LONGITUDE_ACTUAL" "44748397","6.0","South","4183","10/1/2014","100661","12/30/1899 12:55:59 AM","WacCol","0","0","0","41.8882","-87.62159" "44751071","8.0","South","1065","10/1/2014","100182","12/30/1899 1:29:21 AM","RotHal","0","0","5.458","41.81784","-87.64574" "44748397","6.0","South","4183","10/1/2014","1983","12/30/1899 1:01:46 AM","WACKER (UPPER)/WABASH","3.03","0","8.08","41.88748","-87.62595" "44748397","6.0","South","4183","10/1/2014","17568","12/30/1899 1:23:17 AM","LAKE PARK/49TH STREET","0","0","45.466","41.80588","-87.58933" "44748397","6.0","South","4183","10/1/2014","4316","12/30/1899 1:25:43 AM","S HYDE PARK/5200 SOUTH","0","2.972","37.424","41.80091","-87.5845" "44748397","6.0","South","4183","10/1/2014","4053","12/30/1899 1:31:37 AM","STONY ISLAND/60TH STREET","0","0","27.334","41.78582","-87.58666" "44748397","6.0","South","4183","10/1/2014","17783","12/30/1899 1:38:41 AM","67TH/CHAPPEL","0","0","21.426","41.77339","-87.57541" "44748397","6.0","South","4183","10/1/2014","13691","12/30/1899 1:40:35 AM","67TH STREET/SOUTH SHORE","0","0","17.53","41.77351","-87.567" "44755423","8.0","South","1229","10/1/2014","10104","12/30/1899 11:43:06 PM","HALSTED/LINCOLN/FULLERTON","0","2.193","5.683","41.92576","-87.64883"
Sample data:
id, route, direction, vehicle number, survey date, actual time, passengers on, passengers off, passengers in, actual latitude, actual longitude --- 45703132,29,South,1847,01-03-2015,19:14:54,0,0,11.214,41.76878,-87.62619 45703132,29,South,1847,01-03-2015,18:59:51,0,0,11.652,41.82167,-87.62645 45703132,29,South,1847,01-03-2015,19:11:23,0,0,11.326,41.78563,-87.62553 45703132,29,South,1847,01-03-2015,19:07:05,0,2.193,10.367,41.79494,-87.62579 45703132,29,South,1847,01-03-2015,19:06:49,0,0,12.56,41.79485,-87.62578 45703132,29,South,1847,01-03-2015,19:04:41,0,0,12.56,41.80378,-87.62604 45703132,29,South,1847,01-03-2015,19:00:44,0,0,11.652,41.82019,-87.62643 45703132,29,South,1847,01-03-2015,19:00:24,0,0,11.652,41.82197,-87.62651 45703132,29,South,1847,01-03-2015,18:59:57,0,0,11.652,41.82336,-87.62658 45703132,29,South,1847,01-03-2015,18:58:34,1.015,0,11.652,41.82714,-87.62653