catalog · 12

Datasets

Sources for the stories above. Each dataset has curated views, structured filters, and a playground for arbitrary queries.

OpenDataPhilly · Carto 5,773,898 rows

Philly 311 service requests

Live snapshot of the City of Philadelphia's 311 requests, queried directly against phl.carto.com via our Cloudflare Worker proxy. Five tabs, real SQL, real maps.

livecivicincidentsgeosql
OpenDataPhilly · Carto 1,451,562 rows

L&I Violations

License & Inspections violation notices issued to Philadelphia properties — the enforcement side of 311. Carto snapshot covering 2007 through March 2020. Join with 311 data to ask: does calling correlate with action?

historicalcivicenforcementgeosql
PFD · ArcGIS Feature Service 128,491 rows

Philadelphia Fire Department incidents

Every PFD dispatch since 2024-01-01 — false alarms, EMS assists, hazmat, and the ~14% that are actual fires. Quarterly updates from the city's stat360_fire_incidents layer, queried through our ArcGIS proxy.

livesafetyincidentsgeo
NYC Open Data · Socrata 24,000,000 rows

NYC 311 service requests

Live snapshot of New York City's 311 service requests, twenty-four million rows from 2010 to today. Queried via SODA v3 against data.cityofnewyork.us through our Cloudflare worker proxy. Noise dominates; less than 2% is what most people would call 'social disorder.'

livecivicincidentsgeosql
NYC TLC · Parquet · DuckDB WASM 1,500,000,000 rows

NYC TLC taxi trip records

One-and-a-half billion yellow / green / FHV trips since 2009. Stories use build-time DuckDB aggregates. The Playground tab runs DuckDB WASM in the browser — ad-hoc SQL against remote Parquet, no server required.

livemobilityparquetlargeduckdb
DCP · NYC Open Data · Socrata 860,000 rows

PLUTO — every NYC tax lot

The Department of City Planning's Primary Land Use Tax Lot Output. ~860K tax lots, ~70 fields each — zoning district, land use, building class, year built, residential units, assessed value. The substrate beneath nearly every quantitative urban-policy paper written about NYC.

liveland-usegeosql
DOHMH · NYC Open Data · Socrata 400,000 rows

NYC Restaurant Inspections

Every sustained violation issued to every food establishment by the Department of Health and Mental Hygiene. One row per violation per inspection. The grade card hung in your favorite spot's window comes from this dataset — and the famous 1900-01-01 placeholder dates.

livehealthincidentssql
DEP · NYC Open Data · Socrata

NYC Lead Service Line Inventory

Per-property classification of which NYC buildings are still served by lead pipes. Published per the EPA's 2024 Lead and Copper Rule Improvements. The headline isn't the lead count — it's the staggering "Unknown" classification, the public-health data void at the heart of the city's 2037 replacement deadline.

liveenvironmenthealthgeosql
HPD · NYC Open Data · Socrata

HPD Maintenance Code Violations

Every Housing Maintenance Code violation issued by HPD. Joins to PLUTO via BBL. The substrate beneath every "worst landlord" feature, the join key for tenant-advocacy tools that pierce LLC corporate-veil opacity to identify serial offenders.

livehousingenforcementgeosql
HPD · NYC Open Data · Socrata

HPD Affordable Housing Production

Every affordable housing project the city has financed under Housing New York and successor programs. Unit counts segmented by AMI band — what "affordable" actually means depends on which bands you include in the headline.

livehousingsql
MTA · NYS Open Data · Socrata

MTA Subway Origin-Destination

The MTA's algorithmic reconstruction of where 4M daily subway riders actually go. Turnstiles only capture entries; exits are probabilistically inferred from each rider's next entry. The cleanest public view of NYC's transit circulatory system.

livemobilitysql
OTI · NYC (planned)

NYC LL35 Algorithmic Tools Report

The city's annual algorithmic-tools disclosure required by Local Law 35 of 2021. AI/ML systems used by city agencies that affect rights, liberties, benefits, or safety — including the controversial ACS predictive risk scores that prompted the GUARD Act response. Static editorial; no live adapter.

plannededitorialgovernance