This project explores the relationship between traffic collisions and various factors in San Diego County, focusing on data from 2015 to 2019. We investigate the correlation between collision locations and popular nightlife areas, temporal patterns of accidents, and potential police biases in traffic stops.
- Research Questions
- Hypotheses
- Datasets
- Methods
- Key Findings
- Ethical Considerations
- Limitations
- Team Members
- Acknowledgements
- What are the most common types of traffic collisions in San Diego County?
- Is there a relationship between high bar density areas and traffic collision frequency?
- Which police beats and geographic divisions experience the most severe accidents?
- Are there any demographic biases in police traffic stops?
- Minor, non-fatal accidents will be most prevalent.
- More collisions will occur near nightlife hotspots (e.g., Pacific Beach, Gaslamp).
- Lower-income neighborhoods will experience more severe accidents.
- Younger drivers will be stopped and questioned more frequently.
- Traffic Collisions (2015-2019): 28,122 observations
- Source: San Diego Data Portal
- Police Stops (2018-2019): 179,725 observations
- Source: San Diego Data Portal
- Yelp Bars: 50 observations
- Source: Yelp API
- Yelp Clubs: 49 observations
- Source: Yelp API
- Geospatial analysis of collision locations relative to nightlife areas
- Temporal analysis of collision frequency by time and day
- Demographic analysis of police stops
- Statistical testing of hypotheses
- Most common violations: Traffic signal and sign violations
- Highest collision frequency: Pacific Beach (1500 collisions)
- Severe accidents: Northwestern San Diego (highest average injuries), Southern San Diego (highest average fatalities)
- Demographics: Younger people stopped more frequently and for longer durations
- Implemented Safe Harbour protocol to protect individual privacy
- Careful interpretation of results to avoid reinforcing stereotypes or biases
- Consideration of socioeconomic factors in analyzing collision patterns
- Incomplete bar and nightclub data from Yelp API
- Overlapping violation categories in the dataset
- Broad geographic divisions may obscure local patterns
- Limited timeframe (2015-2019) may not capture long-term trends
- Sarah Amiraslani (@SarahAmiraslani)
- Paul Chu (@paulchu54)
- Catherine Eng (@ceeng)
- Jose Jimenez (@JoseJimenezJr019)
- Erin Park (@eyp012)
- Alysa Quijada (@alysa-quijada)
This project was completed as part of COGS 108: Data Science in Practice at the University of California, San Diego. We thank our instructors and the San Diego Data Portal for providing resources and data.
For more information on San Diego police beats, visit the San Diego Police Department website.