Skip to content

quironaquariano/classification-accident-severity

Repository files navigation

Coursera_Capstone

This project is part of the IBM Data Science Professional Certification provided by Coursera and its main objective is to apply data science concepts to analyze a real-world scenario.

1.Introduction

One of the current concerns of the Seattle Department of Transportation (SDT) is to find solutions that can minimize the number of traffic accidents, as well as deaths, injuries and damage from traffic accidents, in this context, all relevant information about the occurrence of accidents are registered and maintained by the department for access by all researchers. This data is necessary for the planning and implementation of countermeasures, operational control and for evaluating road safety programs and improvements.

Accidents have different severities and the implementation of countermeasures must be able to take this aspect into account in order to prioritize their projects, therefore, the process of classifying the severity of accidents must be carried out as accurately as possible. This project intends to analyze all collisions registered since 2004, in order to build a Machine Learning Model to predict the classification of the severity of new accidents based on their characteristics with 100% accuracy

1.2.Interest

The department may have more accurate and automatic accident classification and, consequently, programs that receive grant funding in the Seattle region have benefited from this information to define their action plan. Citizens of the city may be interested in wanting to know under what circumstances they are most vulnerable to getting involved in car collisions.

2.Data Understanding

The data set used in this project is available in a comma-separated values (CSV) file format and has been downloaded from Seattle Open GeoData Portal and includes all types of collisions since 2004 to Present. The dataset metadata was found at Department of Transportation Seattle. The dataset contains 221738 rows/registers and 40 columns/fields. The dataset contains columns with 3 diferent type of values, float64, object and Int64.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published