Atlanta Police Department’s online historical crime database has data from 1/1/2009 and is updated weekly. For this analysis I will examine all crime data posted on the Atlanta Police Department Open Data Portal from January 1, 2016 to April 7, 2017.
The data contains 35064 crimes and 23 variables, and there are some wrangling need to do, such as remove the columns I don’t need, omit ‘NA’s, convert the data to the right types.
Crimes have decreased in the recentl months.
The number of crimes increased around April, July and September 2016.
The crime time distribution appears bimodal with peaking around midnight and again at the noon, then again between 6pm and 8pm.
Downtown and midtown are the most common locations where crimes take place, followed by Old Fourth Ward and West End.
larceny theft are the top crimes in Atlanta followed by aggravated assault
What days and times are especially dangerous?
Among the high crime categories, larceny tend to increase on Fridays and Saturdays. while burglary residence generally occurred more often during the weekdays than the weekends. Auto theft were least reported on Thursdays and increase for the weekends.
The number of crimes in downtown increased in weekends and decreased on Tuesdays. Crime distribution is fairly even throughout the week for Midtown and Old Fourth Ward. North Buckhead reported least crimes on Sundays.
The top crimes exhibit different sinusoid time-interval patterns. Larceny from vehicle declined around 5am and peaked in the evening, Larceny-non vehicle peaked around 3pm, Auto-theft had a steady increase during the day and peaked in the evening, burglary-residence happened more often in the late morning than in the evening.
Plot a location map of Crimes in Atlanta using stats$_$denisty layer.
I want to plot the density of crime on a map of the area around downtown Atlanta. The first step is to get the map data, then create the map use the following:
The density areas can be interpreted as follows:
All the shaded areas together contain 3/4 of the top crimes in the data. Each shade represents 1/4 of the top crimes in the data. The smaller the area of a particular shade, the higher the crime density.
The End
Remember that we are seeing crime data here, not arrest data, It would be more meaningful if the original dataset contains arrest information. I would be interested to see where the arrests are happening.
Source code that created this post can be found here. I am happy to hear any feedback and questions.