The goal of this project was to extract data from https://www.openstreetmap.org (an open, community-maintained map database similar to Google Maps) for an area of the world I am interested in. I then assessed the quality of the data for validity, accuracy, completeness, consistency, and uniformity. Finally, I imported the data into a SQL database for further querying and auditing.
The bl.ocks link below demonstrates the data extraction and cleaning process, as well as the database queries.
http://bl.ocks.org/gill-0/raw/9c92a05cc3492a6e4985e3ce2565e801/
Audit different fields in OSM/XML file
audit.py
Modify incorrect fields in OSM file and export to CSV files
data.py
Import CSV files into database
import_csv.py
Query and edit incorrect fields in database
query_functions.py
Database schema provided by Udacity
schema.py
SQL queries to explore database
sql_query.py
Discover structure and number of tags in OSM file
map_parser.py
High-level audit of tags for types of potential problems
tags.py
Function to discover number of users
users.py
Smaller OSM/XML file used for testing
sample.osm
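
To give a feel for what the structure-discovery and user-counting steps listed above involve, here is a minimal sketch. It is not the code from map_parser.py or users.py: the function name `count_tags_and_users`, the inline `SAMPLE_OSM` data, and the choice of the `uid` attribute to identify users are all assumptions made for illustration. It streams through an OSM/XML document with `xml.etree.ElementTree.iterparse`, counts how often each element type appears, and collects the distinct user ids.

```python
import io
import xml.etree.ElementTree as ET
from collections import Counter

# Tiny made-up OSM-like document (hypothetical data, not taken from sample.osm).
SAMPLE_OSM = """<?xml version="1.0" encoding="UTF-8"?>
<osm version="0.6">
  <node id="1" lat="0.0" lon="0.0" user="alice" uid="10">
    <tag k="addr:street" v="Main St"/>
  </node>
  <node id="2" lat="0.1" lon="0.1" user="bob" uid="20"/>
  <way id="3" user="alice" uid="10">
    <nd ref="1"/>
    <nd ref="2"/>
    <tag k="highway" v="residential"/>
  </way>
</osm>
"""


def count_tags_and_users(source):
    """Return (Counter of element names, set of uid strings) for an OSM/XML source.

    `source` may be a filename or a binary file-like object.
    """
    tag_counts = Counter()
    user_ids = set()
    # "end" events fire once an element is fully parsed, so its attributes
    # are complete and it can be cleared afterwards to keep memory use low.
    for _, elem in ET.iterparse(source, events=("end",)):
        tag_counts[elem.tag] += 1
        uid = elem.get("uid")  # assumption: users are identified by "uid"
        if uid is not None:
            user_ids.add(uid)
        elem.clear()
    return tag_counts, user_ids


if __name__ == "__main__":
    counts, users = count_tags_and_users(io.BytesIO(SAMPLE_OSM.encode("utf-8")))
    for name, n in sorted(counts.items()):
        print(name, n)
    print("unique users:", len(users))
```

Passing a path such as sample.osm instead of the in-memory buffer should work the same way, since `iterparse` accepts either a filename or a file object. Streaming is used here because full OSM extracts can be too large to load into memory at once.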