gill-0/Wrangle-OpenStreetMap-Data

Wrangle-OpenStreetMap-Data

Introduction

The goal of this project was to extract data from https://www.openstreetmap.org (an open, community-maintained map database, similar in purpose to Google Maps) for an area of the world I am interested in. I then assessed the quality of the data for validity, accuracy, completeness, consistency, and uniformity. Finally, I imported the data into a SQL database for further querying and auditing.

Data Extraction, Cleaning, and Database Queries

The bl.ocks page linked below demonstrates the data extraction and cleaning process, as well as the database queries.

http://bl.ocks.org/gill-0/raw/9c92a05cc3492a6e4985e3ce2565e801/

Files

Audit different fields in OSM/XML file

audit.py

Modify incorrect fields in OSM file and export to CSV files

data.py

Import CSV files into database

import_csv.py

Query and edit incorrect fields in database

query_functions.py

Database schema provided by Udacity

schema.py

SQL queries to explore database

sql_query.py

Discover structure and number of tags in OSM file

map_parser.py

High-level audit of tags for types of potential problems

tags.py

Function to discover number of users

users.py

Smaller OSM/XML file used for testing

sample.osm
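
Example

To give a rough idea of the kind of work the scripts above do, here is a minimal sketch in Python. It is not the code from this repository: the function names (`count_tags`, `unique_users`, `load_nodes`), the tiny inline OSM extract, and the simplified `nodes` table are all assumptions made up for illustration, and the real `schema.py` and `sample.osm` will differ. It streams the XML with `xml.etree.ElementTree.iterparse`, counts element types, collects contributor ids, and loads nodes into an in-memory SQLite database.

```python
import io
import sqlite3
import xml.etree.ElementTree as ET
from collections import Counter

# Tiny made-up OSM extract standing in for sample.osm (illustrative data only).
SAMPLE_OSM = b"""<?xml version="1.0" encoding="UTF-8"?>
<osm version="0.6">
  <node id="1" lat="10.0" lon="20.0" user="alice" uid="100">
    <tag k="addr:street" v="Main St"/>
  </node>
  <node id="2" lat="10.1" lon="20.1" user="bob" uid="200"/>
  <way id="3" user="alice" uid="100">
    <nd ref="1"/>
    <nd ref="2"/>
    <tag k="highway" v="residential"/>
  </way>
</osm>
"""


def count_tags(source):
    """Return a Counter mapping each element name to how often it occurs."""
    counts = Counter()
    # "start" events are enough here: we only need the element name.
    for _, elem in ET.iterparse(source, events=("start",)):
        counts[elem.tag] += 1
    return counts


def unique_users(source):
    """Return the set of contributor uids found on any element."""
    users = set()
    for _, elem in ET.iterparse(source, events=("start",)):
        uid = elem.get("uid")
        if uid is not None:
            users.add(uid)
    return users


def load_nodes(source, conn):
    """Insert every <node> into a simplified, hypothetical nodes table."""
    conn.execute(
        "CREATE TABLE nodes (id INTEGER PRIMARY KEY, lat REAL, lon REAL, user TEXT)"
    )
    rows = [
        (int(e.get("id")), float(e.get("lat")), float(e.get("lon")), e.get("user"))
        for _, e in ET.iterparse(source, events=("start",))
        if e.tag == "node"
    ]
    conn.executemany("INSERT INTO nodes VALUES (?, ?, ?, ?)", rows)
    conn.commit()
    return len(rows)


if __name__ == "__main__":
    print(count_tags(io.BytesIO(SAMPLE_OSM)))
    print(len(unique_users(io.BytesIO(SAMPLE_OSM))), "unique users")
    conn = sqlite3.connect(":memory:")
    print(load_nodes(io.BytesIO(SAMPLE_OSM), conn), "nodes loaded")
```

To try the same functions on a real extract, pass an open file (for example `open("sample.osm", "rb")`) instead of the `io.BytesIO` wrapper. Streaming with `iterparse` avoids building the whole tree at once, which helps with large OSM files.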
