Gathering tasks for COEP hackathon, 2 Feb 2019 #20
Measuring the tree canopy of Pune using satellite data
Ref: https://www.citylab.com/environment/2018/12/urban-tree-canopy-maps-artificial-intelligence-descartes-labs/578701/ |
Public transport data: animation, visualization, analysis
For visualization: RunParticles: http://renderfast.com/RunParticles/
Analysis: find the traffic choke-points, busy corridors, less-frequented stops, etc.
Possible datasets: Pune bus static GTFS, realtime logs. We will have a large dataset of GPS logs of select bus routes in Pune, which will be released a few days before the event.
Links:
|
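One of the analysis ideas above, finding less-frequented stops, can be sketched from a static GTFS feed alone. This is a minimal sketch, assuming the feed has a standard `stop_times.txt` with a `stop_id` column; the sample rows and the helper names `visits_per_stop` and `least_frequented` are made up for illustration and are not real Pune data.

```python
import csv
import io
from collections import Counter


def visits_per_stop(stop_times_file):
    """Count scheduled stop events per stop_id in a GTFS stop_times table."""
    reader = csv.DictReader(stop_times_file)
    return Counter(row["stop_id"] for row in reader)


def least_frequented(counts, n=5):
    """Return the n stops with the fewest scheduled visits, least first."""
    return sorted(counts.items(), key=lambda item: (item[1], item[0]))[:n]


# Made-up sample in stop_times.txt layout (assumption, not real data).
SAMPLE = """trip_id,arrival_time,departure_time,stop_id,stop_sequence
T1,08:00:00,08:00:00,S1,1
T1,08:10:00,08:10:00,S2,2
T2,09:00:00,09:00:00,S1,1
T2,09:12:00,09:12:00,S3,2
T3,10:00:00,10:00:00,S1,1
"""

counts = visits_per_stop(io.StringIO(SAMPLE))
quiet_stops = least_frequented(counts, n=2)
```

With a real feed, the `io.StringIO(SAMPLE)` argument would be replaced by an opened `stop_times.txt` file. Scheduled visits are only a proxy for how busy a stop is; the GPS logs would be needed to look at actual choke-points.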
PMJDY data analysis
See this discussion thread: https://groups.google.com/d/msg/datameet/ErNY82gA7dw/TOmnF7dLFQAJ
Here is an example of data analysis done on it two years ago; we can take it further, make animated / interactive visualizations, etc.: https://zenodo.org/record/263919#.XCWWT99fjZs
Use this Zenodo page for citations: https://zenodo.org/record/1410405#.XCWYEN9fjZs |
Pune tree census data analysis, comparison
Get the datasets from here: http://nikhilvj.co.in/files/trees/
This is draft data, and they have requested feedback on it, so the dataset should be analysed for anomalies, etc. |
OpenStreetMap mapping: rural roads in Maharashtra
https://tasks.teachosm.org/contribute?difficulty=ALL&organisation=datameet |
Scrape data from the PMC STP (sewage treatment plant) app
The department staff do not have the raw data collected at their end; the app-based system was set up by a vendor who is no longer around. They have asked the Open Data Portal team to extract the data from the app itself. The app fetches data dynamically. Android developers can run it in an emulator, archive the data packets, and convert them to CSV so that the Open Data Portal can publish the archived data. The archived data can be of great value to researchers and environmental groups for analysing how much sewage is treated, how much is untreated, how it affects the water bodies, etc. |
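The packet-to-CSV step might look roughly like the sketch below. The actual packet format of the app is not known here, so this assumes each archived packet is a flat JSON object; the field names (`plant`, `timestamp`, `inflow_mld`, `status`) and the helper `packets_to_csv` are purely hypothetical.

```python
import csv
import io
import json


def packets_to_csv(packets, out_file):
    """Write flat dict packets to CSV, using the union of their keys as columns."""
    columns = []
    for packet in packets:
        for key in packet:
            if key not in columns:
                columns.append(key)
    # restval fills in fields that a given packet does not carry.
    writer = csv.DictWriter(out_file, fieldnames=columns, restval="")
    writer.writeheader()
    writer.writerows(packets)
    return columns


# Hypothetical archived packets, one JSON object per line (assumption).
RAW = [
    '{"plant": "Plant A", "timestamp": "2019-01-15T10:00:00", "inflow_mld": 12.5}',
    '{"plant": "Plant B", "timestamp": "2019-01-15T10:00:00", "inflow_mld": 8.0, "status": "ok"}',
]

packets = [json.loads(line) for line in RAW]
buffer = io.StringIO()
columns = packets_to_csv(packets, buffer)
csv_text = buffer.getvalue()
```

Taking the union of keys keeps the CSV to a single header row even if the app's packets vary over time. Nested packets would need flattening first.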
Data cleaning tasks for data hosted on Pune Open Data Portal
Tabular listing: there are cases where the Excel container has messed up dates, interpreting dd/mm/yyyy as mm/dd/yyyy. Also, multiple-row headers, merged cells, etc. make some of the data unsuitable for programmatic reading.
Possible things that can be done:
- Fix dates
- Make CSVs with one header row, no gaps, etc.
- Create an accompanying document / cover letter that details what each column stands for, etc.
- Make unpivoted ('narrow') versions of pivoted ('wide') data |
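Two of these cleaning steps, the date fix and the unpivoting, can be sketched with the Python standard library. This is a minimal sketch under stated assumptions: `undo_day_month_swap` is only meant for cells already known to have been misread as mm/dd/yyyy (typically those whose original day was 12 or less), and the helper names and the `ward` / year columns in the sample are illustrative, not taken from any portal dataset.

```python
from datetime import date


def undo_day_month_swap(misread):
    """Swap day and month back for a date that was read as mm/dd instead of dd/mm."""
    return date(misread.year, misread.day, misread.month)


def unpivot(rows, id_columns, variable_name="variable", value_name="value"):
    """Turn wide rows (one column per category) into narrow rows (one row per value)."""
    narrow = []
    for row in rows:
        ids = {col: row[col] for col in id_columns}
        for col, value in row.items():
            if col not in id_columns:
                narrow.append({**ids, variable_name: col, value_name: value})
    return narrow


# "02/05/2019" was meant as 2 May 2019 but was read as 5 February 2019.
fixed = undo_day_month_swap(date(2019, 2, 5))

# Made-up wide table: one column per year.
wide = [
    {"ward": "W1", "2017": 10, "2018": 12},
    {"ward": "W2", "2017": 7, "2018": 9},
]
narrow = unpivot(wide, ["ward"], variable_name="year", value_name="count")
```

Deciding which cells were misread is the harder part and depends on the dataset; checking whether dates in a column run in a plausible sequence is one possible heuristic.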
MH talukas: PDFs and shapefiles comparison
Taluka PDF maps from MRSAC: http://www.mrsac.gov.in/en/taluka-maps
Each PDF has outlines and names of the villages in the taluka.
MH villages shapefile: https://drive.google.com/open?id=0B3gxOiUzXTR-RVdZNXh4X1huUG8
What we have to do is
Possible discrepancies
Discrepancies can be logged in this tracking sheet (request the organisers for access), or compiled separately if there are more details. As far as possible, we should standardise them into tables rather than keep them verbose.
Larger aim of the exercise
To document discrepancies between the official PDFs and the villages shapefile that Datameet has.
Why
|
Finding ward number geospatially
Given a dataset of entities in Pune with lat-long locations, use QGIS or other geospatial tools to determine the ward number under which each data point falls, and create an additional column in the dataset indicating the ward number.
Supporting data: Pune ward maps, latest as well as previous.
Data this exercise can be done on: http://opendata.punecorporation.org/Citizen/CitizenDatasets/Index?categoryId=37 |
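For those who prefer scripting over QGIS, the ward lookup is a point-in-polygon test. Below is a minimal sketch using the ray-casting method in plain Python. The ward polygons are made-up rectangles, not real Pune ward boundaries, and the names `point_in_polygon`, `assign_ward` and the `ward_no` column are assumptions for illustration; real ward maps would first have to be loaded as lists of (lon, lat) vertices.

```python
def point_in_polygon(lon, lat, polygon):
    """Ray casting: count how many polygon edges a ray going east from the point crosses."""
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        # Only edges that straddle the point's latitude can be crossed.
        if (y1 > lat) != (y2 > lat):
            x_cross = x1 + (lat - y1) * (x2 - x1) / (y2 - y1)
            if lon < x_cross:
                inside = not inside
    return inside


def assign_ward(rows, wards):
    """Add a ward_no column to each row; None if the point falls in no ward."""
    for row in rows:
        row["ward_no"] = next(
            (ward_no for ward_no, polygon in wards.items()
             if point_in_polygon(row["lon"], row["lat"], polygon)),
            None,
        )
    return rows


# Hypothetical ward outlines as (lon, lat) vertices (not real boundaries).
WARDS = {
    "1": [(73.80, 18.50), (73.85, 18.50), (73.85, 18.55), (73.80, 18.55)],
    "2": [(73.85, 18.50), (73.90, 18.50), (73.90, 18.55), (73.85, 18.55)],
}

points = [
    {"name": "A", "lon": 73.82, "lat": 18.52},
    {"name": "B", "lon": 73.88, "lat": 18.53},
    {"name": "C", "lon": 74.10, "lat": 18.52},
]
tagged = assign_ward(points, WARDS)
```

This sketch ignores polygons with holes and points lying exactly on a boundary; a geospatial tool such as QGIS handles those cases and is the safer choice for the final dataset.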
Linguistic / NLP analysis on Grievances / Feedback datasets
Linguistic / NLP analysis on the Grievances / Feedback datasets hosted on Pune Open Data Portal. |
Main participant audience: 3rd-year computer engineering and IT students of COEP. The event will be optional to attend and will be open to others.