The IPA Data Cleaning Package (ipaclean) is a Stata package that contains IPA's custom Stata programs for cleaning & validating survey data. The package includes the following programs:
ipaappend
- Safely append datasetsipamegre
- Safely merge datasetsipaodksplit
- Split select_multiple responses into dummy variablesipaodkmergerepeats
- Reshape and merge ODK/SurveyCTO stype repeat groups in wide formatipacompare
- Compare data from multiple rounds of surveyipacodebook
- Export nicely formatted codebooks to excel
* ipaclean may be installed directly from GitHub
net install ipaclean, all replace from("https://raw.githubusercontent.com/PovertyAction/ipaclean/main")
net install ipahelper, all replace from("https://raw.githubusercontent.com/PovertyAction/ipahelper/main")
ipaclean update
* after initial installation ipaclean can be updated at any time using
ipaclean update
* to verify you have the latest versions of the commands
ipaclean version
Check out the IPACLEAN wiki for more information about ipaclean.
If you encounter a clear bug, please file a minimal reproducible example on github. For questions and other discussion, please email us at researchsupport@poverty-action.org.
ipaodksplit
is partly based on theodksplit
command written by A.R.M Mehrab Ali(https://github.com/ARCED-Foundation)ipaodkmergerepeats
is heavily based onodkmergerepeats
written by Chris Boyeripacodebook
is inspired by cbook_stats written by Michael Rosenbaum. The template() and applyusing options of the command are inspired by the iecodebook command from the World Bank DIME Analytics Team.