Open source repositories tagged with #datacleansing, ranked by health score.
OpenRefine is a free, open source power tool for working with messy data and improving it