The NGO Clarity project (that's the name for the moment!) aims to demonstrate a method to assess the quality of NGO organisational data extracted from public/governmental open data websites.
We have used a sample dataset of 36000 organisations from the Indian Planning Commission website to demonstrate our method. We have created and are testing out a process for operating on raw data from the website, extracted into a spreadsheet, to produce a series of quality metrics.
We are proposing that these quality metrics are categorised into:
There are quality assessment rules we have defined along each of these categories, and are currently implementing the technical solution for the process.
The goal of the project is to kick-start a movement to help donor organisations make informed donation decisions based solidly on analysis of data about potential donation recipients.
We use XML, XQuery, XPath, EXIST (open source XML database), JavaScript for the purpose of demonstration.
Update on 4 Dec - we have changed the approach to make it easy to get a demo out by end of day. Currently using Python instead of XML, on an extraction of data from Excel.