IRS Form 990 Decoder
This repository contains everything you need to get started exploring the IRS Form 990 dataset hosted by Amazon Web Services on S3. This includes instructions for an easier-to-use 990 database provided free to the public by Charity Navigator.
New version available!
As part of the Nonprofit Open Data Collective, Charity Navigator has been proud to contribute to the eFile Master Concordance, which provides a standardized mapping between the many XML schemas in the primary dataset. The concordance is still in draft. A validation event took place in November 2017.
The Charity Navigator 990 Decoder and the community concordance has been the basis for several 990-related projects, including IRSx and Open990, as well as a body of documentation hosted by the Nonprofit Open Data Collective.
As a result of this hard work, Charity Navigator now has code capable of extracting all fields from the entire IRS eFile dataset. The data will be made publcily available after the Validatathon event. If you wish to preview the data, please follow the instructions in the readme for the new code.
Want to DIY?
Since this library was written, more and more resources have become available for analyzing the IRS e-file dataset. If you want to build your own 990 analysis tool, the author of this library has written a how-to article on Medium.
Working with the original 990 Toolkit
Due to the imminent release of a much richer dataset, we have deprecated our original toolkit. If you wish to access it anyway, you can view the original documentation here.
Authors (original version)
Code and visualizations: David Bruce Borenstein
Documentation: David Bruce Borenstein and Zach Weinsteiger
Crosswalk between XML and database columns (990): Vince Bogucki
Crosswalk between XML and database columns (EZ): David Bruce Borenstein and Zach Weinsteiger