Document Mining and Analysis on Environmental Reports

EDR, Inc. Mathematics, 2016-17

Liaison(s): Paul R. Schiffer, Richard White, Zach Fisk
Advisor(s): Weiqing Gu
Students(s): Vinh The Hoang (PM), Abram Sanderson, Annaliese Johnson, Matthew Bae, Johan Hoeger

The EDR clinic team was asked to work on a solution for EDR which would enable them to tag key pieces of information that appear in State and Federal Government environmental documents. As a major holder and distributor of environmental data, EDR desires to have a more effective method for finding desired information in these documents than their current methods allow. Since an environmental professional might be interested in multiple aspects of any given the document, the EDR team has explored classification and search techniques for both images and text in order to analyze and tag documents. The team’s work will be formatted as per EDR’s request, for adoption by EDR, and merging into their current systems.