Image

Kanishka Bhaduri

Member since: Sep 24, 2010, Mission Critical Technologies Inc

Anomaly Detection from ASRS Databases of Textual Reports

shared by Kanishka Bhaduri, updated on Sep 10, 2010

Summary

Our primary goal is to automatically analyze textual reports from the Aviation Safety Reporting System (ASRS) database to detect/discover the anomaly categories reported by the pilots, and to assign each report to the appropriate category/categories. We have used two state-of-the-art models for text analysis: (i) mixture of von Mises Fisher (movMF) distributions, and (ii) latent Dirichlet allocation (LDA) on a subset of all ASRS reports. The models achieve a reasonably high performance in discovering anomaly categories and clustering reports. Each category is represented by the most representative words with the highest probability in this category. In addition, since the inference algorithm for LDA was somewhat slow, we have developed a new fast LDA algorithm which is 5-10 times more efficient than the original one, therefore more applicable for the practical use. Further, we have developed a simple visualization tool based on non-linear manifold embedding (ISOMAP) to generate a 2-d visual representation of each report based on its content/topics, which gives a direct view of the structure of the whole dataset as well as the outliers.

Files

CIDU_poster_2.ppt
294.5 KB 252 downloads

Discussions

Add New Comment

Kanishka's Projects (4)

Need help?

Visit our help center