Patrick Moran, College of Charleston


Feature Extraction from Textual Datasets



Abstract: A new method is presented for topic extraction from sets of textual documents. This method takes into account word frequencies, semantic similarity of words, and proximity of words in the dataset. Existing techniques such as a nonnegative matrix factorization and graph clustering algorithms are employed. This method was developed and tested on sets of camera reviews, and results will be presented.

Advisor: Prof. Carl Meyer (North Carolina State University)