BI and the “Unstructured Data” Challenge
Text Analytics
Typical steps in text analytics include:
Retrieve documents for analysis.
Create a categorization/taxonomy from the extracted features, or acquire and apply a domain-specific taxonomy.
Apply statistical techniques to classify documents and to look for patterns such as associations and clusters.
Apply statistical, linguistic, and/or structural techniques to identify, tag, and extract entities, concepts, relationships, and events (features) within document sets.
Note: tagging = text augmentation, since tags add annotations to the original text.
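The steps above can be sketched in a few lines of Python. This is only a toy illustration under stated assumptions, not a method described in the slides: the documents, the taxonomy, and the helper names (extract_entities, tag_text, classify, co_occurrences) are all hypothetical, and a naive regular expression stands in for the statistical and linguistic techniques a real text analytics tool would use.

```python
import re
from collections import Counter
from itertools import combinations

# Retrieval step: hypothetical in-memory documents standing in for a real source.
DOCUMENTS = {
    "doc1": "Acme Corp hired Jane Smith in Boston. Acme Corp reported strong sales.",
    "doc2": "Jane Smith left Boston for a role at Globex Inc.",
    "doc3": "Globex Inc reported weak sales and a recall.",
}

# Hypothetical domain-specific taxonomy: category -> trigger terms.
TAXONOMY = {
    "personnel": {"hired", "left", "role"},
    "finance": {"sales", "reported"},
}

# Naive structural pattern (an assumption): runs of two or more capitalized words.
ENTITY_PATTERN = re.compile(r"\b(?:[A-Z][a-z]+ )+[A-Z][a-z]+\b")


def extract_entities(text):
    """Identify and extract candidate entities from one document."""
    return sorted(set(ENTITY_PATTERN.findall(text)))


def tag_text(text):
    """Tagging as text augmentation: wrap each entity in an inline tag."""
    return ENTITY_PATTERN.sub(lambda m: f"<entity>{m.group(0)}</entity>", text)


def classify(text):
    """Assign every taxonomy category whose trigger terms appear in the text."""
    tokens = set(re.findall(r"[a-z]+", text.lower()))
    return sorted(cat for cat, terms in TAXONOMY.items() if tokens & terms)


def co_occurrences(documents):
    """Look for associations: count entity pairs that share a document."""
    pairs = Counter()
    for text in documents.values():
        pairs.update(combinations(extract_entities(text), 2))
    return pairs


if __name__ == "__main__":
    for doc_id, text in DOCUMENTS.items():
        print(doc_id, extract_entities(text), classify(text))
        print("  ", tag_text(text))
    print(co_occurrences(DOCUMENTS))
```

The sketch extracts features before classifying because the taxonomy and the association counts both operate on what was extracted. In practice each function would be replaced by a dedicated component, such as a named-entity recognizer or a clustering algorithm.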
©Alta Plana Corporation, 2008
The Data Warehousing Institute