BI and the “Unstructured Data” Challenge
For “traditional” BI on text, key in on extracting information to databases.
Entities and concepts (features) are like dimensions in a standard BI model. Both classes of object are hierarchically organized and have attributes.
We can have both discovered and predetermined classifications (taxonomies) of text features.
Text-sourced information is very high dimensionality.
©Alta Plana Corporation, 2008
The Data Warehousing Institute