
BI and the “Unstructured Data” Challenge

Information Extraction

For “traditional” BI on text, focus on extracting information into databases.

Entities and concepts (features) are like dimensions in a standard BI model. Both classes of object are hierarchically organized and have attributes.
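As a rough illustration of treating extracted entities like BI dimensions, the sketch below loads entity mentions into a small dimension table and a fact table, then rolls counts up one level of the hierarchy. The gazetteer, the sample documents, and the table and column names (`entity_dim`, `mention_fact`, `parent`) are all hypothetical and chosen only for this example; a real extractor would be far more capable than a dictionary lookup.

```python
import sqlite3

# Hypothetical gazetteer: surface form -> (entity name, entity type, parent category).
GAZETTEER = {
    "ibm": ("IBM", "company", "Technology"),
    "oracle": ("Oracle", "company", "Technology"),
    "pfizer": ("Pfizer", "company", "Pharma"),
}

# Hypothetical sample documents, keyed by document id.
DOCS = {
    1: "IBM and Oracle both sell database software.",
    2: "Pfizer signed a contract with IBM.",
}


def load(conn):
    """Extract gazetteer entities from DOCS and store them in a dimension/fact layout."""
    conn.executescript(
        """
        CREATE TABLE entity_dim (
            entity_id   INTEGER PRIMARY KEY,
            name        TEXT UNIQUE,
            entity_type TEXT,   -- attribute of the entity
            parent      TEXT    -- one level of hierarchy
        );
        CREATE TABLE mention_fact (doc_id INTEGER, entity_id INTEGER);
        """
    )
    for doc_id, text in DOCS.items():
        for token in text.lower().replace(".", " ").split():
            if token not in GAZETTEER:
                continue
            name, entity_type, parent = GAZETTEER[token]
            # Add the entity to the dimension table once, then record the mention.
            conn.execute(
                "INSERT OR IGNORE INTO entity_dim (name, entity_type, parent) "
                "VALUES (?, ?, ?)",
                (name, entity_type, parent),
            )
            (entity_id,) = conn.execute(
                "SELECT entity_id FROM entity_dim WHERE name = ?", (name,)
            ).fetchone()
            conn.execute("INSERT INTO mention_fact VALUES (?, ?)", (doc_id, entity_id))


def mentions_by_parent(conn):
    """Roll mention counts up to the parent level, as a BI query would."""
    return conn.execute(
        """
        SELECT d.parent, COUNT(*)
        FROM mention_fact f JOIN entity_dim d USING (entity_id)
        GROUP BY d.parent
        ORDER BY d.parent
        """
    ).fetchall()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    load(conn)
    print(mentions_by_parent(conn))
```

Once mentions sit in tables like these, ordinary SQL grouping plays the role of a roll-up along the entity hierarchy.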

We can have both discovered and predetermined classifications (taxonomies) of text features.

Text-sourced information has very high dimensionality.
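One way to see the dimensionality point is a simple bag-of-words view, in which every distinct term becomes its own feature. The sketch below is only an assumption about how one might count those features; the function name `term_vectors` and the sample sentences are illustrative and not from the presentation.

```python
from collections import Counter


def term_vectors(docs):
    """Return the vocabulary and one term-count vector per document."""
    # Every distinct lowercase token becomes one dimension.
    vocab = sorted({token for doc in docs for token in doc.lower().split()})
    vectors = []
    for doc in docs:
        counts = Counter(doc.lower().split())
        vectors.append([counts.get(term, 0) for term in vocab])
    return vocab, vectors


if __name__ == "__main__":
    # Hypothetical sample sentences.
    docs = [
        "the customer praised the service",
        "the customer returned the product",
    ]
    vocab, vectors = term_vectors(docs)
    print(len(vocab), "dimensions for", len(docs), "documents")
```

Even two short sentences yield more dimensions than documents, and the gap widens quickly on a real corpus.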

©Alta Plana Corporation, 2008

The Data Warehousing Institute
