X hits on this document

163 views

0 shares

0 downloads

0 comments

46 / 64

BI and the “Unstructured Data” Challenge

46

Information Extraction An e-mail message is “semi-structured.”

Semi=half. What’s “structured” and what’s not? Is augmentation/tagging and entity extraction enough?

What categorization might you create from that example message?

If we extracted all the entities to a database, what could you do with them?

From semi-structured text, it’s especially easy to extract metadata.

There are many forms of s-s information...

©Alta Plana Corporation, 2008

The Data Warehousing Institute

Document info
Document views163
Page views163
Page last viewedMon Dec 05 13:27:33 UTC 2016
Pages64
Paragraphs499
Words3241

Comments