OpenWest 2014/Unstructured Data

Add Structure to Unstructured Data: Text Analysis and Speech to Text
by Craig Golightly

"A good chunk of what is produced each day on the internet and within companies is Unstructured Data (free form text fields, social media, voice comments) and too many people are getting buried by the sheer amount of this data. It doesn't store or sort nicely in its raw form, yet that is what most people do--they just store it and save it for "later". Join me for some real life examples and applications of tools to make sense of your Unstructured Data and turn it into something useful NOW."

-

Text analytic and speech to text

Consistency is not a human trait

100,000 tweets, 5 sorters, 44% agreement
 * if accuracy is less than 100% = Fail
 * if accuracy is greater than 0% = Win

Text analysis allows us to monitor the known and discover the unknown

Not just used to sort and search but FILTER!

Can monitor 100% of call center recordings? How to pick?

1) Maturity? how long has it been around

2) Features - do they add value or is it just cool

Can they find imperative (action) items. If they claim 100% it is a sales pitch. Uses Wikipedia logic to find categories sentiment must have positive, negative, neutral intent.

Speech (phoetic search) monitors - known (speech to text)

3) Speed & Scale - desktop or larger. Small footprint and speed.

4) Open Data - what format is data in text, proprietary, accessibility, metadata, measure of confidence

5) tune-ability, - does it work with our process?

6) Cost - not just licensing but data center and configuration cost

notes by Bethany