Ben Langhinrichs

Photograph of Ben Langhinrichs

E-mail address - Ben Langhinrichs







Recent posts

Mon 16 Sep 2019

About that email in Notes



Mon 9 Sep 2019

Perils of PDF 4: Missing and obscured data



Fri 6 Sep 2019

Perils of PDF 3: Wide Tables and data loss


October, 2019
SMTWTFS
  01 02 03 04 05
06 07 08 09 10 11 12
13 14 15 16 17 18 19
20 21 22 23 24 25 26
27 28 29 30 31

Search the weblog





























Genii Weblog

Data mining thought for the day

Wed 16 Dec 2015, 09:24 AM



by Ben Langhinrichs
In between dealing with the horror that is Internet Explorer 9 and releasing new versions, I am working on a presentation on data mining in Notes rich text. With that in mind, here is my data mining thought for the day: 
 
There is implicit as well as explicit data and meta data. Explicit is there to be read, implicit is there to be discerned.
 
  1. Explicit data is the content (e.g., field in document; audio of phone call).
  2. Explicit meta data is the context (e.g., db and views where document is found; identity of callers and time of call).
  3. Implicit data is the internal implied context (e.g., words appear to be in English; caller sounds angry and agitated).
  4. Implicit meta data is the external cumulative context (e.g., occasional words in documents by this author appear to be German words which might imply native tongue; calls between person A and place of employment tend to be more agitated and frequent very late on Fridays which might imply somebody has to work weekends and is angry about it).
 
OK, back to Internet Explorer. If I am not heard from soon, send the Saint Bernard and brandy.
 

Copyright © 2015 Genii Software Ltd.

What has been said:

No documents found