Genii Weblog

Data mining for random phrases

Tue 22 Mar 2016, 09:41 PM



by Ben Langhinrichs
Every few years, our Midas LSX evolves to take on new challenges. Sometimes those challenges seem obvious, such as exporting Notes data to HTML or JSON with high fidelity. Sometimes, though, they are not so obvious. Over the past year, we have been evolving the data export features to allow data mining, or selective extraction. While I could show lots of details, I thought I'd write a few posts showing different sorts of data extraction. All are done with the Export to CSV sample, and so require no "programming", not even in LotusScript. For these, I'll generally use Tableau, an excellent data visualization tool, though honestly the output is a CSV file which could be analyzed in almost any software package.

For the first post, I thought I'd search for a specific phrase in rich text fields, although I will export more than just the phrase. Each row contains a text record along with the name and UNID of the document, so if somebody mentions the phrase multiple times, there will be multiple rows. If there are no mentions, no rows are created.
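To make the row layout concrete, here is a minimal Python sketch of the idea. It is not the Midas LSX or the Export to CSV sample, and it does not touch Notes at all. The `SAMPLE_DOCS` data, the `phrase_rows` and `export_csv` helpers, the column names, the case-insensitive match, and the choice to treat each paragraph as one text record are all assumptions made for illustration.

```python
import csv
import io

# Hypothetical stand-in for documents read from a Notes database.
# Each paragraph is treated as one "text record" (an assumption).
SAMPLE_DOCS = [
    {
        "unid": "UNID-A",
        "name": "Alice",
        "paragraphs": [
            "Rich text is hard to export.",
            "We also care about security.",
            "More on rich text tables.",
        ],
    },
    {
        "unid": "UNID-B",
        "name": "Bob",
        "paragraphs": ["Nothing relevant here."],
    },
]


def phrase_rows(documents, phrase):
    """Yield one (unid, name, text) row per text record containing phrase."""
    needle = phrase.lower()
    for doc in documents:
        for paragraph in doc["paragraphs"]:
            # Case-insensitive substring match (an assumption).
            if needle in paragraph.lower():
                yield doc["unid"], doc["name"], paragraph


def export_csv(documents, phrase):
    """Return CSV text with a header row plus one row per match."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["UNID", "Name", "Text"])
    writer.writerows(phrase_rows(documents, phrase))
    return out.getvalue()


if __name__ == "__main__":
    print(export_csv(SAMPLE_DOCS, "rich text"))
```

In this sketch, Alice's document yields two rows for "rich text" because two of her paragraphs mention it, while Bob's document yields none. That row-per-mention shape is what makes it easy to count mentions per person or per document in a visualization tool.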

I made two simple charts, one for the phrase "rich text" and the other for the phrase "security". For both, I searched the Business Partner forum from 2004. (At the very bottom, you can see the Export Directive I used, to give you an idea of how this works.)

Inline JPEG image


Inline JPEG image

The form I filled out for the "security" search is below.

Inline JPEG image


Copyright 2016 Genii Software Ltd.
