Generating a CSV Term Hit Details Export report
The CSV Term Hit Details Export system report provides an exhaustive list of matches from the documents in the selected infoset to the search terms given as report parameters.
Before you begin
Important: Before you run this report on an infoset that you created in a product release before version 7.6.0.15, rerun any Step-up Analytics actions to ensure that the information required for proper report results is added to the index.
Starting with version 7.6.0.17, unmodified files are skipped when a Step-up Analytics action is run unless any of its cartridges was updated. Therefore, update at least one cartridge per action to be able to rerun the action.
About this task
When you generate the report, you specify the search terms in addition to the basic report information such as the report name and optional recipients.
For each match in a data object, the report contains basic information about the
object, such as its location, size, type, and ownership, and when the object was created, last
accessed, or modified. In addition, these details are available:
- The search term as you entered it in your request.
- The overall match count for a document; a value of 3, for example, indicates that three search expression matches were found in the document. The matches can be for one search expression or for different ones. Each has its own entry in the report resulting in three report rows for that document.
- The text that matched a search expression. A maximum of 128 characters is included in the report.
- Short snippets of the text to the left and right of the match to provide some context. This column is filled only for text matches on indexed annotations in documents that were processed with the respective cartridge.
- The UIMA type of the match. This column is filled only for text matches on indexed annotations in documents that were processed with the respective cartridge.
- The language as detected for that document.
- The offset of both the start and the end of the match within the document as a number of plain text characters.
- The document size in KB.
- An offset percentage to give you a rough understanding of where in the document to look for the match; the lower the value, the closer to the start of the document.
- The node ID, which is the unique StoredIQ identifier of the document.
- The date and time that the index was built.
Important: Depending on your search terms, the generated report might contain
sensitive data.
Procedure
To generate a CSV Term Hit Details Export report:
