Creating a knowledge base based on IBM FileNet Content Manager content

You create a knowledge base by using Classification Workbench to analyze a collection of sample documents from your IBM® FileNet® Content Manager object store and gather statistics about them. Training a knowledge base with sample data enables IBM Content Classification to classify similar content.

If your decision plan uses knowledge bases that correspond to folders or document classes in IBM FileNet Content Manager, create a separate knowledge base for each type of classification. Category names in the knowledge base must follow this convention:

The workflow that you follow to gather sample documents for your knowledge base depends on the status of your data.

Status of your data and the corresponding procedure
Documents are already organized in an IBM FileNet Content Manager object store
  1. Configure and run the Content Extractor.
  2. Import the extracted content and XML data into Classification Workbench.
  3. Create and analyze a knowledge base based on the IBM FileNet Content Manager folder or document class structure.
  4. Create a decision plan to specify rules for classifying the content.
Documents are not yet in IBM FileNet Content Manager, but they are already categorized (for example, in file system folders)
  1. Import documents directly into Classification Workbench.
  2. Create and analyze a knowledge base based on the file system directory structure.
  3. Create a decision plan to specify rules for classifying the content. If you plan to classify documents into folders or document classes, create folders and document classes in an IBM FileNet Content Manager object store that correspond to categories in the knowledge base.
Documents are not categorized, but you know how they should be categorized
  1. Import documents into Classification Workbench.
  2. Use Classification Workbench to assign categories to the documents, and then create and analyze a knowledge base.
  3. Create a decision plan to specify rules for classifying the content. If you plan to classify documents into folders or document classes, create folders and document classes in an IBM FileNet Content Manager object store that correspond to categories in the knowledge base.
Documents are not categorized, and you do not know how they should be categorized
  1. Import documents into the Taxonomy Proposer and discover categories.
  2. Import categorized content items into Classification Workbench.
  3. Create and analyze a knowledge base based on the discovered categories.
  4. Create a decision plan to specify rules for classifying the content.
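
The choice among the four procedures above depends on three questions about your data. As an illustration only, the following Python sketch encodes that choice. The DataStatus class and the choose_procedure function are hypothetical names introduced for this example; they are not part of IBM Content Classification, Classification Workbench, or any IBM API. The step text is abbreviated from the procedures listed above.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass(frozen=True)
class DataStatus:
    """Hypothetical description of your sample data (not an IBM API)."""
    in_object_store: bool   # documents are already organized in an object store
    categorized: bool       # documents are already sorted, e.g. in file system folders
    categories_known: bool  # you know how the documents should be categorized


# Final step shared by every procedure.
DECISION_PLAN = "Create a decision plan to specify rules for classifying the content."


def choose_procedure(status: DataStatus) -> list[str]:
    """Return the ordered steps that match the status of the data."""
    if status.in_object_store:
        return [
            "Configure and run the Content Extractor.",
            "Import the extracted content and XML data into Classification Workbench.",
            "Create and analyze a knowledge base based on the folder or document class structure.",
            DECISION_PLAN,
        ]
    if status.categorized:
        return [
            "Import documents directly into Classification Workbench.",
            "Create and analyze a knowledge base based on the file system directory structure.",
            DECISION_PLAN,
        ]
    if status.categories_known:
        return [
            "Import documents into Classification Workbench.",
            "Assign categories to documents, then create and analyze a knowledge base.",
            DECISION_PLAN,
        ]
    return [
        "Import documents into the Taxonomy Proposer and discover categories.",
        "Import categorized content items into Classification Workbench.",
        "Create and analyze a knowledge base based on the discovered categories.",
        DECISION_PLAN,
    ]


if __name__ == "__main__":
    # Example: documents sit in file system folders, not yet in an object store.
    status = DataStatus(in_object_store=False, categorized=True, categories_known=True)
    for number, step in enumerate(choose_procedure(status), start=1):
        print(f"{number}. {step}")
```

The checks run in the same order as the rows above, so data that is already in an object store always takes the Content Extractor route, regardless of the other two answers.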