Technical Blog Post
Abstract
How to use the CSS skip feature in bulk
Body
When indexing content into IBM Content Search Services (CSS), retries for non-indexable content such as TIFF files should not occur. If they do, you can tell the Content Platform Engine to skip those index requests, so that it does not include them again when it re-dispatches content for indexing. This feature is typically used for problem documents, such as corrupt content or emails with unusually large attachments.
If there are only a few such documents, it is easy to use the Administration Console for Content Platform Engine (ACCE) to navigate to the Indexing Queue and select Actions --> Skip Requests. When there are too many items to handle one by one, you can use the following bulk processing approach.
How to use the CSS skip feature in bulk
1. In the ACCE administration tool, open an object store and create a new object store search.
2. On the SQL view tab, enter the following query:
SELECT TOP 100 This FROM CmIndexRequest WHERE RetryCount > 1
3. On the Bulk Action tab, select Enable bulk action.
4. In the Script section, click Run script.
5. Copy the following JavaScript code and paste it into the Script field.
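// Import the FileNet API constants used by the handler.
importClass(Packages.com.filenet.api.constants.RefreshMode);
importClass(Packages.com.filenet.api.constants.IndexingOperation);

// Called for each index request returned by the query.
function OnCustomProcess(CEObject)
{
    // Mark the index request as skipped, then save the change.
    CEObject.set_IndexingOperation(IndexingOperation.SKIP);
    CEObject.save(RefreshMode.REFRESH);
}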
6. Click Run. The administration console runs the query and the JavaScript action.
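To see what the bulk action does to each query result before running it against a real object store, the handler from step 5 can be exercised locally. The following is only a sketch for a plain JavaScript runtime such as Node.js, outside ACCE: the RefreshMode and IndexingOperation objects are stand-ins for the FileNet constants that importClass provides, and makeFakeIndexRequest and runBulkAction are hypothetical helpers that are not part of any IBM API.

```javascript
// Local dry run only. These two objects are stand-ins (assumptions) for the
// FileNet constants that importClass() makes available inside ACCE.
const RefreshMode = { REFRESH: "REFRESH" };
const IndexingOperation = { SKIP: "SKIP" };

// Same handler body as the script in step 5.
function OnCustomProcess(CEObject) {
  CEObject.set_IndexingOperation(IndexingOperation.SKIP);
  CEObject.save(RefreshMode.REFRESH);
}

// Hypothetical fake index request that records what the handler does to it.
function makeFakeIndexRequest(id) {
  return {
    id: id,
    indexingOperation: null,
    savedWith: null,
    set_IndexingOperation(operation) {
      this.indexingOperation = operation;
    },
    save(refreshMode) {
      this.savedWith = refreshMode;
    },
  };
}

// Hypothetical stand-in for the bulk action: call the handler once per result.
function runBulkAction(requests) {
  requests.forEach((request) => OnCustomProcess(request));
  return requests;
}

const results = runBulkAction([
  makeFakeIndexRequest("request-1"),
  makeFakeIndexRequest("request-2"),
]);

// Print one line per fake request showing the recorded operation and save mode.
results.forEach((request) => {
  console.log(request.id, request.indexingOperation, request.savedWith);
});
```

Every fake request ends up marked with the skip operation and saved once, which mirrors what the real script is expected to do for each CmIndexRequest returned by the query. Because the query uses TOP 100, a single run handles at most 100 requests, so a larger backlog would presumably need the search to be run more than once.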
UID
ibm11280614