Verifying the WorkSite IDOL index database for missing or corrupt documents

If you are receiving search results that appear to not have documents you are sure should be there, moved your indexer to another server, or the Worksite DiffTool is not resolving missing document issues, this may assist support in tracking down your problem.  To run a check on the database, new indexing and full-text searching will be paused, but explicit SQL searches such as doc number, author, client, etc will still work.

1 – The first step is to stop the indexing services on your index server, using either the supplied batch file from Autonomy or the services.msc console.

2 – Navigate to the Content Engine directory, and open the .cfg file there.  Add the following lines in the [server] section:

Validatenodetable=true

Validatebtree=true

Validatediskindex=true

ValidateRefIndex=true

3 – Start the Content Engine service only.  This will start the validation.  The time will depend on your hardware and the size of your database (number of documents)

4 – Open the application.log file in the Content Engine’s logs folder.  This will give you the status of the check and the database.

 Once you see some lines such as the following, its an indication the check is running.  There are many different stages, so the first 100% does not mean its completed:

08/08/2011 16:41:09 [1] Starting validation of 2754305 term entries.
08/08/2011 16:41:14 [1] Validation 10% complete
08/08/2011 16:41:23 [1] Validation 20% complete
08/08/2011 16:41:38 [1] Validation 30% complete
08/08/2011 16:41:53 [1] Validation 40% complete

Some examples of errors are as follows.  If these are not fixed with the DiffTool, a rebuild of the index will usually be required.

08/08/2011 16:53:27 [1] Reference index for doc 368972 is missing. H: 1. V: “IMANAGE1.234916_1″
08/08/2011 16:53:27 [1] Reference index for doc 368973 is missing. H: 0. V: “IMANAGE1.234205_1″
08/08/2011 16:53:27 [1] Reference index for doc 368973 is missing. H: 2. V: “IMANAGE1.234205_1″
08/08/2011 16:53:27 [1] Reference index for doc 368973 is missing. H: 1. V: “IMANAGE1.234205_1″
08/08/2011 16:53:27 [1] Reference index for doc 368975 is missing. H: 0. V: “IMANAGE1.235021_1″
08/08/2011 16:53:27 [1] Reference index for doc 368975 is missing. H: 2. V: “IMANAGE1.235021_1″

 The final line of the log once its complete will read as follows depending on your database:

08/08/2011 16:53:41 [1] Refindex validation complete. 419134 docs. 913420 refs checked. 47482 failures.

Leave a Comment

Your email address will not be published. Required fields are marked *

Scroll to Top