Index and analyze documents

To make the content of documents searchable, you add the documents to the index.

Note: Although group members and group leaders can index documents if they have permissions, typically, a case administrator performs this task.

To cluster documents by conceptual similarity, you can analyze the text of a document and automatically extract common concepts. Concepts are nouns or noun phrases that describe a document, such as names, places, organizations, themes, and so on. The application weights each concept based on the concept's relevance. When you view documents in the Map and on the Mines page, documents with similar concepts are clustered together.

Depending on when you want to analyze document concepts, the following options are available:

You can submit documents for indexing and concept analysis the next time that an indexing and enrichment job runs. For more information, see Index document content.

You can analyze document concepts immediately, without waiting for the next indexing and enrichment job. The availability of this features depends on the case settings. For more information, see Analyze and extract document concepts.

For more information about viewing the concepts that are associated with a document, see View document coding, to view concepts from the Code pane; About concepts and clusters in the Map to view concepts in the Map pane; or Analyze document content using mines to view concepts in the Mines View.

Index document content

You can index documents to make the content of the documents searchable. Depending on the case settings, you can also reanalyze documents for concepts the next time that an indexing and enrichment job is run.

Note: You do not need to submit documents for a content file find job, because indexing and enrichment includes the content file find job.

You can also exclude documents from the index, to prevent document content from being searchable.

The permissions set by your administrator determine access to this feature.

Add documents to the index

To add documents to the index:

In the List pane, select the check box next to the documents that you want to index.

On the Tools menu, select Indexing and enrichment.

If the Submit for list appears, select Indexing and enrichment.

Select Refresh indexing and enrichment.

User HelpDepending on the case settings, you can reanalyze documents for concepts the next time that an indexing and enrichment job runs. To do this, select the Concept analysis check box.

Click OK.

The index is updated when the next indexing and enrichment job runs. Your administrator can also manually update the index.

Note: As of Nuix Discover version 10.6.005, for the Predictive Coding and Production features, Nuix Discover is configured (by default) to skip email headers for .msg files during indexing. To apply this change, you must re-index your case.

Remove documents from the index

To exclude documents from the index:

In the List pane, select the check box next to the documents that you want to index.

On the Tools menu, select Indexing and enrichment.

If the Submit for list appears, select Indexing and enrichment.

Select Exclude from indexing and enrichment.

Click OK.

Analyze and extract document concepts

Depending on the case settings, you can analyze and extract document concepts immediately, without waiting for the next indexing and enrichment job.

The permissions set by your administrator determine access to this feature.

To analyze document concepts immediately:

In the List pane, select the check box next to the documents that you want to analyze.

Note: Documents must be indexed before you can submit them for concept analysis. For information about how to index documents and submit them for concept analysis at the same time, see Add documents to the index.

On the Tools menu, select Indexing and enrichment.

In the Submit for list, select Concept analysis.

By default, the application analyzes and extracts document concepts for only the documents that have not yet been analyzed. If you want to reanalyze documents that have already been analyzed, select the Refresh concept analysis for selected documents that have already been analyzed check box.

Click OK.

Concept analysis is run on documents that are already indexed.