The DBMS_XDBT package provides a convenient mechanism for administrators to set up a CONTEXT index on the Oracle XML DB hierarchy. The package contains procedures to create default preferences, create the index, and set up automatic synchronization of the CONTEXT index.
The DBMS_XDBT package also contains a set of package variables that describe the configuration settings for the index. These are intended to cover the basic customizations that installations may require, but they are by no means a complete set.
See Also:
Oracle XML DB Developer's Guide
This chapter contains the following topics:
Overview
Security Model
Operational Notes
The DBMS_XDBT package can be used in the following fashion:
Customize the package to set up the appropriate configuration.
Use the DROPPREFERENCES Procedure to drop any existing index preferences.
Create new index preferences using the CREATEPREFERENCES Procedure.
Create the CONTEXT index using the CREATEINDEX Procedure.
Set up automatic synchronization of the index using the CONFIGUREAUTOSYNC Procedure.
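The steps above can be sketched as a single anonymous block. This is a minimal sketch, assuming the session belongs to a user with the EXECUTE privilege on the package and that the default configuration settings are acceptable as they stand.

```sql
BEGIN
  -- Drop any index preferences left over from an earlier run.
  DBMS_XDBT.DROPPREFERENCES;
  -- Create new preferences from the current package variable values.
  DBMS_XDBT.CREATEPREFERENCES;
  -- Create the CONTEXT index on the XML DB hierarchy.
  DBMS_XDBT.CREATEINDEX;
  -- Set up the jobs that keep the index synchronized.
  DBMS_XDBT.CONFIGUREAUTOSYNC;
END;
/
```

Because PL/SQL identifiers are case-insensitive, the procedure names can be written in any case.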
Owned by XDB, the DBMS_XDBT package must be created by SYS or XDB. The EXECUTE privilege is granted to SYS or XDB. Subprograms in this package are executed using the privileges of the current user.
The DBMS_XDBT package can be customized by using a PL/SQL procedure or an anonymous block that sets the relevant package variables (the configuration settings) and then executes the procedures. A more general approach is to introduce the appropriate customizations by modifying this package in place, or by modifying a copy of it. Automatic synchronization requires that the system be configured to use job queues; the resulting jobs can be viewed through the USER_JOBS catalog views. This section describes the configuration settings, or package variables, available to customize the DBMS_XDBT package.
Table 168-1 General Indexing Settings for Customizing DBMS_XDBT
| Parameter | Default Value | Description |
|---|---|---|
| IndexName | | Name of the CONTEXT index |
| IndexTablespace | | Tablespace used by tables and indexes comprising the CONTEXT index |
| IndexMemory | | Memory used by index creation and SYNC; less than or equal to the |
| LogFile | | The log file used for ROWID logging during index creation |
Table 168-2 Filtering Settings for Customizing DBMS_XDBT
| Parameter | Default Value | Description |
|---|---|---|
| SkipFilter_Types | | List of mime types that should not be indexed. |
| NullFilter_Types | | List of mime types that do not need to use the INSO filter. Use this for text-based documents. |
| FilterPref | | Name of the filter preference. |
Table 168-3 Stoplist Settings for Customizing DBMS_XDBT
| Parameter | Default Value | Description |
|---|---|---|
| | | Name of the stoplist. |
| | | List of stopwords, in excess of |
Table 168-4 Sectioning and Section Group Settings for Customizing DBMS_XDBT
| Parameter | Default Value | Description |
|---|---|---|
| SectionGroup | | Default sectioner. Use |
| SectiongroupPref | | Name of the section group. |
Table 168-5 Other Index Preference Settings for Customizing DBMS_XDBT
| Parameter | Default Value | Description |
|---|---|---|
| DatastorePref | | Name of the datastore preference. |
| StoragePref | | Name of the storage preference. |
| | | Name of the wordlist preference. |
| | | Name of the default lexer preference. |
Table 168-6 SYNC (CONTEXT Synchronization) Settings for Customizing DBMS_XDBT
| Parameter | Default Value | Description |
|---|---|---|
| AutoSyncPolicy | | Indicates when the index should be SYNCed. One of SYNC_BY_PENDING_COUNT, SYNC_BY_TIME, or SYNC_BY_PENDING_COUNT_AND_TIME |
| MaxPendingCount | | Maximum number of documents in the |
| CheckPendingCountInterval | | How often, in minutes, the pending queue should be checked. Only if the |
| SyncInterval | | Indicates how often, in minutes, the index should be SYNCed. Only if the AutoSyncPolicy is |
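The settings in the preceding tables are ordinary package variables, so an anonymous block can assign them before calling the procedures. The following is a minimal sketch: the index name and tablespace are hypothetical values, and it assumes that both variables accept character strings and that the named tablespace already exists.

```sql
BEGIN
  -- Hypothetical values; substitute names valid for your installation.
  DBMS_XDBT.IndexName       := 'XDB_CTX_DEMO';
  DBMS_XDBT.IndexTablespace := 'USERS';

  -- Rebuild the preferences so that they pick up the new settings,
  -- then create the index under the chosen name.
  DBMS_XDBT.DROPPREFERENCES;
  DBMS_XDBT.CREATEPREFERENCES;
  DBMS_XDBT.CREATEINDEX;
END;
/
```

Assignments to package variables last only for the current session, so the settings and the procedure calls belong in the same session.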
Table 168-7 DBMS_XDBT Package Subprograms
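| Subprogram | Description |
|---|---|
| CONFIGUREAUTOSYNC Procedure | Configures the CONTEXT index for automatic synchronization |
| CREATEDATASTOREPREF Procedure | Creates a USER datastore preference for the CONTEXT index |
| | Creates a filter preference for the CONTEXT index |
| CREATEINDEX Procedure | Creates the CONTEXT index on the XML DB hierarchy |
| | Creates a lexer preference for the CONTEXT index |
| CREATEPREFERENCES Procedure | Creates preferences required for the CONTEXT index on the XML DB hierarchy |
| | Creates a storage preference for the CONTEXT index |
| | Creates a section group for the CONTEXT index |
| | Creates a wordlist preference for the CONTEXT index |
| | Creates a stoplist for the CONTEXT index |
| DROPPREFERENCES Procedure | Drops any existing preferences |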
This procedure sets up jobs for automatic SYNCs of the CONTEXT index.
Automatic synchronization requires that the system be configured for job queues. The jobs can be viewed using the USER_JOBS catalog views.
The configuration parameter AutoSyncPolicy can be set to choose an appropriate synchronization policy.
The synchronization can be based on one of the following:
| Sync Basis | Description |
|---|---|
| SYNC_BY_PENDING_COUNT | The SYNC is triggered when the number of documents in the pending queue is greater than a threshold (see the MaxPendingCount configuration setting). The pending queue is polled at regular intervals (see the CheckPendingCountInterval configuration parameter) to determine if the number of documents exceeds the threshold. |
| SYNC_BY_TIME | The SYNC is triggered at regular intervals (see the SyncInterval configuration parameter). |
| SYNC_BY_PENDING_COUNT_AND_TIME | A combination of both of the preceding options. |
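Choosing a policy might look like the following sketch. It assumes that the sync basis names are exposed as constants of the DBMS_XDBT package and that SyncInterval is numeric; the 30-minute interval is only an illustrative value.

```sql
BEGIN
  -- Assumption: the sync basis names are package constants.
  DBMS_XDBT.AutoSyncPolicy := DBMS_XDBT.SYNC_BY_TIME;
  -- Illustrative interval, in minutes.
  DBMS_XDBT.SyncInterval := 30;
  -- Create the synchronization jobs according to the policy.
  DBMS_XDBT.CONFIGUREAUTOSYNC;
END;
/

-- List the jobs owned by the current user to confirm they were created.
SELECT job, what, next_date, interval, broken
  FROM user_jobs;
```

For the SYNC_BY_PENDING_COUNT policy, the MaxPendingCount and CheckPendingCountInterval settings would be assigned in the same way.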
This procedure creates a user datastore preference for the CONTEXT index on the XML DB hierarchy.
The name of the datastore preference can be modified; see the DatastorePref configuration setting.
The default USER datastore procedure also filters the incoming document. The DBMS_XDBT package provides a set of configuration settings that control the filtering process.
The SkipFilter_Types array contains a list of regular expressions. Documents with a mime type that matches one of these expressions are not indexed. Some of the properties of the document metadata, such as author, remain unindexed.
The NullFilter_Types array contains a list of regular expressions. Documents with a mime type that matches one of these expressions are not filtered; however, they are still indexed. This is intended to be used for documents that are text-based, such as HTML, XML and plain-text.
All other documents use the INSO filter through the IFILTER API.
This procedure creates a NULL filter preference for the CONTEXT index on the XML DB hierarchy.
The name of the filter preference can be modified; see the FilterPref configuration setting.
The USER datastore procedure filters the incoming document; see CREATEDATASTOREPREF Procedure for more details.
This procedure creates the CONTEXT index on the XML DB hierarchy.
The name of the index can be changed; see the IndexName configuration setting.
Set the LogFile configuration parameter to enable ROWID logging during index creation.
Set the IndexMemory configuration parameter to determine the amount of memory that index creation, and later SYNCs, will use.
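As a rough illustration of these two settings, the block below enables logging and sets the index memory before creating the index. The log file name and memory size are hypothetical, both variables are assumed to accept character strings, and the index preferences are assumed to exist already.

```sql
BEGIN
  -- Hypothetical log file name; enables ROWID logging during index creation.
  DBMS_XDBT.LogFile := 'xdb_ctx_build.log';
  -- Illustrative memory size used by index creation and later SYNCs.
  DBMS_XDBT.IndexMemory := '64M';
  -- Create the CONTEXT index with the settings above.
  DBMS_XDBT.CREATEINDEX;
END;
/
```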
This procedure creates a BASIC lexer preference for the CONTEXT index on the XML DB hierarchy.
This procedure creates a set of default preferences based on the configuration settings.
This procedure creates a section group for the CONTEXT index on the XML DB hierarchy.
The name of the section group can be changed; see the SectiongroupPref configuration setting.
The HTML sectioner is used by default. No zone sections are created by default. If the vast majority of documents are XML, consider using the AUTO_SECTION_GROUP or the PATH_SECTION_GROUP; see the SectionGroup configuration setting.
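For a repository that holds mostly XML documents, the switch of sectioner could be sketched as follows. This assumes that the SectionGroup variable takes the section group type name as a character string, and it must run before the index is created.

```sql
BEGIN
  -- Use the path sectioner instead of the default HTML sectioner.
  DBMS_XDBT.SectionGroup := 'PATH_SECTION_GROUP';
  -- Recreate the preferences so that the section group uses the new setting.
  DBMS_XDBT.DROPPREFERENCES;
  DBMS_XDBT.CREATEPREFERENCES;
END;
/
```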
This procedure creates a stoplist for the CONTEXT index on the XML DB hierarchy.
This procedure creates a BASIC_STORAGE preference for the CONTEXT index on the XML DB hierarchy.
The name of the storage preference can be modified; see the StoragePref configuration setting.
A tablespace can be specified for the tables and indexes comprising the CONTEXT index; see the IndexTablespace configuration setting.
Prefix and Substring indexing are not turned on by default.
The I_INDEX_CLAUSE uses key compression.