Administration Guide

Adjusting the Optimization Class

When an SQL query is compiled, a number of optimization techniques can be used to determine the most efficient access plan for that query. Using more optimization techniques results in:

Improvements in run-time performance
Increased query compilation time
Increased system resource usage.

For this reason, you may wish to limit the number of techniques applied to optimizing your query by setting the optimization class. This can be particularly useful if you have:

Very small databases or very simple dynamic queries
Limited memory available at compile time on your database server
A desire to reduce the query compilation (for example, PREPARE) time.

You may select from any of the query optimization classes described below, although class 0 and class 9 should be used only in special circumstances. Class 5 is the default. Classes 0, 1, and 2 use the Greedy join enumeration algorithm; for complex queries this algorithm considers far fewer alternative plans, and incurs significantly less compilation time, than classes 3 and above. Classes 3 and above use the Dynamic Programming join enumeration algorithm; this algorithm considers far more alternative plans, and can incur significantly more compilation time, than classes 0, 1, and 2 as the number of tables increases.

0 -

This class directs the optimizer to use a minimal amount of optimization to generate an access plan. For example:

Non-uniform distribution statistics are not considered by the optimizer.
Only basic query rewrite rules are applied (see "Query Rewrite by the SQL Compiler" for information about query rewrite).
Greedy join enumeration occurs (see "Search Strategies for Selecting Optimal Join").
Only nested loop join and index scan access methods are enabled (see "Join Concepts" and "Index Scan Concepts").
List prefetch and index ANDing are disabled as access methods.
The star join strategy is not considered.

This class should only be used in special circumstances requiring the lowest possible query compilation overhead. An application consisting entirely of very simple dynamic SQL statements which access well-indexed tables is a good example of where query optimization class 0 is appropriate.

1 -

This class directs the optimizer to use a degree of optimization which is roughly comparable to DB2/6000 Version 1, plus some additional low-cost features not found in Version 1. In particular:

Non-uniform distribution statistics are not considered by the optimizer.
Only a subset of the query rewrite rules is applied, including those provided in DB2/6000 Version 1.
Greedy join enumeration is used (see "Search Strategies for Selecting Optimal Join").
List prefetch and index ANDing are disabled as access methods.

Optimization class 1 is quite similar to class 0 except that Merge Scan joins and table scans are also available.

2 -

This class directs the optimizer to use a degree of optimization which significantly improves upon that of class 1, while keeping the compilation cost significantly lower than classes 3 and above for complex queries. In particular:

All available statistics, including both frequency and quantile non-uniform distribution statistics, are utilized.
All of the query rewrite rules are applied, except computationally intensive rules which are applicable only in very rare cases.
Greedy join enumeration (see "Search Strategies for Selecting Optimal Join") is used.
A wide range of access methods are considered, including list prefetch.
The star join strategy is considered, if applicable.

Optimization class 2 is quite similar to class 5 except that it uses Greedy join enumeration rather than Dynamic Programming. This class has the most optimization of all the optimization classes that use the Greedy join enumeration algorithm, which considers fewer alternatives for complex queries, and therefore consumes less compilation time than classes 3 and above. It is therefore recommended for very complex queries in a decision support or on-line analytic processing (OLAP) environment. In such cases, there is a good chance the same query is executed infrequently, so that its access plan is unlikely to remain in the cache until the next occurrence of the query.

3 -

This class requests that a moderate amount of optimization be performed to generate an access plan. This class comes closest to matching the query optimization characteristics of DB2 for MVS/ESA or OS/390. This optimization class has the following characteristics:

Non-uniform distribution statistics, which track frequently occurring values, are used if available.
Most query rewrite rules, including subquery-to-join transformations, are applied.
Dynamic programming join enumeration (see "Search Strategies for Selecting Optimal Join"):
- Limited use of composite inner tables (see "Composite Tables")
- Limited use of Cartesian products for star schemas involving "look-up" tables (see "Search Strategies for Star Join")
A wide range of access methods are considered, including list prefetch and index ANDing.

This class is suitable for a broad range of applications. Using this class gives the optimizer a better chance of selecting an excellent access plan for queries with four or more joins. However, the optimizer might fail to consider a better plan which would be chosen with the default query optimization class.

5 -

This class directs the optimizer to use a significant amount of optimization to generate an access plan. For example, class 5 has the following characteristics:

All available statistics, including both frequency and quantile non-uniform distribution statistics, are used.
All of the query rewrite rules are applied, including the routing of queries to summary tables, except for those computationally intensive rules which are applicable only in very rare cases.
Dynamic programming join enumeration (see "Search Strategies for Selecting Optimal Join"):
- Limited use of composite inner tables (see "Composite Tables")
- Limited use of Cartesian products for star schemas involving "look-up" tables (see "Search Strategies for Star Join")
A wide range of access methods are considered, including list prefetch, index ANDing, and summary table routing.

When the optimizer detects that the additional resources and processing time are not warranted for complex dynamic SQL queries, optimization is reduced. The extent or size of the reduction is dependent on the machine size and the number of predicates.

When the query optimizer reduces the amount of query optimization performed, it continues to apply all the query rewrite rules that would normally be applied. However, it does use the greedy join enumeration method and reduces the number of access plan combinations that are considered.

Query optimization class 5 is an excellent choice for a mixed environment consisting of both transactions and complex queries. This optimization class has been designed to apply the most valuable query transformations and other query optimization techniques in an efficient manner.

7 -

This class directs the optimizer to use a significant amount of optimization to generate an access plan. It is the same as query optimization class 5 except that it does not reduce the amount of query optimization for complex dynamic SQL queries.

9 -

This class directs the optimizer to use all available optimization techniques. These include:

All available statistics
All query rewrite rules
All possibilities for join enumerations, including Cartesian products and unlimited composite inners
All access methods.

This class can greatly expand the number of possible access plans that are considered by the optimizer. This class should be used to determine whether more comprehensive optimization can generate a better access plan for very complex and very long-running queries using large tables. Explain and performance measurements should be used to verify that a better plan has been found.
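
As a hedged illustration of that verification step, the sketch below explains the same query under class 5 and under class 9, and then compares the optimizer's estimated cost for each plan. It assumes that the explain tables already exist in the current schema, that statements are terminated with semicolons (for example, a command line processor session started with the -t option), and that class 5 is still the default. ORDERS, CUSTOMERS, and their columns are hypothetical names used only for illustration.

```sql
-- ORDERS and CUSTOMERS are hypothetical tables, used for illustration only.

-- Capture the plan chosen at class 5 and tag it as query number 5.
SET CURRENT QUERY OPTIMIZATION = 5;
EXPLAIN PLAN SET QUERYNO = 5 FOR
  SELECT C.NAME, SUM(O.AMOUNT)
    FROM ORDERS O, CUSTOMERS C
   WHERE O.CUST_ID = C.CUST_ID
   GROUP BY C.NAME;

-- Capture the plan chosen at class 9 and tag it as query number 9.
SET CURRENT QUERY OPTIMIZATION = 9;
EXPLAIN PLAN SET QUERYNO = 9 FOR
  SELECT C.NAME, SUM(O.AMOUNT)
    FROM ORDERS O, CUSTOMERS C
   WHERE O.CUST_ID = C.CUST_ID
   GROUP BY C.NAME;

-- Compare the estimated cost of the two chosen plans
-- (EXPLAIN_LEVEL 'P' selects the plan-selection rows).
SELECT QUERYNO, TOTAL_COST
  FROM EXPLAIN_STATEMENT
 WHERE EXPLAIN_LEVEL = 'P'
   AND QUERYNO IN (5, 9)
 ORDER BY QUERYNO;

-- Return to class 5 (assumed here to be the default) for the rest of the session.
SET CURRENT QUERY OPTIMIZATION = 5;
```

A lower estimated cost at class 9 suggests, but does not prove, a better plan; measured run times are still needed to confirm it.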

How Do You Set the Optimization Class?

The way to request a specific query optimization class depends on whether you are using static or dynamic SQL.

Static SQL statements use the optimization class specified on the PREP and BIND commands. The QUERYOPT column in the SYSCAT.PACKAGES catalog table records the optimization class used to bind the package. If the package is rebound either implicitly or using the REBIND PACKAGE command, this same optimization class will be used for the static SQL statements. If you want to change the optimization class used for these static SQL statements, you must use the BIND command. If you do not specify the optimization class, DB2 uses the default optimization class as specified by the dft_queryopt database configuration parameter.
Dynamic SQL statements use the optimization class specified by the CURRENT QUERY OPTIMIZATION special register, which is set using the SQL SET statement. For example, the following statement sets the optimization class to 1:
```
   SET CURRENT QUERY OPTIMIZATION = 1
```
To ensure that a dynamic SQL statement always uses the same optimization class, you may want to include this SET statement in your application program. For more information, refer to the SQL Reference.
If the CURRENT QUERY OPTIMIZATION register has not been set, dynamic statements will be bound using the default query optimization class. For both dynamic and static SQL, the default (for the special register and for the bind option alike) is taken from the database configuration parameter DFT_QUERYOPT. Class 5 is the default query optimization class unless you have changed this parameter. (For more information on this parameter, see "Default Query Optimization Class (dft_queryopt)".)
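
The three places where the class can be set might be exercised as in the following sketch. It assumes a command line processor session that is connected to the database and was started with the -t option, so that semicolons terminate statements. The bind file myapp.bnd and the database name SAMPLE are hypothetical, and the class values are arbitrary.

```sql
-- Static SQL: bind a package with an explicit optimization class.
-- myapp.bnd is a hypothetical bind file.
BIND myapp.bnd QUERYOPT 3;

-- Check which class each package was bound with.
SELECT PKGSCHEMA, PKGNAME, QUERYOPT
  FROM SYSCAT.PACKAGES
 ORDER BY PKGSCHEMA, PKGNAME;

-- Dynamic SQL: inspect the special register, then change it
-- for the current connection.
VALUES CURRENT QUERY OPTIMIZATION;
SET CURRENT QUERY OPTIMIZATION = 2;

-- Change the default used when neither of the above is specified.
-- SAMPLE is a hypothetical database name.
UPDATE DATABASE CONFIGURATION FOR SAMPLE USING DFT_QUERYOPT 3;
```

The SET statement affects only the connection that issues it, which is why the text above suggests placing it in the application program.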

How Much Optimization is Necessary?

Most statements will be adequately optimized using a reasonable amount of resources with the default query optimization class. The query compilation time and resource consumption, at a given optimization class, are primarily influenced by the complexity of the query, particularly the number of joins and subqueries. However, compilation time and resource usage are also affected by the amount of optimization performed for the various optimization classes. For any optimization class, you can expect to see a greater difference in query compilation time and resource consumption for a very complex query than for a simple one.

The following may help you select which optimization class to use:

Start by using the default query optimization class.
If you wish to use a class other than the default, try class 1, 2 or 3 first.
Use a low optimization class (0 or 1) for queries having very short run-times, that is, queries taking less than one second. (See the following discussion for additional criteria about when to choose a low optimization class.)
Use optimization class 1 or 2 if the query joins many tables, many of the join predicates are on the same column, and compilation time is a concern.
Use a higher optimization class (3, 5, or 7) for long-running queries, that is, queries taking more than 30 seconds.
Under normal circumstances, you should not use optimization class 9.
For queries that run a long time, run the query using db2batch to determine how much of the time is spent in compilation and how much is spent in execution.
- If most of the time is spent in compilation then reduce the optimization class.
- If most of the time is spent in execution then consider a higher optimization class.

Note that query optimization classes 1, 2, 3, 5, and 7 are all suitable for general purpose use.

Only if you require further reductions in query compilation time and you know the kind of SQL (for example, extremely simple statements) that will be executed should you consider class 0. This SQL will tend to have the following characteristics:

Access to a single or only a few tables
Fetches a single or only a few rows
Uses fully qualified, unique indexes.

Online transaction processing (OLTP) transactions are good examples of this kind of SQL.
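
A statement of this kind might look like the following sketch. The ACCOUNTS table, its columns, and the unique index on ACCT_ID are hypothetical, and the example assumes semicolon-terminated statements.

```sql
-- Request the lowest compilation overhead for this connection.
SET CURRENT QUERY OPTIMIZATION = 0;

-- Hypothetical table ACCOUNTS with a unique index on ACCT_ID.
-- The statement touches one table, supplies the full unique key,
-- and therefore fetches at most one row.
SELECT BALANCE
  FROM ACCOUNTS
 WHERE ACCT_ID = 1001;
```

With the full key of a unique index supplied, there is little for the optimizer to choose between, so the minimal optimization of class 0 is unlikely to cost anything at run time.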

Complex queries may require different amounts of optimization to select the best access plan. You may wish to consider using higher optimization classes for queries exhibiting the following characteristics:

Access to large tables
A large number of predicates
Many subqueries
Many joins
Many set operators, such as UNION and INTERSECT
Many qualifying rows
GROUP BY and HAVING operations
Nested table expressions
A large number of views.

Decision support queries or month-end reporting queries against fully normalized databases are good examples of complex queries where at least the default query optimization class should be used.

Another reason to use higher query optimization classes is SQL which was produced by a query generator. Many query generators create SQL which is not efficient. Poorly written queries, including those produced by a query generator, may require additional optimization to make it possible to select a good access plan. Using query optimization class 2 and higher can improve poorly written SQL queries.

Whether the SQL is static or dynamic, and whether the same dynamic SQL is executed repeatedly, are also important considerations. For static SQL, the query compilation time and resources are expended just once and the resulting plan can be used many times. In general, static SQL should always use the default query optimization class. Dynamic statements are bound and executed at run time; therefore, you should consider whether the overhead of additional optimization for dynamic statements improves your overall performance. However, if the same dynamic SQL statement is executed repeatedly, the selected access plan is cached, so for the purposes of selecting a query optimization class the statement can be treated like a static SQL statement.

(Refer to the Embedded SQL Programming Guide for information on when to use static and dynamic SQL.)

If you think you have a query that could benefit from additional optimization, but you are not sure, or you are concerned about compilation time and resource usage, you may want to perform some benchmark testing. This testing can help you quantify the benefits obtained from different optimization classes. See Chapter 19, "Benchmark Testing", for general techniques and the specific use of the db2batch tool. When designing and running your benchmark test, consider whether the SQL statements in your application are static or dynamic:

For dynamic SQL statements, your testing should compare the average run time for the statement. You can use the following formula to help you calculate the average run time:

   compile time + sum of execution times for all iterations
   --------------------------------------------------------
                  number of iterations

where the number of iterations is the number of times that you expect the SQL statement to be executed each time it is compiled.
Note: Following the initial compilation, dynamic SQL statements are recompiled when a change to the environment requires it. Once cached, an SQL statement does not need to be compiled again, since subsequent PREPARE statements will re-use the cached statement as long as the environment does not change. (See "Catalog Cache Size (catalogcache_sz)" and "Package Cache Size (pckcachesz)" for information about a cache that can improve performance when working with dynamic SQL statements.)
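
As a worked example of the average run time formula, the sketch below applies it to two optimization classes. All figures are hypothetical and chosen only to make the arithmetic easy; they are not measurements. It supposes that the statement runs 10 times per compilation, that class 1 compiles in 0.2 seconds and executes in 0.8 seconds per iteration, and that class 5 compiles in 2.0 seconds and executes in 0.5 seconds per iteration.

```sql
-- Hypothetical timings in seconds, for illustration only.
-- average = (compile time + iterations * execution time) / iterations
VALUES ('class 1', (0.2 + 10 * 0.8) / 10),
       ('class 5', (2.0 + 10 * 0.5) / 10);
```

With these figures the averages work out to 0.82 seconds for class 1 and 0.70 seconds for class 5, so the higher class pays for its longer compilation. With only 2 iterations per compilation the same figures give 0.9 and 1.5 seconds, and the lower class would be the better choice.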

For static SQL statements, your testing should compare the statement run times.

Note:

While you may also be interested in the compile time of static SQL, the total (compile and run) time for the statement is difficult to use in any meaningful context. Comparing the total time does not recognize the fact that a static SQL statement can be run many times for each time it is bound and that it is generally not bound during run time.
