2
Optimizer Operations

This chapter expands on the ideas introduced in Chapter 1, "Introduction to the Optimizer" and explains optimizer actions in greater detail for specific cases. This chapter describes how the cost-based optimizer evaluates expressions and performs specific operations. It also explains how the CBO transforms some SQL statements into others to achieve the same goal more efficiently.

The chapter contains the following sections:

How the Optimizer Performs Operations

The optimizer fully evaluates expressions whenever possible and translates certain syntactic constructs into equivalent constructs. Constructs are translated in the following cases:

Oracle can more quickly evaluate the resulting expression than the original expression.
The original expression is a syntactic equivalent of the resulting expression. Sometimes, different SQL constructs can operate identically. For example, Oracle maps = ANY (subquery) and IN (subquery) to a single construct.

This section contains the following discussions:

How the CBO Evaluates IN-List Iterators

The IN-list iterator is used when a query contains an IN clause with values. The execution plan is identical to what would result for a statement with an equality clause instead of IN except for one additional step. That extra step occurs when the IN-list iterator feeds the equality clause with unique values from the IN-list.

Both of the statements in Example 2-1 and Example 2-1 are equivalent and produce the same plan.

Example 2-1 IN-List Iterators Initial Statement

SELECT header_id, line_id, revenue_amount
  FROM so_lines_all
 WHERE header_id IN (1011,1012,1013);

SELECT header_id, line_id, revenue_amount
  FROM so_lines_all
 WHERE header_id = 1011
    OR header_id = 1012
    OR header_id = 1013;

Plan
-------------------------------------------------
SELECT STATEMENT
 INLIST ITERATOR
  TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
   INDEX RANGE SCAN SO_LINES_N1

An equivalent statement in Example 2-1 binds :b1 to the different unique values by the IN-list iterator.

Example 2-2 Alternate Statement with Equality

SELECT header_id, line_id, revenue_amount
  FROM so_lines_all l
 WHERE header_id = :b1;

Plan
-------------------------------------------------
SELECT STATEMENT
 TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
  INDEX RANGE SCAN SO_LINES_N1

Example 2-3 uses a unique index. Because there is a sort involved on the IN-list, even with complete keys of unique indexes, there is still a range scan.

Example 2-3 IN-List Iterator with Unique Index

SELECT header_id, line_id, revenue_amount
  FROM so_lines_all
 WHERE line_id IN (1011,1012,1013);

Plan
-------------------------------------------------
SELECT STATEMENT
 INLIST ITERATOR
  TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
   INDEX RANGE SCAN SO_LINES_U1

In Example 2-4, the IN-list operator can be used when driving into a table with a nested loop operation.

Example 2-4 IN-List Iterator with a Nested Loop

SELECT h.header_id, l.line_id, l.revenue_amount
  FROM so_headers_all h, so_lines_all l
 WHERE l.inventory_item_id = :b1
   AND h.order_number = l.header_id
   AND h.order_type_id IN (1,2,3);

Plan
-------------------------------------------------
SELECT STATEMENT
 NESTED LOOPS
  TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
   INDEX RANGE SCAN SO_LINES_N5
  INLIST ITERATOR
   TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
    INDEX RANGE SCAN SO_HEADERS_U2

The IN-list operator is especially useful if there is an expensive first step that you do not want to repeat for every IN-list element. In Example 2-5, even though there are three IN-list elements, the full scan on so_lines_all happens only once.

Example 2-5 Using the IN-List Iterator to Avoid an Expensive First Step

SELECT h.header_id, l.line_id, l.revenue_amount
  FROM so_headers_all h, so_lines_all l
 WHERE l.s7 = :b1
   AND h.order_number = l.header_id
   AND h.order_type_id IN (1,2,3);

Plan
-------------------------------------------------
SELECT STATEMENT
 NESTED LOOPS
  TABLE ACCESS FULL SO_LINES_ALL
  INLIST ITERATOR
   TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
    INDEX RANGE SCAN SO_HEADERS_U2

When the Optimizer Uses IN-List Iterators

The optimizer uses an IN-list iterator when an IN clause is specified with values, and the optimizer finds a selective index for that column. If there are multiple OR clauses using the same index, then the optimizer chooses this operation rather than CONCATENATION or UNION ALL, because it is more efficient.

IN-List Iterator Hints

There are no hints for this operation. You can provide a hint to use the relevant index, which can cause this operation. Example 2-6 shows a query without an INDEX hint, and the resulting execution plan.

Example 2-6 Calling the IN-List Iterator without the INDEX Hint

SELECT h.customer_id, l.line_id, l.revenue_amount
  FROM so_lines_all l, so_headers_all h
 WHERE l.s7 = 20
   AND h.original_system_reference = l.attribute5
   AND h.original_system_source_code IN (1013,1014);

Plan
--------------------------------------------------
SELECT STATEMENT
 NESTED LOOPS
  TABLE ACCESS FULL SO_LINES_ALL
  TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
   INDEX RANGE SCAN SO_HEADERS_N5

Example 2-7 shows a query with an INDEX hint, and the resulting execution plan.

Example 2-7 Calling the IN-List Iterator with the INDEX Hint

SELECT /*+INDEX(h so_headers_n9 */ h.customer_id, l.line_id, l.revenue_amount
  FROM so_lines_all l, so_headers_all h
 WHERE l.s7 = 20
   AND h.original_system_reference = l.attribute5
   AND h.original_system_source_code IN (1013,1014);

Plan
--------------------------------------------------
SELECT STATEMENT
 NESTED LOOPS
  TABLE ACCESS FULL SO_LINES_ALL
  INLIST ITERATOR
   TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
    INDEX RANGE SCAN SO_HEADERS_N9

How the CBO Evaluates Concatenation

Concatenation is useful for statements with different conditions combined with an OR clause. With concatenation, you get a good execution plan with appropriate indexes. Examples 2-8 and 2-9 show two plans, each accessing the table through the appropriate index and combined using concatenation.

The plan in Example 2-8 does not return duplicate rows, so for each component it appends a negation of the previous components.

Example 2-8 How the CBO Evaluates Concatenation

SELECT l.header_id, l.line_id, l.revenue_amount
  FROM so_lines_all l
 WHERE l.parent_line_id = :b1
    OR l.service_parent_line_id = :b1;

Plan
-------------------------------------------------
SELECT STATEMENT
 CONCATENATION
  TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
   INDEX RANGE SCAN SO_LINES_N20
  TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
   INDEX RANGE SCAN SO_LINES_N17

In Example 2-8, the components are the following:

l.parent_line_id = :b1
l.service_parent_line_id = :b1 and l.parent_line_id != :b1

Example 2-9 shows how the optimizer uses concatenation to optimize a nested OR statement.

Example 2-9 Concatenation of a Query with a Nested OR Statement

SELECT p.header_id, l.line_id, l.revenue_amount
  FROM so_lines_all p , so_lines_all l
 WHERE p.header_id = :b1
   AND (l.parent_line_id = p.line_id
       OR l.service_parent_line_id = p.line_id);

Plan
-------------------------------------------------
SELECT STATEMENT
 CONCATENATION
  NESTED LOOPS
   TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
    INDEX RANGE SCAN SO_LINES_N1
   TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
    INDEX RANGE SCAN SO_LINES_N20
  NESTED LOOPS
   TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
    INDEX RANGE SCAN SO_LINES_N1
   TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
    INDEX RANGE SCAN SO_LINES_N17

Example 2-10 shows the resulting execution plan for the same query shown in Example 2-9, but with concatenation specifically disabled by the NO_EXPAND hint.

Example 2-10 Query with Nested OR with Concatenation Disabled

SELECT /*+NO_EXPAND */ p.header_id, l.line_id, l.revenue_amount
  FROM so_lines_all p, so_lines_all l
 WHERE p.header_id = :b1
   AND (l.parent_line_id = p.line_id
       OR l.service_parent_line_id = p.line_id);

Plan
-------------------------------------------------
SELECT STATEMENT
 NESTED LOOPS
  TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
   INDEX RANGE SCAN SO_LINES_N1
  TABLE ACCESS FULL SO_LINES_ALL

Trying to execute the statement in a single query produces a poor execution plan. Because the optimizer has two paths to follow and has been instructed not to decompose the query, it needs to access all the rows in the second table to see if any rows match one of the conditions.

Concatenation Hints

Use the hint USE_CONCAT for this operation.

When Not to Use Concatenation

Concatenation is expensive in the following cases and should not be used:

When OR conditions are on same column and can use the IN-list operator, which is more efficient than concatenation
If an expensive step gets repeated for every concatenation

Example 2-11 illustrates this point.

Example 2-11 When Not to Use Concatenation

Consider the following statement:

SELECT h.customer_id, l.line_id, l.revenue_amount
  FROM so_lines_all l, so_headers_all h
 WHERE l.s7 = 20
   AND h.original_system_reference = l.attribute5
   AND h.original_system_source_code IN (1013,1014);

Plan
--------------------------------------------------
SELECT STATEMENT
 NESTED LOOPS
  TABLE ACCESS FULL SO_LINES_ALL
  TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
   INDEX RANGE SCAN SO_HEADERS_N5

The initial statement calls for a full scan on so_lines_all as the first stage. The optimizer chooses to use a single column index for the second table, but we want it to use a two-column index.

You could use a hint to force concatenation as in shown in Example 2-12, but the initial full scan is still repeated which is not desirable.

Example 2-12 Using the Concatenation Hint

SELECT /*+USE_CONCAT*/ h.customer_id, l.line_id, l.revenue_amount
  FROM so_lines_all l, so_headers_all h
 WHERE l.s7 = 20
   AND h.original_system_reference = l.attribute5
   AND h.original_system_source_code IN (1013,1014);

Plan
--------------------------------------------------
SELECT STATEMENT
 CONCATENATION
  NESTED LOOPS
   TABLE ACCESS FULL SO_LINES_ALL
   TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
    INDEX RANGE SCAN SO_HEADERS_N9
  NESTED LOOPS
   TABLE ACCESS FULL SO_LINES_ALL
   TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
    INDEX RANGE SCAN SO_HEADERS_N9

If, instead, you provide a hint to use the two-column index, then the optimizer switches to that with an IN-list iterator. The initial scan is not repeated, and a better execution plan results, as follows:

Example 2-13 Using the Index Hint

SELECT /*+INDEX(h so_headers_n9 */ h.customer_id, l.line_id, l.revenue_amount
  FROM so_lines_all l, so_headers_all h
 WHERE l.s7 = 20
   AND h.original_system_reference = l.attribute5
   AND h.original_system_source_code IN (1013,1014);

Plan
--------------------------------------------------
SELECT STATEMENT
 NESTED LOOPS
  TABLE ACCESS FULL SO_LINES_ALL
  INLIST ITERATOR
   TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
    INDEX RANGE SCAN SO_HEADERS_N9

How the CBO Evaluates Remote Operations

The remote operation indicates that there is a table from another database being accessed through a database link. Example 2-14 has a remote driving table:

Example 2-14 How the CBO Evaluates a Query with a Remote Driving Table

SELECT c.customer_name, count(*)
  FROM ra_customers c, so_headers_all@oe h
 WHERE c.customer_id = h.customer_id
   AND h.order_number = :b1
GROUP BY c.customer_name;

Plan
--------------------------------------------------
SELECT STATEMENT
 SORT GROUP BY
  NESTED LOOPS
   REMOTE
   TABLE ACCESS BY INDEX ROWID RA_CUSTOMERS
    INDEX UNIQUE SCAN RA_CUSTOMERS_U1

Remote Database Query Obtained from the Library Cache

SELECT "ORDER_NUMBER","CUSTOMER_ID"
  FROM "SO_HEADERS_ALL" "H"
 WHERE "ORDER_NUMBER"=:"SYS_B_0";

Example 2-15 has a local driving table.

Example 2-15 How the CBO Evaluates a Query with a Local Driving Table

SELECT c.customer_name, h.order_number
  FROM ra_customers c, so_headers_all@oe h
 WHERE c.customer_id = h.customer_id
   AND c.customer_name LIKE :b1;

Plan
--------------------------------------------------
SELECT STATEMENT
 MERGE JOIN
  REMOTE
  SORT JOIN
   TABLE ACCESS BY INDEX ROWID RA_CUSTOMERS
    INDEX RANGE SCAN RA_CUSTOMERS_N1

Remote Database Query Obtained from the Library Cache

SELECT "ORDER_NUMBER","CUSTOMER_ID"
  FROM "SO_HEADERS_ALL" "H"
 WHERE "CUSTOMER_ID" IS NOT NULL  
ORDER BY "CUSTOMER_ID";

A couple of factors influence the execution plan:

Network round trips can be several orders of magnitude more expensive than the physical and logical I/Os.
The optimizer does not have any statistics on the remote database, which might not be an Oracle database.

In general, the optimizer chooses to access the remote tables first, before accessing the local tables. This works well for cases like Example 2-14, where the driving table is the remote table. However, if the driving table is the local table, then there might not be any selective way of accessing the remote table without first accessing the local tables. In such cases, you might need to provide appropriate hints to avoid performance problems.

Example 2-16 How the CBO Evaluates a Query with a Local Driving Table and a Nested Loops Hint

SELECT /*+USE_NL(c h) */ c.customer_name, h.order_number
  FROM ra_customers c, so_headers_all@oe h
 WHERE c.customer_id = h.customer_id
   AND c.customer_name LIKE :b1;

Plan
--------------------------------------------------
SELECT STATEMENT
 NESTED LOOPS
  TABLE ACCESS BY INDEX ROWID RA_CUSTOMERS
   INDEX RANGE SCAN RA_CUSTOMERS_N1
  REMOTE

Remote Database Query Obtained from the Library Cache

SELECT /*+ USE_NL("H") */ "ORDER_NUMBER","CUSTOMER_ID"
  FROM "SO_HEADERS_ALL" "H" WHERE :1="CUSTOMER_ID"
FILTER;

The construct used by the optimizer is applying a filter condition to filter out rows; this filter could not be applied when the table was accessed. Example 2-17 uses no filter.

Example 2-17 Querying a Remote Table Using No Filter

SELECT h.order_number
  FROM so_headers_all h
 WHERE h.open_flag = 'Y'
   AND attribute1 IS NOT NULL;

Plan
--------------------------------------------------
SELECT STATEMENT
 TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
  INDEX RANGE SCAN SO_HEADERS_N2

Besides the conditions used in the access path, a table might have additional conditions to filter rows when the table is visited. Conditions that get applied when the table is accessed, like attribute1 IS NOT NULL, do not show up as FILTER. Example 2-18 and Example 2-19 illustrate such conditions.

Example 2-18 shows a query with a GROUP BY condition that creates a filter.

Example 2-18 Querying with a Filter Created by a GROUP BY Condition

SELECT h.order_number, count(*)
  FROM so_headers_all h
 WHERE h.open_flag = 'Y'
   AND attribute1 IS NOT NULL
GROUP BY h.order_number
HAVING COUNT(*) = 1 ß Filter condition;

Plan
--------------------------------------------------
SELECT STATEMENT
 FILTER
  SORT GROUP BY
   TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
    INDEX RANGE SCAN SO_HEADERS_N2

Example 2-19 shows a query with a subquery that creates a filter.

Example 2-19 Querying with a Filter Created by a Subquery

SELECT h.order_number
  FROM so_headers_all h
 WHERE h.open_flag = 'Y'
   AND EXISTS (SELECT null FROM so_lines_all l
               WHERE l.header_id = h.header_id
               AND l.revenue_amount > 10000);

Plan
--------------------------------------------------
SELECT STATEMENT
 FILTER
  TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
   INDEX RANGE SCAN SO_HEADERS_N2
  TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
   INDEX RANGE SCAN SO_LINES_N1

In this example, for every row meeting the condition of the outer query, the correlated EXISTS subquery is executed. If a row meeting the condition is found in the so_lines_all table, then the row from so_headers_all is returned.

How the CBO Executes Distributed Statements

The optimizer chooses execution plans for SQL statements that access data on remote databases in much the same way that it chooses execution plans for statements that access only local data:

If all the tables accessed by a SQL statement are located together on the same remote database, then Oracle sends the SQL statement to that remote database. The remote Oracle instance executes the statement and sends only the results back to the local database.
If a SQL statement accesses tables that are located on different databases, then Oracle decomposes the statement into individual fragments, each of which accesses tables on a single database. Oracle then sends each fragment to the database that it accesses. The remote Oracle instance for each of these databases executes its fragment and returns the results to the local database, where the local Oracle instance can perform any additional processing that the statement requires.

When choosing a cost-based execution plan for a distributed statement, the optimizer considers the available indexes on remote databases, just as it considers indexes on the local database. The optimizer also considers statistics on remote databases for the CBO. Furthermore, the optimizer considers the location of data when estimating the cost of accessing it. For example, a full scan of a remote table has a greater estimated cost than a full scan of an identical local table.

For a rule-based execution plan, the optimizer does not consider indexes on remote tables.

See Also:

Chapter 6, "Optimizing SQL Statements" for more information on tuning distributed queries

How the CBO Executes Sort Operations

Sort operations result when users specify some operation that requires a sort. Commonly encountered operations include the following:

SORT UNIQUE

SORT UNIQUE occurs if a user specifies a DISTINCT clause (Example 2-20) or if an operation requires unique values for the next step (Example 2-21).

Example 2-20 DISTINCT Clause Causing SORT UNIQUE

SELECT DISTINCT last_name, first_name
 FROM per_all_people_f
 WHERE full_name LIKE :b1;

Plan
--------------------------------------------------
SELECT STATEMENT
 SORT UNIQUE
  TABLE ACCESS BY INDEX ROWID PER_ALL_PEOPLE_F
   INDEX RANGE SCAN PER_PEOPLE_F_N54

Example 2-21 IN Subquery Causing SORT UNIQUE

SORT UNIQUE provides the outer query with a unique list of header_ids. The plan shows that the IN subquery has been un-nested and transformed into VW_NSO_1.

SELECT c.customer_name, h.order_number
  FROM ra_customers c, so_headers_all h
 WHERE c.customer_id = h.customer_id
   AND h.header_id in
     (SELECT l.header_id FROM so_lines_all l
      WHERE l.inventory_item_id = :b1
      AND ordered_quantity > 10);

Plan
--------------------------------------------------
SELECT STATEMENT
 NESTED LOOPS
  NESTED LOOPS
   VIEW  VW_NSO_1
    SORT UNIQUE
     TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
      INDEX RANGE SCAN SO_LINES_N5
   TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
    INDEX UNIQUE SCAN SO_HEADERS_U1
  TABLE ACCESS BY INDEX ROWID RA_CUSTOMERS
   INDEX UNIQUE SCAN RA_CUSTOMERS_U1

If the optimizer can guarantee (with unique keys) that duplicate values will not be passed, then a sort can be avoided, as in Example 2-22.

Example 2-22 IN Subquery That Does Not Cause SORT UNIQUE

UPDATE so_lines_all l
   SET line_status = 'HOLD'
 WHERE l.header_id IN
     ( SELECT h.header_id FROM so_headers_all h
        WHERE h.customer_id = :b1);

Plan
--------------------------------------------------
UPDATE STATEMENT
 UPDATE  SO_LINES_ALL
  NESTED LOOPS
   TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
    INDEX RANGE SCAN SO_HEADERS_N1
   INDEX RANGE SCAN SO_LINES_N1

SORT AGGREGATE

SORT AGGREGATE does not actually involve a sort. It is used when aggregates are being computed across the whole set of rows, as shown in Example 2-23.

Example 2-23 Query Causing SORT AGGREGATE

SELECT SUM(l.revenue_amount)
  FROM so_lines_all l
 WHERE l.header_id = :b1;

Plan
--------------------------------------------------
SELECT STATEMENT
 SORT AGGREGATE
  TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
   INDEX RANGE SCAN SO_LINES_N1

SORT GROUP BY

SORT GROUP BY is used when aggregates are being computed for different groups in the data. The sort is required to separate the rows into different groups, as shown in Example 2-24.

Example 2-24 Query That Causes SORT GROUP BY

SELECT created_by, SUM(l.revenue_amount)
  FROM so_lines_all l
 WHERE header_id > :b1
GROUP BY created_by;

Plan
--------------------------------------------------
SELECT STATEMENT
 SORT GROUP BY
  TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
   INDEX RANGE SCAN SO_LINES_N1

SORT JOIN

SORT JOIN happens during a SORT MERGE JOIN, if the rows need to be sorted by the join key, as shown in Example 2-25.

Example 2-25 Query Causing SORT JOIN

SELECT SUM(l.revenue_amount), l2.creation_date
  FROM so_lines_all l, so_lines_All l2
 WHERE l.creation_date < l2.creation_date
   AND l.header_id <> l2.header_id
GROUP BY l2.creation_date, l2.line_id;

Plan
--------------------------------------------------
SELECT STATEMENT
 SORT GROUP BY
  MERGE JOIN
   SORT JOIN
    TABLE ACCESS FULL SO_LINES_ALL
   FILTER
    SORT JOIN
     TABLE ACCESS FULL SO_LINES_ALL

SORT ORDER BY

SORT ORDER BY is required when the statement specifies an ORDER BY that cannot be satisfied by one of the indexes, as shown in Example 2-26.

Example 2-26 Query Causing SORT ORDER BY

SELECT h.order_number
  FROM so_headers_all h
 WHERE h.customer_id = :b1
 ORDER BY h.creation_date DESC;

Plan
--------------------------------------------------
SELECT STATEMENT
 SORT ORDER BY
  TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
   INDEX RANGE SCAN SO_HEADERS_N1

How the CBO Executes Views

Either of the following can result in creation of a view by the CBO:

A complex view has not been decomposed (Example 2-27).
A temporary or inline view is being used (Examples 2-28 and 2-29).

Example 2-27 Query Causing a View

SELECT order_id
FROM orders
 WHERE customer_id = :b1
   AND revenue > :b2;

Plan
--------------------------------------------------
SELECT STATEMENT
 VIEW  ORDERS
  FILTER
   SORT GROUP BY
    NESTED LOOPS
     TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
      INDEX RANGE SCAN SO_HEADERS_N1
     TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
      INDEX RANGE SCAN SO_LINES_N1

In Example 2-28, a view is created because the IN subquery requires a SORT UNIQUE on the values being selected. This view would be unnecessary if the columns being selected were unique, not requiring a sort.

Example 2-28 IN Subquery Causing a View

SELECT c.customer_name, h.order_number
  FROM ra_customers c, so_headers_all h
 WHERE c.customer_id = h.customer_id
   AND h.header_id IN
     (SELECT l.header_id FROM so_lines_all l
      WHERE l.inventory_item_id = :b1
      AND ordered_quantity > 10);
Plan
--------------------------------------------------
SELECT STATEMENT
 NESTED LOOPS
  NESTED LOOPS
   VIEW  VW_NSO_1
    SORT UNIQUE
     TABLE ACCESS BY INDEX ROWID SO_LINES_ALL
      INDEX RANGE SCAN SO_LINES_N5
   TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
    INDEX UNIQUE SCAN SO_HEADERS_U1
  TABLE ACCESS BY INDEX ROWID RA_CUSTOMERS
   INDEX UNIQUE SCAN RA_CUSTOMERS_U1

The query in Example 2-29 examines the distribution of orders and revenue by the number of lines in each order. The CBO uses temporary inline views to do the double grouping.

Example 2-29 Inline Query Causing a View

SELECT COUNT(*) "Orders", cnt "Lines", sum(rev) "Revenue"
FROM (SELECT header_id, COUNT(*) cnt, SUM(revenue_amount) rev
        FROM so_lines_all
       GROUP BY header_id)
GROUP BY cnt;

Plan
--------------------------------------------------
SELECT STATEMENT
 SORT GROUP BY
  VIEW
   SORT GROUP BY
    TABLE ACCESS FULL SO_LINES_ALL

How the CBO Evaluates Constants

Computation of constants is performed only once, when the statement is optimized, rather than each time the statement is executed.

For example, the following conditions all test for monthly salaries greater than 2000:

salary > 24000/12
salary > 2000
salary*12 > 24000

If a SQL statement contains the first condition, then the optimizer simplifies it into the second condition.

Note:

The optimizer does not simplify expressions across comparison operators: In the preceding examples, the optimizer does not simplify the third expression into the second. For this reason, application developers write conditions that compare columns with constants whenever possible, rather than conditions with expressions involving columns.

How the CBO Evaluates the UNION and UNION ALL Operators

This operator is useful for combining OR clauses into one compound statement or for breaking up a complex statement into a compound statement containing simpler select statements that are easier to optimize and understand.

As with concatenation, you do not want to duplicate expensive operations by using UNION ALL.

When the Optimizer Uses UNION or UNION ALL

The optimizer uses UNION or UNION ALL when the SQL statement contains UNION or UNION ALL clauses.

Example 2-30 shows a query without a UNION clause. The query finds customers who are new or have open orders.

Example 2-30 Query Without UNION Clause

SELECT c.customer_name, c.creation_date
  FROM ra_customers c
 WHERE c.creation_date > SYSDATE - 30
    OR customer_id IN
     (SELECT customer_id FROM so_headers_all h
        WHERE h.open_flag = 'Y');

Plan
--------------------------------------------------
SELECT STATEMENT
 FILTER
  TABLE ACCESS FULL RA_CUSTOMERS
  TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
   INDEX RANGE SCAN SO_HEADERS_N1

Because the driving conditions come from different tables, you cannot execute the query effectively in a single statement.

With a UNION clause, you can break the query into two statements:

New customers
Customers with open orders

These two statements can be optimized easily. Because you do not want duplicates (some customers meet both criteria), use UNION, which eliminates duplicates by using a sort. If you use two disjoint sets, then you can use UNION ALL, eliminating the sort. The query from Example 2-30 is shown using UNION in Example 2-31.

Example 2-31 Query Using the UNION Clause

SELECT c.customer_name, c.creation_date
  FROM ra_customers c
 WHERE c.creation_date > SYSDATE - 30
UNION ALL  
SELECT c.customer_name, c.creation_date
  FROM ra_customers c
 WHERE customer_id IN
     (SELECT customer_id FROM so_headers_all h
        WHERE h.open_flag = 'Y');

Plan
--------------------------------------------------
SELECT STATEMENT
 SORT UNIQUE
  UNION-ALL
   TABLE ACCESS BY INDEX ROWID RA_CUSTOMERS
    INDEX RANGE SCAN RA_CUSTOMERS_N2
   NESTED LOOPS
    TABLE ACCESS BY INDEX ROWID SO_HEADERS_ALL
     INDEX RANGE SCAN SO_HEADERS_N2
    TABLE ACCESS BY INDEX ROWID RA_CUSTOMERS
     INDEX UNIQUE SCAN RA_CUSTOMERS_U1

UNION and UNION ALL Hints

There are no hints for this operation.

How the CBO Evaluates the LIKE Operator

The optimizer simplifies conditions that use the LIKE comparison operator to compare an expression with no wildcard characters into an equivalent condition that uses an equality operator instead.

In the following example, the optimizer simplifies the first condition into the second:

last_name LIKE 'SMITH'

is transformed into

last_name = 'SMITH'

The optimizer can simplify these expressions only when the comparison involves variable-length datatypes. For example, if last_name was of type CHAR(10), then the optimizer cannot transform the LIKE operation into an equality operation due to the equality operator following blank-padded semantics and LIKE not following blank-padded semantics.

How the CBO Evaluates the IN Operator

The optimizer expands a condition that uses the IN comparison operator to an equivalent condition that uses equality comparison operators and OR logical operators.

In the following example, the optimizer expands the first condition into the second:

last_name IN ('SMITH', 'KING', 'JONES')

is transformed into

last_name = 'SMITH' OR last_name = 'KING' OR last_name = 'JONES'

See Also:

"How the CBO Merges an IN Subquery"

How the CBO Evaluates the ANY or SOME Operator

The optimizer expands a condition that uses the ANY or SOME comparison operator followed by a parenthesized list of values into an equivalent condition that uses equality comparison operators and OR logical operators.

In the following example, the optimizer expands the first condition into the second:

salary > ANY (:first_sal, :second_sal)

is transformed into

salary > :first_sal OR salary > :second_sal

The optimizer transforms a condition that uses the ANY or SOME operator followed by a subquery into a condition containing the EXISTS operator and a correlated subquery.

In the following example, the optimizer transforms the first condition into the second:

x > ANY (SELECT salary
    FROM employees
    WHERE job_id = 'IT_PROG')

is transformed into

EXISTS (SELECT salary
    FROM employees
    WHERE job_id = 'IT_PROG'
      AND x > salary)

How the CBO Evaluates the ALL Operator

The optimizer expands a condition that uses the ALL comparison operator followed by a parenthesized list of values into an equivalent condition that uses equality comparison operators and AND logical operators.

In the following example, the optimizer expands the first condition into the second:

salary > ALL (:first_sal, :second_sal)

is transformed into

salary > :first_sal AND salary > :second_sal

The optimizer transforms a condition that uses the ALL comparison operator followed by a subquery into an equivalent condition that uses the ANY comparison operator and a complementary comparison operator. In the following example, the optimizer transforms the first condition into the second:

x > ALL (SELECT salary
    FROM employees
    WHERE department_id = 50)

is transformed into

NOT (x <= ANY (SELECT salary
      FROM employees
      WHERE department_id = 50) )

The optimizer then further transforms the second query into the following query using the rule for transforming conditions with the ANY comparison operator, followed by a correlated subquery:

NOT EXISTS (SELECT salary
      FROM employees
      WHERE department_id = 50
        AND x <= salary)

How the CBO Evaluates the BETWEEN Operator

The optimizer always replaces a condition that uses the BETWEEN comparison operator with an equivalent condition that uses the >= and <= comparison operators. In the following example, the optimizer replaces the first condition with the second:

salary BETWEEN 2000 AND 3000

is transformed into

salary >= 2000 AND salary <= 3000

How the CBO Evaluates the NOT Operator

The optimizer simplifies a condition to eliminate the NOT logical operator. The simplification involves removing the NOT logical operator and replacing a comparison operator with its opposite comparison operator. In Example 2-32, the optimizer simplifies the first condition into the second one:

Example 2-32 How the CBO Evaluates the NOT Operator

NOT department_id = (SELECT department_id FROM employees WHERE last_name = 'Taylor')

is transformed into

department_id <> (SELECT department_id FROM employees WHERE last_name = 'Taylor')

Often, a condition containing the NOT logical operator can be written in many different ways. The optimizer attempts to transform such a condition so that the subconditions negated by NOTs are as simple as possible, even if the resulting condition contains more NOTs. In Example 2-33, the optimizer simplifies the first condition into the second, and then into the third.

Example 2-33 How the CBO Simplifies a NOT Statement

NOT (salary < 1000 OR commission_pct IS NULL)

is transformed into

NOT salary < 1000 AND commission_pct IS NOT NULL

which is further transformed into

salary >= 1000 AND commission_pct IS NOT NULL

How the CBO Evaluates Transitivity

If two conditions in the WHERE clause involve a common column, then the optimizer sometimes can infer a third condition, using the transitivity principle. The optimizer can then use the inferred condition to optimize the statement. The inferred condition can make available an index access path that was not made available by the original conditions.

Note:

Transitivity is used only by the CBO.

Consider a WHERE clause containing two conditions of these forms:

WHERE column1 comp_oper constant
  AND column1 = column2

In this case, the optimizer infers the condition:

column2 comp_oper constant

where:

comp_oper is any of the comparison operators =, !=, ^=, <, <>, >, <=, or >=
constant is any constant expression involving operators, SQL functions, literals, bind variables, and correlation variables

In Example 2-34, the WHERE clause contains two conditions, each of which uses the employees.department_id column.

Example 2-34 Query Causing Transitivity Evaluation

SELECT *
  FROM employees, departments
  WHERE employees.department_id = 20
    AND employees.department_id = departments.department_id;

Using transitivity, the optimizer infers the following condition:

departments.department_id = 20

If an index exists on the departments.department_id column, then this condition makes available access paths using that index.

The optimizer only infers conditions that relate columns to constant expressions, rather than columns to other columns. Consider a WHERE clause containing two conditions of these forms:

WHERE column1 comp_oper column3
  AND column1 = column2

In this case, the optimizer does not infer the condition, column2 comp_oper column3.

How the CBO Optimizes Common Subexpressions

Common subexpression optimization is an optimization heuristic that identifies, removes, and collects common subexpressions from disjunctive (OR) branches of a query. In most cases, it reduces the number of joins to be performed. This optimization is enabled with the OPTIMIZER_FEATURES_ENABLE initialization parameter.

A query is considered valid for common subexpression optimization if its WHERE clause is in the following form:

The top-level is a disjunction (a list of ORed logs).
Each disjunct is either a simple predicate or a conjunction (a list of ANDed logs).
Each conjunct is either a simple predicate or a disjunction of simple predicates. (A predicate is considered simple if it does not contain AND or OR.)
The expression appears in all the disjunctive branches of the query.

Examples of Common Subexpression Optimization

The query in Example 2-35 finds names of employees who work in a department located in London and who make more than 60K or who are accountants. The query contains common subexpressions in its two disjunctive branches.

Example 2-35 Optimization of Common Subexpressions in Two OR Branches

Initial Query

SELECT employees.last_name
 FROM employees E, departments D
 WHERE (D.department_id = E.department_id AND E.job_id = 'AC_ACCOUNT' AND 
D.location = 2400)
  OR
  E.department_id = D.department_id AND E.salary > 60000 AND D.location = 2400);

The elimination of the common subexpressions transforms this query into the following query, reducing the number of joins from two to one.

Optimized Query

SELECT employees.last_name 
FROM employees E, departments D
WHERE (D.department_id = E.department_id AND D.location = 2400)
   AND (E.job_id = 'AC_ACCOUNT' OR E.salary > 60000);

The query in Example 2-36 contains common subexpression in three disjunctive branches.

Example 2-36 Optimization of Common Subexpressions in Three OR Branches

Initial Query

SELECT SUM (l_extendedprice* (1 - l_discount))
FROM PARTS, LINEITEM
WHERE (p_partkey = l_partkey
        AND p_brand = 'Brand#12'
        AND p_container IN ('SM CASE', 'SM BOX', 'SM PACK', 'SM PKG')
        AND l_quantity >= 1 AND l_quantity <= 1 + 10
        AND p_size >= 1 AND p_size <= 5
        AND l_shipmode IN ('AIR', 'REG AIR')
        AND l_shipinstruct = 'DELIVER IN PERSON')
    OR (l_partkey = p_partkey)
        AND p_brand = 'Brand#23'
        AND p_container IN ('MED BAG', 'MED BOX', 'MED PKG', 'MED PACK')
        AND l_quantity >= 10 AND l_quantity <= 10 + 10
        AND p_size >= 1 AND p_size <= 10 AND p_size BETWEEN 1 AND 10
        AND l_shipmode IN ('AIR', 'REG AIR')
        AND l_shipinstruct = 'DELIVER IN PERSON')
    OR (p_partkey = l_partkey
        AND p_brand = 'Brand#34'
        AND p_container IN ('LG CASE', 'LG BOX', 'LG PACK', 'LG PKG')
        AND l_quantity >= 20 AND l_quantity <= 20 + 10
        AND p_size >= 1 AND p_size <= 15
        AND l_shipmode IN ('AIR', 'REG AIR')
        AND l_shipinstruct = 'DELIVER IN PERSON');

This query is transformed by common subexpression optimization to the following query, reducing the number joins from three to one.

Optimized Query

SELECT SUM (l_extendedprice* (1 - l_discount))
FROM PARTS, LINEITEM
WHERE (p_partkey = l_partkey  /* these are the four common subexpressions */
    AND p_size >= 1
    AND l_shipmode IN ('AIR', 'REG AIR')
    AND l_shipinstruct = 'DELIVER IN PERSON')
    AND
        ((p_brand = 'Brand#12'
        AND p_container IN  ( 'SM CASE', 'SM BOX', 'SM PACK', 'SM PKG')
        AND l_quantity >= 1 AND l_quantity <= 1 + 10
        AND p_size <= 5)
    OR (p_brand = 'Brand#23'
        AND p_container IN ('MED BAG', 'MED BOX', 'MED PKG', 'MED PACK')
        AND l_quantity >= 10 AND l_quantity <= 10 + 10
        AND p_size <= 10)
    OR (p_brand = 'Brand#34'
        AND p_container IN ('LG CASE', 'LG BOX', 'LG PACK', 'LG PKG')
        AND l_quantity >= 20 AND l_quantity <= 20 + 10
AND p_size <= 15));

How the CBO Evaluates DETERMINISTIC Functions

In some cases, the optimizer can use a previously calculated value rather than executing a user-written function. This is only safe for functions that behave in a restricted manner. The function must return the same output return value for any given set of input argument values.

The function's result must not differ because of differences in the content of package variables or the database, or session parameters such as the globalization support parameters. Furthermore, if the function is redefined in the future, then its output return value must be the same as that calculated with the prior definition for any given set of input argument values. Finally, there must be no meaningful side effects to using a precalculated value instead of executing the function again.

The creator of a function can promise to the Oracle server that the function behaves according to these restrictions by using the keyword DETERMINISTIC when declaring the function with a CREATE FUNCTION statement or in a CREATE PACKAGE or CREATE TYPE statement. The server does not attempt to verify this declaration--even a function that obviously manipulates the database or package variables can be declared DETERMINISTIC. It is the programmer's responsibility to use this keyword only when appropriate.

Calls to a DETERMINISTIC function might be replaced by the use of an already calculated value when the function is called multiple times within the same query, or if there is a function-based index or a materialized view defined that includes a relevant call to the function.

See Also:

Oracle9i Application Developer's Guide - Fundamentals for more information on DETERMINISTIC functions
Oracle9i SQL Reference for descriptions of CREATE FUNCTION, CREATE INDEX, and CREATE MATERIALIZED VIEW
Oracle9i Database Concepts for a description of function-based indexes
Oracle9i Data Warehousing Guide for detailed information about materialized views

How the Optimizer Transforms SQL Statements

SQL is a very flexible query language; there are often many statements you could use to achieve the same goal. Sometimes, the optimizer (Query Transformer) transforms one such statement into another that achieves the same goal if the second statement can be executed more efficiently.

This section discusses the following topics:

How the CBO Transforms ORs into Compound Queries
How the CBO Unnests Subqueries
How the CBO Merges Views
How the CBO Pushes Predicates
How the CBO Executes Compound Queries

See Also:
"Understanding Joins" for additional information about optimizing statements that contain joins, semi-joins, or anti-joins

How the CBO Transforms ORs into Compound Queries

If a query contains a WHERE clause with multiple conditions combined with OR operators, then the optimizer transforms it into an equivalent compound query that uses the UNION ALL set operator, if this makes the query execute more efficiently:

If each condition individually makes an index access path available, then the optimizer can make the transformation. The optimizer chooses an execution plan for the resulting statement that accesses the table multiple times using the different indexes and then puts the results together.
If any condition requires a full table scan because it does not make an index available, then the optimizer does not transform the statement. The optimizer chooses a full table scan to execute the statement, and Oracle tests each row in the table to determine whether it satisfies any of the conditions.
For statements that use the CBO, the optimizer might use statistics to determine whether to make the transformation, by estimating and then comparing execution costs of the original statement and the resulting statement.
The CBO does not use the OR transformation for IN-lists or ORs on the same column; instead, it uses the INLIST iterator operator.

See Also:
"Understanding Access Paths for the RBO" and "How the CBO Chooses an Access Path" for information on access paths and how indexes make them available

Example 2-37 shows how an OR query is transformed into a compound query. In this example, the WHERE clause contains two conditions combined with an OR operator.

Example 2-37 Transforming an OR Query into a Compound Query

Initial Query

SELECT * 
  FROM employees 
  WHERE job_id = 'ST_CLERK' 
    OR department_id = 50;

If there are indexes on both the job_id and department_id columns, then the optimizer might transform this query into the following equivalent query:

Optimized Query

SELECT * 
  FROM employees 
  WHERE job_id = 'ST_CLERK' 
UNION ALL 
SELECT * 
  FROM employees 
  WHERE department_id = 50 
    AND job_id <> 'ST_CLERK';

When the CBO is deciding whether to make a transformation, the optimizer compares the cost of executing the original query using a full table scan with that of executing the resulting query.

The execution plan for the transformed statement might look like the illustration in Figure 2-1. The shaded boxes indicate steps that physically retrieve data and the clear boxes indicate steps that operate on data returned from the previous step.

Figure 2-1 Execution Plan for a Transformed Query Containing OR

Text description of pfgrf199.gif follows

Text description of the illustration pfgrf199.gif

To execute the transformed query, Oracle performs the steps in Figure 2-1 in the following order:

Steps 3 and 5 scan the indexes on the job_id and department_id columns using the conditions of the component queries. These steps obtain rowids of the rows that satisfy the component queries.
Steps 2 and 4 use the rowids from steps 3 and 5 to locate the rows that satisfy each component query.
Step 1 puts together the row sources returned by steps 2 and 4.

If either the job_id column or the department_id column is not indexed, then the optimizer does not even consider the transformation, because the resulting compound query would require a full table scan to execute one of its component queries. Executing the compound query with a full table scan in addition to an index scan could not possibly be faster than executing the original query with a full table scan.

Example 2-38 shows the kind of query that is not transformed. The query in this example assumes that there is an index on the last_name column only:

Example 2-38 Query Containing OR That Is Not Transformed

SELECT * 
  FROM employees 
  WHERE last_name = 'Smith' 
    OR salary > commission_pct*100000;

Transforming this query would result in the compound query in Example 2-39.

Example 2-39 Compound Query Containing OR That Is Transformed

SELECT * 
  FROM employees 
  WHERE last_name = 'Smith' 
UNION ALL 
SELECT * 
  FROM employees 
  WHERE salary > commission_pct*100000;

Because the condition in the WHERE clause of the second component query (salary > commission_pct) does not make an index available, the compound query requires a full table scan. For this reason, the optimizer does not make the transformation, but chooses a full table scan to execute the original statement.

How the CBO Unnests Subqueries

To optimize a complex statement, the optimizer chooses to do one of the following:

Transform the complex statement into an equivalent join statement, and then optimize the join statement
Optimize the complex statement as it is

The optimizer transforms a complex statement into a join statement whenever the resulting join statement is guaranteed to return exactly the same rows as the complex statement. This transformation allows Oracle to execute the statement by taking advantage of join optimizer techniques.

Example 2-40 shows how the optimizer uses a join to unnest a subquery. The complex statement in this example selects all rows from the orders table for which owners appear in the customers table.

Example 2-40 How the CBO Unnests Subqueries

Initial Query

SELECT * 
  FROM orders 
  WHERE customer_id IN 
    (SELECT customer_id FROM customers);

If the customer_id of the customers table is a primary key or has a UNIQUE constraint, then the optimizer can transform the complex query into the following join statement that is guaranteed to return the same data:

Optimized Query

SELECT orders.* 
  FROM orders, customers 
  WHERE orders.customer_id = customers.customer_id;

To execute this statement, Oracle performs a nested-loops join operation. The execution plan for this statement might look like Figure 2-2.

Figure 2-2 Execution Plan for a Nested Loops Join

Text description of pfgrf200.gif follows

Text description of the illustration pfgrf200.gif

If the optimizer cannot transform a complex statement into a join statement, then it chooses execution plans for the parent statement and the subquery as though they were separate statements. Oracle then executes the subquery and uses the rows returned to execute the parent query.

Example 2-41 shows the kind of statement is not transformed. The complex statement in this example returns all rows from the customers table that have credit limits greater than the average credit limit.

Example 2-41 Complex Statement That Is Not Transformed

SELECT * 
  FROM customers 
  WHERE credit_limit > 
    (SELECT AVG(credit_limit) FROM customers);

No join statement can perform the function of this statement, so the optimizer does not transform the statement.

Note:

Complex queries whose subqueries contain aggregate functions such as AVG cannot be transformed into join statements.

See Also:

"Understanding Joins" for information on nested loops joins

How the CBO Merges Views

To merge the view's query into a referencing query block in the accessing statement, the optimizer replaces the name of the view with the names of its base tables in the query block and adds the condition of the view's query's WHERE clause to the accessing query block's WHERE clause.

This optimization applies to select-project-join views, which are views that contain only selections, projections, and joins. This means that these views do not contain items such as set operators, aggregate functions, DISTINCT, GROUP BY, and CONNECT BY. See "Mergeable and Nonmergeable Views".

Example 2-42 shows how the CBO merges views. The view in this example is of all employees who work in department 10:

Example 2-42 How the CBO Merges Views

CREATE VIEW emp_10 
  AS SELECT employee_id, last_name, job_id, manager_id, hire_date, salary, 
commission_pct, department_id 
    FROM employees 
    WHERE department_id = 10;

The following query accesses the view. The query selects the Ids greater than 170 of employees who work in department 10:

SELECT employee_id 
  FROM emp_10 
  WHERE employee_id > 170;

The optimizer transforms the query into the following query that accesses the view's base table:

SELECT employee_id 
  FROM employees 
  WHERE department_id = 10 
    AND employee_id > 170;

If there are indexes on the department_id or employee_id columns, then the resulting WHERE clause makes them available.

Mergeable and Nonmergeable Views

The optimizer can merge a view into a referencing query block when the view has one or more base tables, provided the view does not contain any of the following:

Set operators (UNION, UNION ALL, INTERSECT, MINUS)
A CONNECT BY clause
A ROWNUM pseudocolumn
Aggregate functions (AVG, COUNT, MAX, MIN, SUM) in the select list

When a view contains one of the following structures, it can be merged into a referencing query block only if Complex View Merging is enabled:

A GROUP BY clause
A DISTINCT operator in the select list

View merging is not possible for a view that has multiple base tables, if it is on the right side of an outer join. However, if a view on the right side of an outer join has only one base table, then the optimizer can use complex view merging, even if an expression in the view can return a nonnull value for a NULL.

If a query has a CURSOR expression, then no view merging will take place, even for views which would normally be mergeable. An example is the following:

CREATE VIEW emp_v AS
       SELECT last_name,employee_id FROM employees; 
SELECT CURSOR(select * from sys.dual), last_name, employee_id from emp_v;

This query will not merge the emp_v view.

See Also:

"Understanding Joins"

Complex View Merging

If a view's query contains a GROUP BY clause or DISTINCT operator in the select list, then the optimizer can merge the view's query into the accessing statement only if complex view merging is enabled. Complex merging of a view with a GROUP BY clause is illustrated in Example 2-43.

Complex merging can also be used to merge an IN subquery into the accessing statement if the subquery is uncorrelated, as shown in Example 2-44. Complex merging is not cost-based; it must be enabled with the initialization parameter OPTIMIZER_FEATURES_ENABLE or the MERGE hint. Without this hint or parameter setting, the optimizer uses another approach, pushing predicates.

See Also:

"How the CBO Pushes Predicates" for an alternate approach
Chapter 5, "Optimizer Hints" for details about the MERGE and NO_MERGE hints

Example 2-43 How the CBO Merges a View with a GROUP BY Clause

The view avg_salary_view contains the average salaries for each department:

CREATE VIEW avg_salary_view AS 
  SELECT department_id, AVG(salary) AS avg_sal_dept, 
    FROM employees 
    GROUP BY department_id;

If complex view merging is enabled, then the optimizer can transform the following query, which finds the average salaries of departments in London:

SELECT departments.location_id, avg_sal_dept 
  FROM departments, avg_salary_view 
  WHERE departments.department_id = avg_salary_view.department_id 
    AND departments.location_id = 2400;

into the following query:

SELECT departments.loc, AVG(salary) 
  FROM departments, employees 
  WHERE departments.department_id = employees.department_id 
    AND departments.location_id = 2400 
  GROUP BY departments.rowid, departments.location_id;

The transformed query accesses the view's base table, selecting only the rows of employees who work in London and grouping them by department.

Example 2-44 How the CBO Merges an IN Subquery

Complex merging can be used for an IN clause with a noncorrelated subquery, as well as for views. The view min_salary_view contains the minimum salaries for each department:

CREATE VIEW min_salary_view AS
SELECT department_id, MIN(salary) min_sal
  FROM employees
 GROUP BY department_id;

If complex merging is enabled, then the optimizer can transform the following query, which finds all employees who earn the minimum salary for their department in London:

SELECT employees.last_name, employees.salary
  FROM employees, departments
 WHERE (employees.department_id, employees.salary) IN
         (select department_id, min_sal from min_salary_view)
   AND employees.department_id = departments.department_id
   AND departments.location_id = 2400;

into the following query (where e1 and e2 represent the employees table as it is referenced in the accessing query block and the view's query block, respectively):

SELECT e1.last_name, e1.salary 
  FROM employees e1, departments, employees e2 
 WHERE e1.department_id = departments.department_id 
   AND departments.location_id = 2400
   AND e1.department_id = e2.department_id 
 GROUP BY e1.rowid, departments.rowid, e1.last_name, e1.salary 
HAVING e1.salary = MIN(e2.salary);

How the CBO Pushes Predicates

The optimizer can transform a query block that accesses a nonmergeable view by pushing the query block's predicates inside the view's query. Examples 2-45 and 2-46 illustrate this process.

In Example 2-45, the two_emp_tables view is the union of two employee tables. The view is defined with a compound query that uses the UNION set operator.

Example 2-45 How the CBO Pushes Predicates - the UNION Set Operator

CREATE VIEW two_emp_tables 
  (employee_id, last_name, job_id, manager_id, hire_date, salary, 
                                   commission_pct, department_id) AS 
  SELECT employee_id, last_name, job_id, manager_id, hire_date, salary, 
commission_pct, department_id 
    FROM emp1 
   UNION  
  SELECT employee_id, last_name, job_id, manager_id, hire_date, 
                          salary, commission_pct, department_id 
    FROM emp2;

The following query accesses the view. The query selects the IDs and names of all employees in either table who work in department 50:

SELECT employee_id, last_name 
  FROM two_emp_tables 
 WHERE department_id = 50;

Because the view is defined as a compound query, the optimizer cannot merge the view's query into the accessing query block. Instead, the optimizer can transform the accessing statement by pushing its predicate, the WHERE clause condition (department_id = 50), into the view's compound query.

The resulting statement looks like the following:

SELECT employee_id, last_name 
  FROM ( SELECT employee_id, last_name, job_id, manager_id, hire_date, 
                                salary, commission_pct, department_id 
           FROM emp1 
          WHERE department_id = 50 
          UNION 
         SELECT employee_id, last_name, job_id, manager_id, hire_date, 
                                 salary, commission_pct, department_id 
           FROM emp2 
          WHERE department_id = 50 );

If there is an index on the department_id column, then the resulting WHERE clauses make it available.

Figure 2-3 shows the execution plan of the resulting statement.

Figure 2-3 Accessing a View Defined with the UNION Set Operator

Text description of pfgrf201.gif follows

Text description of the illustration pfgrf201.gif

To execute this statement, Oracle performs the steps in Figure 2-3 in the following order:

Steps 5 and 6 perform full scans of the emp1 and emp2 tables.
Step 4 performs a UNION-ALL operation returning all rows returned by either step 5 or step 6, including all copies of duplicates.
Step 3 sorts the result of step 4, eliminating duplicate rows.
Step 2 extracts the desired columns from the result of step 3.
Step 1 indicates that the view's query was not merged into the accessing query.

In Example 2-46, the view emp_group_by_deptno contains the department number, average salary, minimum salary, and maximum salary of all departments that have employees.

Example 2-46 How the CBO Pushes Predicates - the GROUP BY Clause

CREATE VIEW emp_group_by_deptno 
  AS SELECT department_id, 
        AVG(salary) avg_sal, 
        MIN(salary) min_sal, 
        MAX(salary) max_sal 
       FROM employees 
      GROUP BY department_id;

The following query selects the average, minimum, and maximum salaries of department 50 from the emp_group_by_deptno view:

SELECT * 
  FROM emp_group_by_deptno 
 WHERE department_id = 50;

The optimizer transforms the statement by pushing its predicate (the WHERE clause condition) into the view's query. The resulting statement looks like the following:

SELECT department_id, 
   AVG(salary) avg_sal, 
   MIN(salary) min_sal, 
   MAX(salary) max_sal, 
  FROM employees 
 WHERE department_id = 50 
 GROUP BY department_id;

If there is an index on the department_id column, then the resulting WHERE clause makes it available. Figure 2-4 shows the execution plan for the resulting statement. The execution plan uses an index on the department_id column.

Figure 2-4 Accessing a View Defined with a GROUP BY Clause

Text description of pfgrf202.gif follows

Text description of the illustration pfgrf202.gif

To execute this statement, Oracle performs the steps in Figure 2-4 in the following order:

Step 4 performs a range scan on the index emp_department_ix (an index on the department_id column of the employees table) to retrieve the rowids of all rows in the employees table with a department_id value of 50.
Step 3 accesses the employees table using the rowids retrieved by step 4.
Step 2 sorts the rows returned by step 3 to calculate the average, minimum, and maximum salary values.
Step 1 indicates that the view's query was not merged into the accessing query.

How the CBO Applies an Aggregate Function to the View

The optimizer can transform a query that contains an aggregate function (AVG, COUNT, MAX, MIN, SUM) by applying the function to the view's query.

The query in Figure 2-47 accesses the emp_group_by_deptno view defined in Figure 2-46. The query derives the averages for the average department salary, the minimum department salary, and the maximum department salary from the employee table.

Example 2-47 How the CBO Applies an Aggregate Function to a View

SELECT AVG(avg_sal), AVG(min_sal), AVG(max_sal) 
  FROM emp_group_by_deptno;

The optimizer transforms this statement by applying the AVG aggregate function to the select list of the view's query:

SELECT AVG(AVG(salary)), AVG(MIN(salary)), AVG(MAX(salary)) 
  FROM employees 
 GROUP BY department_id;

Figure 2-5 shows the execution plan of the resulting statement.

Figure 2-5 Applying Aggregate Functions to a View Defined with GROUP BY Clause

Text description of pfgrf203.gif follows

Text description of the illustration pfgrf203.gif

To execute this statement, Oracle performs the steps in Figure 2-5 in the following order:

Step 4 performs a full scan of the employees table.
Step 3 sorts the rows returned by step 4 into groups based on their department_id values and calculates the average, minimum, and maximum salary value of each group.
Step 2 indicates that the view's query was not merged into the accessing query.
Step 1 calculates the averages of the values returned by step 2.

How the CBO Executes Views in Outer Joins

For a view that is on the right side of an outer join, the optimizer can use one of two methods, depending on how many base tables the view accesses:

If the view has only one base table, then the optimizer can use view merging.
If the view has multiple base tables, then the optimizer can push the join predicate into the view.

How the CBO Accesses the View's Rows with the Original Statement

The optimizer cannot transform all statements that access views into equivalent statements that access base table(s). For example, if a query accesses a ROWNUM pseudocolumn in a view, then the view cannot be merged into the query, and the query's predicate cannot be pushed into the view.

To execute a statement that cannot be transformed into one that accesses base tables, Oracle issues the view's query, collects the resulting set of rows, and then accesses this set of rows with the original statement as though it were a table. Example 2-48 illustrates this process.

Example 2-48 How the CBO Accesses the View's Rows

Consider the emp_group_by_deptno view defined in Figure 2-46:

CREATE VIEW emp_group_by_deptno 
  AS SELECT department_id, 
        AVG(salary) avg_sal,
        MIN(salary) min_sal, 
        MAX(salary) max_sal 
       FROM employees 
      GROUP BY department_id;

The following query accesses this view. The query joins the average, minimum, and maximum salaries from each department represented in the view to the name and location of the department in the departments table.

SELECT emp_group_by_deptno.department_id, avg_sal, min_sal,
       max_sal, department_name, location_id
  FROM emp_group_by_deptno, departments 
 WHERE emp_group_by_deptno.department_id = departments.department_id;

Because there is no equivalent statement that accesses only base tables, the optimizer cannot transform this statement. Instead, the optimizer chooses an execution plan that issues the view's query and then uses the resulting set of rows as it would the rows resulting from a table access.

See Also:

"Understanding Joins" for more information on how Oracle performs a nested loops join operation

Figure 2-6 shows the execution plan for this statement.

Figure 2-6 Joining a View Defined with a GROUP BY Clause to a Table

Text description of pfgrf204.gif follows

Text description of the illustration pfgrf204.gif

To execute this statement, Oracle performs the steps in Figure 2-6 in the following order:

Step 4 performs a full scan of the employees table.
Step 3 sorts the results of step 4 and calculates the average, minimum, and maximum salary values selected by the query for the emp_group_by_deptno view.
Step 2 used the data from the previous two steps for a view.
For each row returned by step 2, step 6 uses the department_id value to perform a unique scan of the dept_id_pk index.
Step 5 uses each rowid returned by step 6 to locate the row in the departments table with the matching department_id value.
Oracle combines each row returned by step 2 with the matching row returned by step 5 and returns the result.

How the CBO Executes Compound Queries

To choose the execution plan for a compound query, the optimizer chooses an execution plan for each of its component queries, and then combines the resulting row sources with the union, intersection, or minus operation, depending on the set operator used in the compound query. Examples 2-49, 2-50 and 2-51 illustrate the process.

The query in Example 2-49 uses the UNION ALL operator to select all occurrences of all parts in either the orders1 table or the orders2 table.

Example 2-49 How the CBO Executes Compound Queries with `UNION` `ALL`

SELECT part FROM orders1 
 UNION ALL 
SELECT part FROM orders2;

Figure 2-7 shows the execution plan for this statement:

Figure 2-7 Compound Query with UNION ALL Set Operator

Text description of pfgrf205.gif follows

Text description of the illustration pfgrf205.gif

To execute this statement, Oracle performs the steps in Figure 2-7 in the following order:

Steps 2 and 3 perform full table scans on the orders1 and orders2 tables.
Step 1 performs a UNION-ALL operation returning all rows that are returned by either step 2 or step 3 including all copies of duplicates.

The query in Example 2-50 uses the UNION operator to select all parts that appear in either the orders1 or orders2 table.

Example 2-50 How the CBO Executes Compound Queries with UNION

SELECT part FROM orders1 
 UNION 
SELECT part FROM orders2;

Figure 2-8 shows the execution plan for this statement:

Figure 2-8 Compound Query with UNION Set Operator

Text description of pfgrf206.gif follows

Text description of the illustration pfgrf206.gif

This execution plan is identical to the one for the UNION ALL operator shown in Figure 2-7, except that in this case, Oracle uses the SORT operation to eliminate the duplicates returned by the UNION ALL operation.

The query in Example 2-51 uses the INTERSECT operator to select only those parts that appear in both the orders1 and orders2 tables.

Example 2-51 How the CBO Executes Compound Queries with INTERSECT

SELECT part FROM orders1 
INTERSECT 
SELECT part FROM orders2;

Figure 2-9 shows the execution plan for this statement:

Figure 2-9 Compound Query with INTERSECT Set Operator

Text description of pfgrf207.gif follows

Text description of the illustration pfgrf207.gif

To execute this statement, Oracle performs the steps in Figure 2-9 in the following order:

Steps 3 and 5 perform full table scans of the orders1 and orders2 tables.
Steps 2 and 4 sort the results of steps 3 and 5, eliminating duplicates in each row source.
Step 1 performs an INTERSECTION operation that returns only rows that are returned by both steps 2 and 4.

2 Optimizer Operations

How the Optimizer Performs Operations

How the CBO Evaluates IN-List Iterators

Example 2-1 IN-List Iterators Initial Statement

Example 2-2 Alternate Statement with Equality

Example 2-3 IN-List Iterator with Unique Index

Example 2-4 IN-List Iterator with a Nested Loop

Example 2-5 Using the IN-List Iterator to Avoid an Expensive First Step

When the Optimizer Uses IN-List Iterators

IN-List Iterator Hints

Example 2-6 Calling the IN-List Iterator without the INDEX Hint

Example 2-7 Calling the IN-List Iterator with the INDEX Hint

How the CBO Evaluates Concatenation

Example 2-8 How the CBO Evaluates Concatenation

Example 2-9 Concatenation of a Query with a Nested OR Statement

Example 2-10 Query with Nested OR with Concatenation Disabled

Concatenation Hints

When Not to Use Concatenation

Example 2-11 When Not to Use Concatenation

Example 2-12 Using the Concatenation Hint

Example 2-13 Using the Index Hint

How the CBO Evaluates Remote Operations

Example 2-14 How the CBO Evaluates a Query with a Remote Driving Table

Example 2-15 How the CBO Evaluates a Query with a Local Driving Table

Example 2-16 How the CBO Evaluates a Query with a Local Driving Table and a Nested Loops Hint

Example 2-17 Querying a Remote Table Using No Filter

Example 2-18 Querying with a Filter Created by a GROUP BY Condition

Example 2-19 Querying with a Filter Created by a Subquery

How the CBO Executes Distributed Statements

How the CBO Executes Sort Operations

SORT UNIQUE

Example 2-20 DISTINCT Clause Causing SORT UNIQUE

Example 2-21 IN Subquery Causing SORT UNIQUE

Example 2-22 IN Subquery That Does Not Cause SORT UNIQUE

SORT AGGREGATE

Example 2-23 Query Causing SORT AGGREGATE

SORT GROUP BY

Example 2-24 Query That Causes SORT GROUP BY

SORT JOIN

Example 2-25 Query Causing SORT JOIN

SORT ORDER BY

Example 2-26 Query Causing SORT ORDER BY

How the CBO Executes Views

Example 2-27 Query Causing a View

Example 2-28 IN Subquery Causing a View

Example 2-29 Inline Query Causing a View

How the CBO Evaluates Constants

How the CBO Evaluates the UNION and UNION ALL Operators

When the Optimizer Uses UNION or UNION ALL

Example 2-30 Query Without UNION Clause

Example 2-31 Query Using the UNION Clause

UNION and UNION ALL Hints

How the CBO Evaluates the LIKE Operator

How the CBO Evaluates the IN Operator

How the CBO Evaluates the ANY or SOME Operator

How the CBO Evaluates the ALL Operator

How the CBO Evaluates the BETWEEN Operator

How the CBO Evaluates the NOT Operator

Example 2-32 How the CBO Evaluates the NOT Operator

Example 2-33 How the CBO Simplifies a NOT Statement

How the CBO Evaluates Transitivity

Example 2-34 Query Causing Transitivity Evaluation

How the CBO Optimizes Common Subexpressions

Examples of Common Subexpression Optimization

Example 2-35 Optimization of Common Subexpressions in Two OR Branches

Example 2-36 Optimization of Common Subexpressions in Three OR Branches

How the CBO Evaluates DETERMINISTIC Functions

How the Optimizer Transforms SQL Statements

How the CBO Transforms ORs into Compound Queries

Example 2-37 Transforming an OR Query into a Compound Query

Figure 2-1 Execution Plan for a Transformed Query Containing OR

Example 2-38 Query Containing OR That Is Not Transformed

Example 2-39 Compound Query Containing OR That Is Transformed

How the CBO Unnests Subqueries

Example 2-40 How the CBO Unnests Subqueries

Figure 2-2 Execution Plan for a Nested Loops Join

Example 2-41 Complex Statement That Is Not Transformed

How the CBO Merges Views

Example 2-42 How the CBO Merges Views

Mergeable and Nonmergeable Views

2
Optimizer Operations

Example 2-49 How the CBO Executes Compound Queries with `UNION` `ALL`