ITPub博客

首页 > Linux操作系统 > Linux操作系统 > Oracle Wait Interface

Oracle Wait Interface

原创 Linux操作系统 作者:yanggq 时间:2019-03-08 16:33:05 0 删除 编辑
From

Oracle Wait Events

the sample code accompanying this article (Listings 1-10)

The ability to analyze and correct Oracle Database physical read wait events is critical in any tuning project. The majority of activity in a database involves reading data, so this type of tuning can have a huge, positive impact on performance.

System wait tuning has become very popular because it can show you those wait events that are the primary bottleneck for your system. Some experts like the 10046 wait event (level 8 and higher) analysis technique and Oracle MetaLink now has an analysis tool called trcanlzr.sql (see ) to interpret bottlenecks via 10046 trace dumps. However, some Oracle professionals find dumps cumbersome and prefer to sample real-time wait events.

When doing wait analysis, it is critical to remember that all Oracle databases experience wait events, and that the presence of waits does not always indicate a problem. In fact, all well-tuned databases have some bottleneck. (For example, a computationally intensive database may be CPU-bound and a data warehouse may be bound by disk-read waits.) In theory, any Oracle database will run faster if access to hardware resources associated with waits is increased.

This article explores a small subset of wait analysis, but it illustrates a critical concept of Oracle tuning: the fact that all Oracle databases wait on some kind of system resource, and it's the Oracle professional's job to determine whether the database is I/O bound, CPU bound, memory bound, or bound waiting on latches or locks. When the source of the bottleneck has been identified, the savvy Oracle professional must then determine the causes of these events and attempt to remove them.

To aid us in identifying wait events, the Oracle Database provides numerous views such as v$system_event and v$session_wait to give us insight into our wait events. While the v$system_event dictionary views will give you information regarding the total number of I/O-related waits within your Oracle database, it does not tell you the specific object involved. In Oracle9i Release 2 the v$segment_statistics view gives this information. The v$session_wait view provides detailed file and block data and you can extract the object from the block number. Remember, Oracle event waits occur very quickly, and it is difficult to get data unless you are lucky enough to run the query at the exact moment the database is experiencing the wait. Hence, we must devise a method for using the v$session_wait view so we can capture a sample of the transient physical I/O waits.

If you look into the v$system_event view you will note that there are over 300 specific wait events. For the purpose of this article, we will limit our discussion to the two main physical I/O wait events. As you may recall, there are two critical I/O read waits within any Oracle database:

  • db file sequential read waits: A sequential read wait occurs within an Oracle database when a single block is read. A single read is most commonly an index probe by ROWID into an individual table, or the access of an index block. Sequential reads are single-block reads, as opposed to multiblock (scattered) reads.
  • db file scattered read waits: Scattered read waits occurs when multiblock I/O is invoked. When the Oracle Database performs a full-table scan or sort operation, multiblock block read is automatically invoked.

To tune these wait events, we must first identify those objects that experience physical read waits and when they do so, and then address the issue with tuning techniques. Let's start by reviewing the solutions, and then look at how to identify wait conditions.

Solutions to Physical Read Waits

When we have identified the objects that experience the physical read waits, we can use Statspack to extract the SQL associated with the waits and take the following actions to correct the problem. These corrective actions are presented in the order in which they are most likely to be effective, and some may not apply to your environment.

  • Tune the SQL: This is the single most important factor in reducing disk waits. If an SQL statement can be tuned to reduce disk I/O (for example, by using an index to remove an unnecessary large-table full-table scan), then the amount of disk I/O and associated waits are dramatically reduced. Other SQL tuning might include:
    • Change table join order: For sequential read waits, the SQL may be tuned to change the order that the tables are joined (often using the ORDERED hint)
    • Change indexes: You can tune the SQL by adding function-based indexes or using an INDEX hint to make the SQL less I/O-intensive by using a more selective index.
    • Change table join methods: Often, nested loop joins have fewer I/O waits than hash joins, especially for sequential reads. You can change table join methods with SQL hints (USE_NL, for example). If you are not yet using Oracle9i with pga_aggregate_target, you can change the propensity for hash join by adjusting the hash_area_size parameter.

  • Re-schedule contentious SQL: After you have identified the regular trends of repeating disk waits, you can often reschedule the execution of the SQL at another time, thereby relieving the physical waits.
  • Re-analyze schema using dbms_stats - In some cases, stale or non-representative statistics generated by the dbms_utility.analyze_schema package can cause suboptimal SQL execution plans, resulting in unnecessary disk waits. The solution is to use the dbms_stats package to analyze your schema. Also, note that if column data values are skewed adding histograms may also be necessary.
  • Distribute disk I/O across more spindles: Sometimes disk channel contention is responsible for physical read waits, which will show up in your disk monitor tool (iostat, EMC Symmetrics Manager, and so on). If you experience disk waits as a result of hardware contention, you can stripe the offending objects across multiple disk spindles by reorganizing the object and using the MINEXTENTS and NEXT parameters to stripe the object across multiple data files on multiple disks or use volume manager or I/O subsystem provided striping mechanisms.
  • Use the KEEP pool: For reducing scattered reads, many experts recommend implementing the KEEP pool. In the Oracle Magazine article "Advanced Tuning with Statspack" (Jan/Feb. 2003; see ), the author notes that that small-table full-table scans should be placed in the KEEP pool to reduce scattered read waits.
  • Increase the data buffer cache size: Obviously, the more data blocks we keep in RAM, the smaller the probability of read waits.

Of the above solutions, SQL tuning is clearly the most important in reducing physical read waits. In their landmark article "Diagnosing Performance with Statspack" (Oracle Magazine, Mar.April 2000; see ), Graham Wood and Connie Dialeris explain how to use Statspack to collect and analyze high-resource SQL statements. The stats$sqltext table keeps a record of historic SQL, and it is easy to extract the SQL that was executing at the time of the read waits. You can then gather the execution plans for the SQL statements and verify they are using an optimal execution plan.

Remember, one characteristic of suboptimal SQL execution is an unnecessary large-table full-table scan. For example, if you query only returns 10 rows, it would not be optimal to be performing a full-table scan on a 100 block table.

Now that we see the solutions, let's explore how we get the data we need to fix the causes of the physical read waits.

Collecting Real-Time Wait Events

This article should not be taken as a comprehensive approach to Oracle tuning, but it does provide tremendous insight into the source of some disk I/O waits. Given that the v$ views are accumulators, and we can only see the sum the total number of waits since the instance started, we must take a novel approach in order to capture the specific objects that are associated with the waits.

We can do so by using the v$session_wait view. As disk read waits occur within the Oracle Database, they appear in the v$session_wait view for a very brief period of time. Because of the transient appearance of read waits, it is impossible to catch all of the run-time waits. However, it is possible to take a frequent sample of the v$session_wait view and catch a representative sample of the system-waits details at the exact moment that the events occur.

The exciting thing about the v$session_wait view is that we can capture the exact time the wait occurred, and the file and block number that was being waited upon. When we have the file and block number, it's possible to determine the exact table or index where the wait occurred.

We need to start by defining a table to hold the information from the v$session_wait view. Listing 1 captures the salient columns within the v$session_wait view, including the event name, the wait time, the seconds in wait, and the all-important columns P1 and P2, where P1 is the file ID, P2 is the block number of the event that is being waited upon.

Now that we've defined a table to hold the information from v$session_wait, the next step is to write to INSERT statement that will capture the wait events as they occur. Please note that because of the transient nature of Oracle waits, on many occasions this INSERT statement will return no rows into the table. For the purpose of the simple example, we'll create this INSERT statement and then place it with a crontab file on our UNIX server so that it runs every minute. The simple script in Listing 2 should provide a brief representative sample of waits every 60 seconds.

Next, we must place this INSERT statement inside a script so that we can execute it from our cron (or using the AT command if you are on MS-Windows; see Listing 3).

Finally, here is the crontab entry on the UNIX server that will invoke the script every 60 seconds. That should be everything we need to do in order to begin collecting detailed information on run-time I/O waits:

# ************************************************************
# Run 60-second check for run-time waits
# ************************************************************
* * * * * /opt/oracle/oracheck/get_waits.ksh mysid > /tmp/wt.lst

After allowing the script run for several days, we should have a sufficient number of rows so that we can begin analyzing our run-time wait data. As we've noted from the v$session_wait view, it's very easy to display the tablespace names and the exact block names for each and every wait. Listing 4 shows the SQL to display the detail.

Here is the output from this script; we can see the exact time and block IDs when a physical I/O wait occurred:

System               Wait                            File         Block
Date                 Event                             ID            ID
-------------------- ------------------------------ ----- -------------
23/01/2003 17:40:02  db file sequential read            6       300,929
23/01/2003 18:00:03  db file sequential read            8        35,936
23/01/2003 21:04:02  db file sequential read            3        65,162
23/01/2003 21:08:02  db file sequential read            6        23,031
23/01/2003 21:09:01  db file sequential read            5        40,585
23/01/2003 21:10:01  db file sequential read            6       512,663
23/01/2003 21:11:02  db file sequential read            5        26,609
23/01/2003 21:12:01  db file sequential read            5        40,584

Next, we can easily roll-up the number of waits, organized by the type of wait and the hour of the day (see Listing 5). This is a very important part of the wait analysis because it will reveal those times when I/O tuning is required.

Below is the output from this script. Note that we have summed the real-time I/O waits by date and hour of the day, which can give us great insight into I/O processing times when the database is experiencing an I/O bottleneck. From this output we see that this system experiences a sequential I/O bottleneck each evening between 9:00PM and 10:00 PM:

                     Wait                            Wait
Date        Hr.      Event                          Count
-------------------- ------------------------------ -----
23-jan-2003 17       db file sequential read            1
23-jan-2003 18       db file sequential read            1
23-jan-2003 21       db file sequential read           53
23-jan-2003 22       db file sequential read            8
23-jan-2003 23       db file sequential read            7
24-jan-2003 00       db file sequential read            1
24-jan-2003 03       db file sequential read            1
24-jan-2003 05       db file sequential read            1
24-jan-2003 09       db file sequential read            4
24-jan-2003 11       db file sequential read            2
24-jan-2003 13       db file scattered read             1
24-jan-2003 13       db file sequential read            2
24-jan-2003 14       db file sequential read            2
24-jan-2003 17       db file sequential read            1
24-jan-2003 21       db file sequential read           75

Tracking I/O Waits on Specific Tables and Indexes

It should be clear that we still must be able to translate the file number and block number into a specific table or index name. We can do that by using the dba_extents view to determine the start block and end block for every extent in every table. Using dba_extents to identify the object and its data block boundaries, it becomes a trivial matter to read through our new table and identify those specific objects experiencing read waits or buffer busy waits. Now we add the segment name by joining into our dba_extents view (see Listing 6).

Here is the output from this script. Here we see all of the segments that have experienced more than 10 disk-read wait events:

Wait       Segment                             Segment            Wait
Event      Name                                Type              Count
---------- ----------------------------------- ---------- ------------
SEQ_READ   SYSPRD.S_EVT_ACT_F51                INDEX                72
SEQ_READ   SYSPRD.S_ACCNT_POSTN_M1             INDEX                41
SEQ_READ   SYSPRD.S_ASSET_M3                   INDEX                24
SEQ_READ   SYSPRD.S_ASSET_M51                  INDEX                19
SEQ_READ   SYSPRD.S_COMM_REQ_U1                INDEX                11

Here we see the exact indexes that are experiencing sequential read waits, and we now have an important clue for our SQL tuning or object redistribution strategy.

To finish the analysis, we next want to see all "hot blocks." We can do that by interrogating our stats$real_time_waits table, looking for any data blocks that have experienced multiple waits (see Listing 7). Here we see each segment, the exact block where the wait occurred, and the number of wait events:

                                                                Multiple
                                                                   Block
Wait       Segment                        Segment         Block     Wait
Event      Name                           Type           Number    Count
---------- ------------------------------ ---------- ---------- --------
SEQ_READ   SYSPRD.S_EVT_ACT_F51           INDEX         205,680        7
SEQ_READ   SYSPRD.S_EVT_ACT               TABLE         401,481        5
SEQ_READ   SYSPRD.S_EVT_ACT_F51           INDEX         471,767        5
SEQ_READ   SYSPRD.S_EVT_ACT               TABLE           3,056        4
SEQ_READ   SYSPRD.S_EVT_ACT_F51           INDEX         496,315        4
SEQ_READ   SYSPRD.S_DOC_ORDER_U1          INDEX          35,337        3

Next Steps

Download Oracle9i Database Rel 2:

Download trcanlzr.sql:

Read Don's "Oracle Wait Analysis Techniques" article at dbazine.com:

Read Rich Niemec's Oracle Magazine article, "Advanced Tuning with Statspack":

Read the Wood & Dialeris Oracle Magazine article, "Diagnosing Performance with Statspack":

This report is critical because it identifies those data blocks that have experienced multiple block waits. We can then go to each data block and see the contention on a segment header.

Trend-Base Event Wait Analysis

When we have the detailed event waits data, it is trivial to roll-up the data and create trend reports. The SQL in Listing 8 produces an hourly average of sequential read waits.

It is important to note that every database will have these "signatures," which are typically caused by regularly scheduled processing. When the signatures are identified, we must use Statspack to extract the SQL and ensure that it is properly optimized.

If the read waits persist, the next step is to gerrymander the schedule to execute the colliding SQL at different times. If you do not have the I/O bandwidth to run your full workload all at once, move some of the workload to different time window.

To display trend by day, a similar query may be run that will average the number of sequential read waits by day of the week (see Listing 9).

We can then quickly plot the I/O wait data (I used the Microsoft Excel chart wizard in this case) and see repeating trends within our database (see ). Best of all, we have the detailed information in our stats$real_time_waits view so that we can investigate the exact table or index that is experiencing the real-time wait. If we do that in conjunction with Statspack, we may also collect the SQL in the stats$sql_summary table and see the SQL that is precipitating the disk wait events.

Figure 1: Plotting real-time waits averages by hour of the day

In this case, we see a high number of real-time 'db file sequential read waits' waits from 2:00AMÐ3:00AM, and another spike between 9:00PM and midnight each night. We can use this information to go to Statspack and extract the SQL that was running during this period.

Using the supplied script, we can also average to read waits by day of the week (see ). Here we see a clear increase in scattered read waits every Tuesday and Thursday and we can extract the SQL during these periods.

Figure 2: Plotting real-time waits averages by day of the week

Normally, this insight would not be very useful because we could not see the source of the waits. (Of course, if we are using Oracle9i Rel 2, we can use the v$segment_statistics view to see some of this information if we set the statistics_level parameter to a value of 7 or higher.)

Regardless, because we have the original wait detail data stored in stats$real_time_waits, we can easily see the offending objects (see Listing 10).

We can now drill-in and see those specific table and indexes that were experiencing the sequential read waits.

                                                                Block
                     Segment                        Segment     Wait
Date        Hr.      Name                           Type        Count
-------------------- ------------------------------ ----------  --------
23-jan-2003 21       SYSPRD.S_COMM_REQ_SRC_U1       INDEX       23
23-jan-2003 21       SYSPRD.S_EVT_ACT               TABLE       44 
23-jan-2003 21       SYSPRD.S_EVT_ACT_F51           INDEX       16
23-jan-2003 22       SYSPRD.S_EVT_ACT               TABLE       32

It is important that we know the specific object that experiences the physical read wait, because we may wish to distribute the object over additional disk spindles.

Search and Tune

Using a real-time wait sampling method, you can easily capture the details about the objects that experience physical read waits. When they're identified, we can then use Statspack to locate the offensive SQL and begin the tuning. The tuning of physical read waits is SQL tuning, object striping across multiple disks, employing the KEEP pool for small objects, re-scheduling the SQL to relieve the contention or increasing the data buffer cache size.

Special thanks are in order to Graham Wood, the Oracle Corporation super-guru and creator of Statspack. Graham was instrumental in ensuring that this article was complete, accurate and useful.

Donald K. Burleson [] is one of the world's most widely-read Oracle database experts. He has written 17 books, published more than 100 articles in national magazines, and serves as editor-in-chief of Oracle Internals, a leading Oracle database journal. Burleson's latest book is Creating a self-tuning Oracle Database by Rampant Techpress. Don's Web sites are and .

Advice for Oracle9i Users

If you're using Oracle9i, you can utilize the row_wait_obj# column of the v$session view to immediately gather the object number of the object that precipitated the disk-read wait. (This column exists in Oracle8i but is not populated.)

The following scripts can be used in Oracle9i to populate the object ID. This approach speeds-up reports against the stats$real_time_waits table because the latter no longer has the expensive join into the dba_extents view.

9i_cr_real_waits_tab.sql

-- Waits Monitoring Load Script
connect perfstat/perfstat
create table stats$real_time_waits
(
   SAMPLE_TIME       DATE,
   EVENT             VARCHAR2(64),
   WAIT_TIME         NUMBER,
   SECONDS_IN_WAIT   NUMBER,
   P1                NUMBER,
   P2                NUMBER,
   ROW_WAIT_OBJ#     NUMBER
);

9i_get_real_waits.sql

insert into perfstat.stats$real_time_waits

(  SAMPLE_TIME,   event,   wait_time,   seconds_in_wait,   P1,   P2,   row_wait_obj#)
select 
   SYSDATE,   event,   wait_time,   seconds_in_wait,   P1,   P2,   row_wait_obj#
from
  v$session_wait w,
  v$session      s
where
   w.sid = s.sid
and
  event in ('db file sequential read',
            'db file scattered read');
commit;
exit;

来自 “ ITPUB博客 ” ,链接:http://blog.itpub.net/26706/viewspace-64595/,如需转载,请注明出处,否则将追究法律责任。

请登录后发表评论 登录
全部评论

注册时间:2002-10-29

  • 博文量
    78
  • 访问量
    55119