|
|
PostgreSQL 8.2.9 Documentation | ||||
---|---|---|---|---|
Prev | Fast Backward | Chapter 25. Monitoring Database Activity | Fast Forward | Next |
PostgreSQL provides facilities to support dynamic tracing of the database server. This allows an external utility to be called at specific points in the code and thereby trace execution. Currently, this facility is primarily intended for use by database developers, as it requires substantial familiarity with the code.
A number of trace points, often called probes, are already inserted into the source code. By default these probes are disabled, and the user needs to explicitly tell the configure script to make the probes available in PostgreSQL.
Currently, only the DTrace utility is supported, which is only available on Solaris Express and Solaris 10+. It is expected that DTrace will be available in the future on FreeBSD and Mac OS X. Supporting other dynamic tracing utilities is theoretically possible by changing the definitions for the PG_TRACE macros in src/include/pg_trace.h.
By default, trace points are disabled, so you will need to explicitly tell the configure script to make the probes available in PostgreSQL. To include DTrace support specify --enable-dtrace to configure. See Section 14.5 for further information.
A few standard trace points are provided in the source code (of course, more can be added as needed for a particular problem). These are shown in Table 25-3.
Table 25-3. Built-in Trace Points
Name | Parameters | Overview |
---|---|---|
transaction__start | (int transactionId) | The start of a new transaction. |
transaction__commit | (int transactionId) | The successful completion of a transaction. |
transaction__abort | (int transactionId) | The unsuccessful completion of a transaction. |
lwlock__acquire | (int lockid, int mode) | An LWLock has been acquired. |
lwlock__release | (int lockid, int mode) | An LWLock has been released. |
lwlock__startwait | (int lockid, int mode) | An LWLock was not immediately available and a backend has begun to wait for the lock to become available. |
lwlock__endwait | (int lockid, int mode) | A backend has been released from its wait for an LWLock. |
lwlock__condacquire | (int lockid, int mode) | An LWLock was successfully acquired when the caller specified no waiting. |
lwlock__condacquire__fail | (int lockid, int mode) | An LWLock was not successfully acquired when the caller specified no waiting. |
lock__startwait | (int locktag_field2, int lockmode) | A request for a heavyweight lock (lmgr lock) has begun to wait because the lock is not available. |
lock__endwait | (int locktag_field2, int lockmode) | A request for a heavyweight lock (lmgr lock) has finished waiting (i.e., has acquired the lock). |
The example below shows a DTrace script for analyzing transaction counts on the system, as an alternative to snapshotting pg_stat_database before and after a performance test.
#!/usr/sbin/dtrace -qs postgresql$1:::transaction-start { @start["Start"] = count(); self->ts = timestamp; } postgresql$1:::transaction-abort { @abort["Abort"] = count(); } postgresql$1:::transaction-commit /self->ts/ { @commit["Commit"] = count(); @time["Total time (ns)"] = sum(timestamp - self->ts); self->ts=0; }
Note how the double underline in trace point names needs to be replaced by a hyphen when using D script. When executed, the example D script gives output such as:
# ./txn_count.d `pgrep -n postgres` ^C Start 71 Commit 70 Total time (ns) 2312105013
You should remember that trace programs need to be carefully written and debugged prior to their use, otherwise the trace information collected may be meaningless. In most cases where problems are found it is the instrumentation that is at fault, not the underlying system. When discussing information found using dynamic tracing, be sure to enclose the script used to allow that too to be checked and discussed.
New trace points can be defined within the code wherever the developer desires, though this will require a recompilation.
A trace point can be inserted by using one of the trace macros. These are chosen according to how many variables will be made available for inspection at that trace point. Tracing the occurrence of an event can be achieved with a single line, using just the trace point name, e.g.
PG_TRACE (my__new__trace__point);
More complex trace points can be provided with one or more variables for inspection by the dynamic tracing utility by using the PG_TRACEn macro that corresponds to the number of parameters after the trace point name:
PG_TRACE3 (my__complex__event, varX, varY, varZ);
The definition of the transaction__start trace point is shown below:
static void StartTransaction(void) { ... /* * generate a new transaction id */ s->transactionId = GetNewTransactionId(false); XactLockTableInsert(s->transactionId); PG_TRACE1(transaction__start, s->transactionId); ... }
Note how the transaction ID is made available to the dynamic tracing utility.
The dynamic tracing utility may require you to further define these trace points. For example, DTrace requires you to add new probes to the file src/backend/utils/probes.d as shown here:
provider postgresql { ... probe transaction__start(int); ... };
You should take care that the data types specified for the probe arguments match the datatypes of the variables used in the PG_TRACE macro. This is not checked at compile time. You can check that your newly added trace point is available by recompiling, then running the new binary, and as root, executing a DTrace command such as:
dtrace -l -n transaction-start