Deep dive into postgres stats: pg_stat_all_tables
Everything you always wanted to know about Postgres stats
Today’s post is about pg_stat_all_tables. This view contains various statistics about table usage and may be useful in different scenarios. I’m going to discuss the following ones:
- Sequential scans.
- Table’s write activity.
- Autovacuum queue.
Sequential scans. One of the most useful pieces of information that you can get from pg_stat_all_tables is the number of scans. It shows how many times tables were accessed directly (sequentially) or through indexes, and how many rows were returned by these scans – this information is located in the seq_scan/seq_tup_read and idx_scan/idx_tup_fetch columns.
Here we need the first two columns, which show the number of times the tables were accessed through sequential scans and the number of tuples returned as a result.
Why care about sequential scans? It’s not a problem when a seqscan handles small tables, but on larger tables a sequential scan may read the whole table, and this can take a while. It also becomes an issue when Postgres handles many sequential scans concurrently and storage performance drops significantly. As a rule, the cause is a missing index for a new kind of query, inaccurate statistics used by the query planner, or simply a forgotten LIMIT clause in the query. Anyway, pg_stat_all_tables allows you to quickly check whether there are any sequential scans in the system. Using the mentioned columns we can write quite a simple query and get the right information:
SELECT schemaname, relname, seq_scan, seq_tup_read,
       seq_tup_read / seq_scan AS avg_seq_tup_read
FROM pg_stat_all_tables
WHERE seq_scan > 0
ORDER BY 5 DESC LIMIT 5;
 schemaname |        relname         | seq_scan | seq_tup_read | avg_seq_tup_read
------------+------------------------+----------+--------------+------------------
 public     | deals                  |      621 |  81712449358 |        131582044
 public     | client_balance         |       26 |    574164012 |         22083231
 public     | events_by_date_summary |     2698 |  57342287963 |         21253627
 public     | clients_summary        |     5924 |  91655173288 |         15471838
 public     | advertising_stats      |      505 |   5055606286 |         10011101
In this query we use the manually calculated avg_seq_tup_read – the average number of rows returned by one scan. You should pay attention to tables and queries where the average goes beyond a million rows per scan. You can also add the pg_relation_size() function to get an idea of the tables’ sizes and a rough estimate of the amount of data read during each scan.
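Following that suggestion, the same query can be extended with pg_relation_size() (wrapped in pg_size_pretty() for readability); one possible variant:

```sql
-- Top tables by average rows returned per sequential scan, with their size.
SELECT schemaname, relname, seq_scan, seq_tup_read,
       seq_tup_read / seq_scan AS avg_seq_tup_read,
       pg_size_pretty(pg_relation_size(relid)) AS rel_size
FROM pg_stat_all_tables
WHERE seq_scan > 0
ORDER BY avg_seq_tup_read DESC LIMIT 5;
```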
So, when sequentially accessed tables are found, you should identify which queries use these tables, review them, and try to fix the causes of the scans.
Table’s write activity. Table data input/output in Postgres is not a simple process, especially write operations. The INSERT and DELETE commands are simpler than UPDATE, because UPDATE doesn’t modify target rows in place – it inserts new versions of the rows and marks the old versions as removed. Also, if any indexes reference the updated rows, similar changes are made within those indexes. Thus, update operations aren’t as lightweight as they might seem. In other words, Postgres doesn’t like heavy update-intensive workloads. To alleviate the overhead caused by write operations, various improvements have been made, and the best known one is HOT (Heap-Only Tuples) updates, introduced in 8.3.
In short, HOT allows Postgres to leave index entries untouched when only non-indexed values are updated within rows. This, however, only works when free space (or space marked for reuse) is available within the page where the target rows are – HOT updates don’t work when the page is completely filled with rows.
What about pg_stat_all_tables? Using this view we can estimate the HOT update ratio for the most updated tables. Each table within this view has n_tup_upd and n_tup_hot_upd columns, which hold the total number of updated rows and the number of HOT updates for a particular table. Thus, the task comes down to finding the tables with the highest write activity and calculating their HOT rate. Example of the query can be found here.
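The linked query isn’t reproduced here, but a minimal sketch of the idea – the most updated tables and their share of HOT updates (not the exact query from the link) – might look like this:

```sql
-- HOT update ratio for the most update-heavy tables.
SELECT schemaname, relname, n_tup_upd, n_tup_hot_upd,
       round(100 * n_tup_hot_upd / n_tup_upd::numeric, 2) AS hot_ratio
FROM pg_stat_all_tables
WHERE n_tup_upd > 0
ORDER BY n_tup_upd DESC LIMIT 10;
```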
What’s next? Tables with a high HOT rate are the “good” tables; we should pay attention to tables with a high number of writes and a low or zero HOT rate. The general remedy for these is changing the fillfactor setting – it reserves free space in each page when new rows are inserted and the table is extended. The reserved space guarantees that rows can be updated within the same page, so there is a high chance that a HOT update will occur. The fillfactor setting can be changed on the fly with the ALTER TABLE command, and a good starting value for it is 70 or 80. Also, you need to know a few rules to work with fillfactor properly.
- The first: after the fillfactor change your table will take up more space on disk.
- The second: fillfactor is applied only to newly allocated pages, so if you want the new fillfactor for all of the table’s pages you will need to rebuild the table, and that might be quite painful (hello, VACUUM FULL).
- The third and last: fillfactor is useful only for queries that update non-indexed values; otherwise there won’t be any positive effect.
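For example, changing fillfactor on the fly (the table name is hypothetical; 70 reserves roughly 30% of each newly allocated page for updates):

```sql
-- Hypothetical table name; reserves ~30% of each new page for HOT updates.
ALTER TABLE clients_summary SET (fillfactor = 70);
-- Applies to newly allocated pages only; to rewrite all existing pages:
-- VACUUM FULL clients_summary;  -- takes an ACCESS EXCLUSIVE lock
```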
Autovacuum queue. Autovacuum is a really important feature in Postgres that keeps tables and indexes in good shape – it cleans up dead row versions, so with ineffective autovacuum, tables and indexes bloat, which permanently affects performance. The number of concurrently working autovacuum workers is limited by the autovacuum_max_workers parameter, which is 3 by default. When a database has a lot of tables with a high number of write operations, autovacuum_max_workers might become a bottleneck, and tables that require vacuuming might wait a long time before they are cleaned. Since Postgres 9.6 autovacuum can be observed with the new pg_stat_progress_vacuum view, but there is still no information about how many tables require vacuuming. Using information from other views, it’s possible to estimate the size of the so-called autovacuum queue.
I’d like to introduce another useful query for listing tables that require autovacuum. This query is rather long, so here is the link. Without getting into the nitty-gritty of how this query works, I’d like to mention a few key points to keep in mind:
- the query shows a list of tables that require a normal or wraparound vacuum, or an analyze.
- the query takes into account each table’s storage parameters.
- important to remember: the query also shows tables that are currently being processed by autovacuum.
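A heavily simplified sketch of the idea – not the linked query – assuming the default autovacuum_vacuum_threshold (50) and autovacuum_vacuum_scale_factor (0.2), and ignoring per-table storage parameters, wraparound vacuum, and analyze:

```sql
-- Tables whose dead tuples exceed the default autovacuum trigger threshold.
SELECT st.schemaname, st.relname, st.n_dead_tup,
       round(50 + 0.2 * c.reltuples) AS vacuum_threshold
FROM pg_stat_all_tables st
JOIN pg_class c ON c.oid = st.relid
WHERE st.n_dead_tup > 50 + 0.2 * c.reltuples
ORDER BY st.n_dead_tup DESC;
```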
Using this query, you can estimate the queue size and tune autovacuum accordingly. It’s fine when the queue is empty – it means autovacuum can cope with the amount of work. When the queue isn’t empty, it might be a good idea to configure autovacuum more aggressively. There are several possible ways to do that:
- increase autovacuum_max_workers – this allows more autovacuum workers to run concurrently.
- increase autovacuum_vacuum_cost_limit or vacuum_cost_limit – this allows more pages to be processed per round, so vacuum runs faster.
- decrease autovacuum_vacuum_cost_delay – this is the pause between rounds when autovacuum sleeps; reducing the delay lets vacuum rest less and work more.
- decrease autovacuum_vacuum_scale_factor and autovacuum_analyze_scale_factor – these are the fractions of dead rows, or of rows changed since the last analyze, used to calculate the threshold at which a vacuum or analyze of a table is triggered. After reducing these scale factors, vacuum and analyze will run more frequently on the tables that need them.
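As an illustration, a more aggressive setup might look like this in postgresql.conf (the values are examples, not recommendations – tune them against your own workload and check the defaults of your Postgres version):

```
autovacuum_max_workers = 6              # default: 3
autovacuum_vacuum_cost_limit = 1000     # default: -1 (falls back to vacuum_cost_limit)
autovacuum_vacuum_cost_delay = 10ms     # default: 20ms
autovacuum_vacuum_scale_factor = 0.05   # default: 0.2
autovacuum_analyze_scale_factor = 0.05  # default: 0.1
```

Note that most of these can also be set per table with ALTER TABLE … SET (autovacuum_vacuum_scale_factor = …), which is often safer than changing them globally.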
All these methods can be applied in various combinations, and the most important point here is to remember to check storage utilization, because an overly aggressive vacuum might generate a higher number of IO operations, which in turn might affect responsiveness and overall performance.
Finally, I’d like to point out that pg_stat_all_tables is quite a useful view, and this post by no means covers all possible use cases for it. Moreover, there is a similar pg_statio_all_tables view that contains information about tables’ buffer IO; you can join it with pg_stat_all_tables to make your stats queries even more informative.
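For instance, a sketch of such a join, showing heap blocks read from disk versus served from shared buffers alongside the scan counters:

```sql
-- Combine usage counters with buffer IO stats for the same tables.
SELECT s.schemaname, s.relname, s.seq_scan, s.idx_scan,
       io.heap_blks_read, io.heap_blks_hit
FROM pg_stat_all_tables s
JOIN pg_statio_all_tables io USING (relid)
ORDER BY io.heap_blks_read DESC LIMIT 10;
```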
Hope you enjoyed this post, if you have any questions please comment!