Deep dive into postgres stats: Introduction

Deep dive into postgres stats: Introduction

Everything you always wanted to know about PostgreSQL stats

This blogpost is all about postgres activity stats. Stats are the very important since they allow us to understand what’s going on with the system. With that in mind, stats also have some weak points and that’s what I would like to discuss here.

In these blog series I will try to explain how to use stats effectively and how to detect and solve problems with their help. I will start with an overview and will move on discussion on specific usages with receipts and built in tools.

I hope my posts will provide you with some hands-on tools that will help you using postgres stats and will reassure you that stats aren’t as scary as they seems at first glance.
What is postgres? For people who are not familiar with postgres, it’s a bunch of processes in the ‘ps auxf’ output. What they can see are operating system’s metrics, such as CPU usage, memory or swap consumed by these processes, but nothing more. This, however, is not sufficient to effectively troubleshoot postgres. To do that one needs to know how postgres stats are collected throughout postgres lifetime and to also be able to properly use them.

If we look inside postgres and try to understand what it consists of, we see that there are many subsystems that work together and problems in one part may cause problems in the other parts. Generally, postgres can be splitted with two abstract parts, first is the servicing clients and second is the background service operations, e.g. vacuums, checkpoints, write ahead logs. Thus slow down in background operations will negatively affect servicing clients tasks and vice versa. Postgres activity stats are used for observing, predicting or eliminating possible problems, and almost all postgres parts have specific stats which describe what’s going on there.

Even provided all these strengths, stats also have a few weak points. First, is the fact that there are so many of them and one should know which source to use in each particular case. Second, almost all stats are provided as permanently incremented counters and often you can see billion values that say nothing to you. Third, stats don’t provide history, so there is no built-in way to check what happened five, ten or thirty minutes ago. And last and the least is that postgres doesn’t provide handy tool for working with stats, and end-user has to use only sql clients, such as psql.

Despite all that, stats are the first thing which can help you to troubleshoot postgres. Let’s take a closer look. Stats provide different types of information, these are

Events that happened in postgres instance (counters), such as table or index operations, number of commits and rollbacks, such as block hits, reads etc.
Database’s objects properties (current values), e.g. transactions or queries start time and states and relations sizes.
Time spent for reading and writing operations (counters).

Tracking stats are enabled by default and there are no additional steps for configuring them (except for stats that are provided by contrib modules). Stats interface based on functions and views, these functions and views available by default in all existing and new created databases. Thus getting stats is possible using psql or any other client and SELECT queries to these views.

As mentioned above, there are many stats functions and views and in addition postgres has multiple subsystems. This information can be represented in the following diagram.

In my next post in this series I will be focusing on particular stats view and will explain what kind of problems it allows to solve and how to do it.

Comments: 3

3 responses to “Deep dive into postgres stats: Introduction”

Tom Kincaid says:

February 21, 2017 at 7:53 pm

Nice blog, thanks for publishing!

Reply
Unknown says:

February 27, 2017 at 5:40 am

Nice Blog 😊

Reply
GRANT says:

February 28, 2017 at 9:45 pm

Really good! thanks for the sharing!

Reply

Package	Price*
PostgreSQL configuration analysis	from 500 €
Hardware and OS configuration analysis	from 600 €
Slow SQL queries analysis	from 500 €
Comprehensive performance Health Check*	from 3500 €
* For single PostgreSQL installation (1 master + 1 replica)

Basic	Premium	VIP
Incl. up to 5h/month DBA time	Incl. up to 15h/month DBA time	Incl. up to 30h/month DBA time
Slack, Mattermost, Telegram	Slack, Mattermost, Telegram	Slack, Mattermost, Telegram
Response time — 1 Hour	Response time — 1 Hour	Response time — 30 minutes
Up to 2 PostgreSQL instances	Up to 5 PostgreSQL instances	Up to 10 PostgreSQL instances

News & Blog back

Deep dive into postgres stats: Introduction

Everything you always wanted to know about PostgreSQL stats

You may also like:

Back from PGDay UK 2024

PITR and Streaming Replication environments

Handling Cancellation Request

Back from PGConf Belgium 2024

News & Blog back

Deep dive into postgres stats: Introduction

Everything you always wanted to know about PostgreSQL stats

You may also like:

Back from PGDay UK 2024

PITR and Streaming Replication environments

Handling Cancellation Request

Back from PGConf Belgium 2024

Experience required:

Key responsibilities:

Why choose Data Egret as your place of work?

We offer:

– PHASE 1 –

Audit and Optimisation

During this phase our DBA will be answering any questions regarding:

Training

– PHASE 2 –

Ongoing support

Who will you be working with?

What skills will you gain?

What will your job be like?

What do we expect from you?

What skills would be a plus?

Why choose Data Egret as your place of work?

We offer:

Questions?