pg_squeeze: Optimizing PostgreSQL storage

10.2020 | Category: How To | Tags: administration, pg_squeeze

10.2020

Category: How To

Is your database growing at a rapid rate? Does your database system slow down all the time? And maybe you have trouble understanding why this happens? Maybe it is time to take a look at pg_squeeze and fix your database once and for all. pg_squeeze has been designed to shrink your database tables without downtime. No more need for VACUUM FULL - pg_squeeze has it all.

Table of Contents

The first question any PostgreSQL person will ask is: Why not use VACUUM or VACUUM FULL? There are various reasons: A normal VACUUM does not really shrink the table in disk. Normal VACUUM will look for free space, but it won’t return this space to the operating system. VACUUM FULL does return space to the operating system but it needs a table lock. In case your table is small this usually does not matter. However, what if your table is many TBs in size? You cannot simply lock up a large table for hours just to shrink it after table bloat has ruined performance. pg_squeeze can shrink large tables using only a small, short lock.

However, there is more. The following listing contains some of the operations pg_squeeze can do with minimal locking:

Shrink tables
Move tables and indexes from one tablespace to another
Index organize (“cluster”) a table
Change the on-disk FILLFACTOR

After this basic introduction it is time to take a look and see how pg_squeeze can be installed and configured.

PostgreSQL: Installing pg_squeeze

pg_squeeze can be downloaded for free from our GitHub repository. However, binary packages are available for most Linux distributions. If you happen to run Solar, AIX, FreeBSD or some other less widespread operating system just get in touch with us. We are eager to help.

After you have compiled pg_squeeze or installed the binaries some changes have to be made to postgresql.conf:

wal_level = logical
max_replication_slots = 10 # minimum 1
shared_preload_libraries = 'pg_squeeze'

wal_level = logical

max_replication_slots = 10 # minimum 1

shared_preload_libraries = 'pg_squeeze'

The most important thing is to set the wal_level to logical. Internally, pg_squeeze works as follows: It creates a new datafile (snapshot) and then applies changes made to the table while this snapshot is copied over. This is done using logical decoding. Of course logical decoding needs replication slots. Finally the library has to be loaded when PostgreSQL is started. This is basically it - pg_squeeze is ready for action.

Understanding table bloat in PostgreSQL

Before we dive deeper into pg_squeeze it is important to understand table bloat in general. Let us take a look at the following example:

test=# CREATE TABLE t_test (id int);
CREATE TABLE
test=# INSERT INTO t_test SELECT *
FROM generate_series(1, 2000000);
INSERT 0 2000000
test=# SELECT pg_size_pretty(pg_relation_size('t_test'));
pg_size_pretty
----------------
69 MB
(1 row)

test=# CREATE TABLE t_test (id int);

CREATE TABLE

test=# INSERT INTO t_test SELECT *

FROM generate_series(1, 2000000);

INSERT 0 2000000

test=# SELECT pg_size_pretty(pg_relation_size('t_test'));

pg_size_pretty

----------------

69 MB

(1 row)

Once we have imported 2 million rows the size of the table is 69 MB. What happens if we update these rows and simply add one?

test=# UPDATE t_test SET id = id + 1;
UPDATE 2000000
test=# SELECT pg_size_pretty(pg_relation_size('t_test'));
pg_size_pretty
----------------
138 MB
(1 row)

test=# UPDATE t_test SET id = id + 1;

UPDATE 2000000

test=# SELECT pg_size_pretty(pg_relation_size('t_test'));

pg_size_pretty

----------------

138 MB

(1 row)

The size of the table is going to double. Remember, UPDATE has to duplicate the row which of course eats up some space. The most important observation, however, is: If you run VACUUM the size of the table on disk is still 138 MB - storage IS NOT returned to the operating system. VACUUM can shrink tables in some rare instances. However, in reality the table is basically never going to return space to the filesystem which is a major issue. Table bloat is one of the most frequent reasons for bad performance, so it is important to either prevent it or make sure the table is allowed to shrink again.

PostgreSQL: Shrinking tables again

If you want to use pg_squeeze you have to make sure that a table has a primary key. It is NOT enough to have unique indexes - it really has to be a primary key. The reason is that we use replica identities internally, so we basically suffer from the same restrictions as other tools using logical decoding.
Let us add a primary key and squeeze the table:

test=# ALTER TABLE t_test ADD PRIMARY KEY (id);
ALTER TABLE
test=# SELECT squeeze.squeeze_table('public', 't_test', null, null, null);
squeeze_table
---------------
(1 row)

test=# ALTER TABLE t_test ADD PRIMARY KEY (id);

ALTER TABLE

test=# SELECT squeeze.squeeze_table('public', 't_test', null, null, null);

squeeze_table

---------------

(1 row)

Calling pg_squeeze manually is one way to handle a table. It is the preferred method if you want to shrink a table once. As you can see the table is smaller than before:

test=# SELECT pg_size_pretty(pg_relation_size('t_test'));
pg_size_pretty
----------------
69 MB
(1 row)

test=# SELECT pg_size_pretty(pg_relation_size('t_test'));

pg_size_pretty

----------------

69 MB

(1 row)

The beauty is that minimal locking was needed to do that.

Scheduling table reorganization

pg_squeeze has a builtin job scheduler which can operate in many ways. It can tell the system to squeeze a table within a certain timeframe or trigger a process in case some thresholds have been reached. Internally pg_squeeze uses configuration tables to control its behavior. Here is how it works:

test=# d squeeze.tables
Table 'squeeze.tables'
        Column    | Type             | Collation | Nullable | Default
------------------+------------------+-----------+----------+--------------------------------------------
 id               | integer          |           | not null | nextval('squeeze.tables_id_seq'::regclass)
 tabschema        | name             |           | not null |
 tabname          | name             |           | not null |
 clustering_index | name             |           |          |
 rel_tablespace   | name             |           |          |
 ind_tablespaces  | name[]           |           |          |
 free_space_extra | integer          |           | not null | 50
 min_size         | real             |           | not null | 8
 vacuum_max_age   | interval         |           | not null | '01:00:00'::interval
 max_retry        | integer          |           | not null | 0
 skip_analyze     | boolean          |           | not null | false
 schedule         | squeeze.schedule |           | not null |
Indexes:
'tables_pkey' PRIMARY KEY, btree (id)
'tables_tabschema_tabname_key' UNIQUE CONSTRAINT, btree (tabschema, tabname)
Check constraints:
'tables_free_space_extra_check' CHECK (free_space_extra >= 0 AND free_space_extra < 100) 
'tables_min_size_check' CHECK (min_size > 0.0::double precision)
Referenced by:
TABLE 'squeeze.tables_internal' CONSTRAINT 'tables_internal_table_id_fkey' FOREIGN KEY 
(table_id) REFERENCES squeeze.tables(id) ON DELETE CASCADE
TABLE 'squeeze.tasks' CONSTRAINT 'tasks_table_id_fkey' FOREIGN KEY 
(table_id) REFERENCES squeeze.tables(id) ON DELETE CASCADE
Triggers:
tables_internal_trig AFTER INSERT ON squeeze.tables FOR EACH ROW EXECUTE FUNCTION 
squeeze.tables_internal_trig_func()

test=# d squeeze.tables

Table 'squeeze.tables'

Column | Type | Collation | Nullable | Default

------------------+------------------+-----------+----------+--------------------------------------------

id | integer | | not null | nextval('squeeze.tables_id_seq'::regclass)

tabschema | name | | not null |

tabname | name | | not null |

clustering_index | name | | |

rel_tablespace | name | | |

ind_tablespaces | name[] | | |

free_space_extra | integer | | not null | 50

min_size | real | | not null | 8

vacuum_max_age | interval | | not null | '01:00:00'::interval

max_retry | integer | | not null | 0

skip_analyze | boolean | | not null | false

schedule | squeeze.schedule | | not null |

Indexes:

'tables_pkey' PRIMARY KEY, btree (id)

'tables_tabschema_tabname_key' UNIQUE CONSTRAINT, btree (tabschema, tabname)

Check constraints:

'tables_free_space_extra_check' CHECK (free_space_extra >= 0 AND free_space_extra < 100)

'tables_min_size_check' CHECK (min_size > 0.0::double precision)

Referenced by:

TABLE 'squeeze.tables_internal' CONSTRAINT 'tables_internal_table_id_fkey' FOREIGN KEY

(table_id) REFERENCES squeeze.tables(id) ON DELETE CASCADE

TABLE 'squeeze.tasks' CONSTRAINT 'tasks_table_id_fkey' FOREIGN KEY

(table_id) REFERENCES squeeze.tables(id) ON DELETE CASCADE

Triggers:

tables_internal_trig AFTER INSERT ON squeeze.tables FOR EACH ROW EXECUTE FUNCTION

squeeze.tables_internal_trig_func()

The last column here is worth mentioning: It is a custom data type capable of holding cron-style scheduling information. The custom data type looks as follows:

test=# d squeeze.schedule
Composite type 'squeeze.schedule'
Column         | Type             | Collation | Nullable | Default
---------------+------------------+-----------+----------+---------
minutes        | squeeze.minute[] |           |          |
hours          | squeeze.hour[]   |           |          |
days_of_month  | squeeze.dom[]    |           |          |
months         | squeeze.month[]  |           |          |
days_of_week   | squeeze.dow[]    |           |          |

test=# d squeeze.schedule

Composite type 'squeeze.schedule'

Column | Type | Collation | Nullable | Default

---------------+------------------+-----------+----------+---------

minutes | squeeze.minute[] | | |

hours | squeeze.hour[] | | |

days_of_month | squeeze.dom[] | | |

months | squeeze.month[] | | |

days_of_week | squeeze.dow[] | | |

If you want to make sure that pg_squeeze takes care of a table simple insert the configuration into the table:

test=# INSERT INTO squeeze.tables (tabschema, tabname, schedule) VALUES ('public', 't_test', 
('{30}', '{22}', NULL, NULL, '{3, 5}'));
INSERT 0 1

test=# INSERT INTO squeeze.tables (tabschema, tabname, schedule) VALUES ('public', 't_test',

('{30}', '{22}', NULL, NULL, '{3, 5}'));

INSERT 0 1

In this case public.t_test will be squeezed at 22:30h in the evening every 3rd and 5th day of the week. The main question is: When is that? In our setup days 0 and 7 are sundays. So 3 and 5 means wednesday and friday at 22:30h.
Let us check what the configuration looks like:

test=# x
Expanded display is on.
test=# SELECT *, (schedule).* FROM squeeze.tables;
-[ RECORD 1 ]----+----------------------
id               | 1
tabschema        | public
tabname          | t_test
clustering_index |
rel_tablespace   |
ind_tablespaces  |
free_space_extra | 50
min_size         | 8
vacuum_max_age   | 01:00:00
max_retry        | 0
skip_analyze     | f
schedule         | ({30},{22},,,'{3,5}')
minutes          | {30}
hours            | {22}
days_of_month    |
months           |
days_of_week     | {3,5}

test=# x

Expanded display is on.

test=# SELECT *, (schedule).* FROM squeeze.tables;

-[ RECORD 1 ]----+----------------------

id | 1

tabschema | public

tabname | t_test

clustering_index |

rel_tablespace |

ind_tablespaces |

free_space_extra | 50

min_size | 8

vacuum_max_age | 01:00:00

max_retry | 0

skip_analyze | f

schedule | ({30},{22},,,'{3,5}')

minutes | {30}

hours | {22}

days_of_month |

months |

days_of_week | {3,5}

Once this configuration is in place, pg_squeeze will automatically take care of things. Everything is controlled by configuration tables so you can easily control and monitor the inner workings of pg_squeeze.

Handling errors

If pg_squeeze decides to take care of a table it can happen that the reorg process actually fails. Why is that the case? One might drop a table and recreate it, the structure might change or pg_squeeze might not be able to get the brief lock at the end. Of course it is also possible that the tablespace you want to move a table too does not have enough space. There are many issues which can lead to errors. Therefore one has to track those reorg processes.
The way to do that is to inspect squeeze.errors:

test=# SELECT * FROM squeeze.errors;
 id | occurred | tabschema | tabname | sql_state | err_msg | err_detail
----+----------+-----------+---------+-----------+---------+------------
(0 rows)

test=# SELECT * FROM squeeze.errors;

----+----------+-----------+---------+-----------+---------+------------

(0 rows)

This log table contains all the relevant information needed to track things fast and easily.

Finally …

pg_squeeze is not the only Open Source tool we have published for PostgreSQL. If you are looking for a cutting edge scheduler we recommend taking a look at what pg_timetable has to offer.

One response to “pg_squeeze: Optimizing PostgreSQL storage”

Darshan Shah says:

October 20, 2020 at 9:25 am

Hello,
This is very nice tool with less locking we can reduce and free the space on operation system. I have two questions.

1) I'm using streaming replication with Postgresql 11. wal_level = hot_standby is set up. in this case Can I use pg_squeeze?

2) I have huge database with many tables. In this how we can run pg_squeeze on whole database instead of single table?

Thanks
Darshan Shah

Reply

pg_squeeze: Optimizing PostgreSQL storage

PostgreSQL: Installing pg_squeeze

Understanding table bloat in PostgreSQL

PostgreSQL: Shrinking tables again

Scheduling table reorganization

Handling errors

Finally …

One response to “pg_squeeze: Optimizing PostgreSQL storage”

Leave a Reply Cancel reply

Hans-Jürgen Schönig

Blog Tags

NEWSLETTER

Articles by our PostgreSQL Experts