Join strategies and performance in PostgreSQL

	Nested Loop Join	Hash Join	Merge Join
Algorithm	For each row of the outer relation, scan the inner relation	Build a hash table from the inner relation, then scan the outer relation and probe the hash table for each row	Sort both relations on the join key, then merge the matching rows
Indexes that help	Index on the join keys of the inner relation	None	Indexes on the join keys of both relations
Good strategy if	the outer table is small	the hash table fits into `work_mem`	both tables are large
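
The "Algorithm" row of the table can be sketched in a few lines of Python. This is only an illustration of the three ideas, not how PostgreSQL implements them: it assumes small in-memory lists of dictionaries and an equality join on a single key, and it ignores `work_mem`, indexes, and batching. The function names and the sample rows `foo` and `bar` are hypothetical.

```python
from collections import defaultdict


def nested_loop_join(outer, inner, key):
    """For each row of the outer relation, scan the whole inner relation."""
    return [(o, i) for o in outer for i in inner if o[key] == i[key]]


def hash_join(outer, inner, key):
    """Build a hash table from the inner relation, then probe it per outer row."""
    table = defaultdict(list)
    for i in inner:
        table[i[key]].append(i)
    # .get() avoids creating empty buckets for outer keys with no match.
    return [(o, i) for o in outer for i in table.get(o[key], [])]


def merge_join(outer, inner, key):
    """Sort both relations on the join key, then merge them in one pass."""
    left = sorted(outer, key=lambda r: r[key])
    right = sorted(inner, key=lambda r: r[key])
    result, j = [], 0
    for o in left:
        # Skip inner rows whose key is smaller than the current outer key.
        while j < len(right) and right[j][key] < o[key]:
            j += 1
        # Emit every inner row with an equal key. j itself stays put so that
        # a duplicate outer key can re-read the same group of inner rows.
        k = j
        while k < len(right) and right[k][key] == o[key]:
            result.append((o, right[k]))
            k += 1
    return result


# Hypothetical sample data: both relations share the join key "a".
foo = [{"a": 1, "b": "x"}, {"a": 2, "b": "y"}, {"a": 2, "b": "z"}, {"a": 4, "b": "w"}]
bar = [{"a": 2, "c": "p"}, {"a": 3, "c": "q"}, {"a": 1, "c": "r"}, {"a": 2, "c": "s"}]

for join in (nested_loop_join, hash_join, merge_join):
    print(join.__name__, len(join(foo, bar, "a")))
```

All three functions return the same set of row pairs and differ only in how much work they do to find them. The nested loop compares every outer row with every inner row, the hash join reads each relation once but needs memory for the hash table, and the merge join pays for two sorts up front (which a suitable index could provide for free).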

7 responses to “Join strategies and performance in PostgreSQL”

rbohush says:

July 28, 2020 at 8:32 pm

Perfect article. Thanks a lot!

Rama says:

August 17, 2020 at 6:18 am

Nice Article. Fan of Cybertec.

Guillermo E. Villanueva says:

September 17, 2020 at 1:50 pm

Very good!

yhuelf says:

February 23, 2023 at 3:36 pm

Hi Laurenz,

Are you sure that the outer relation is always scanned sequentially with nested loop join?

Are you sure that both relations are always scanned sequentially with hash join?

I think that an index can be used for both relations and both strategies.

Here is an example:

    explain (costs off) select a,foo.b from foo join bar using(a) where foo.b = 'c81e728d9d4c2f636f067f89cc14862c';
                                    QUERY PLAN
    --------------------------------------------------------------------------
     Hash Join
       Hash Cond: (foo.a = bar.a)
       ->  Bitmap Heap Scan on foo
             Recheck Cond: (b = 'c81e728d9d4c2f636f067f89cc14862c'::text)
             ->  Bitmap Index Scan on foo_b_idx
                   Index Cond: (b = 'c81e728d9d4c2f636f067f89cc14862c'::text)
       ->  Hash
             ->  Index Only Scan using bar_a_idx on bar

But I might have misunderstood what you wrote. If not, I can paste the full test case somewhere.

Best regards,
Frédéric

- laurenz says:
  
  February 24, 2023 at 7:55 am
  
  By "outer relation" I mean "the thing that is on the outer side of the join". That relation does not necessarily have to be a "base relation" (a table); in your case it is a Bitmap Heap Scan. That "relation" has to be read in its entirety from front to back.
  
  I am aware that this use of the word "relation" is somewhat specific to PostgreSQL optimizer jargon and can lead to misunderstanding. I have reworked the definition of "relation" at the beginning of the article to clarify that "read the relation sequentially" does not have to mean a sequential scan on a table.
  
  - yhuelf says:
    
    February 24, 2023 at 1:47 pm
    
    OK I got it, thank you very much!
    
Minecraft App says:

April 13, 2025 at 7:56 pm

This blog post provides valuable insights into join strategies and their impact on performance in PostgreSQL. I found the detailed explanations and practical examples incredibly helpful for optimizing my queries. The comparisons between different join types were particularly eye-opening! Thank you for sharing such useful information!





Laurenz Albe
