Case-insensitive pattern matching in PostgreSQL

Performance comparison of case-insensitive search techniques
	`WHERE ... = 'abcd'`	`WHERE ... LIKE 'abcd%'`	`WHERE ... COLLATE "C" ILIKE 'abcd%'`	`WHERE ... COLLATE "C" ~* 'abcd%'` (`~` for `lower()`)
`citext`	540 ms	536 ms	1675 ms	2500 ms
`lower()`	9000 ms	9000 ms	3000 ms	3800 ms
`english_ci`	830 ms	ERROR	2000 ms	1940 ms

9 responses to “Case-insensitive pattern matching in PostgreSQL”

Anders Høgsbro Madsen says:

August 10, 2022 at 4:53 am

Thanks for good explanations. In my opinion case-insensitive ICU collations sounds good. It is OK with extra work when defining tables and columns. But it is not acceptable when the like operator doesn't work. In 2022 common queries should work without work arounds. Please just use the same solution as in MS SQL or MySQL to be competitive.

Reply
- laurenz says:
  
  November 4, 2022 at 7:56 am
  
  A late answer: I hear you, but PostgreSQL has always put correctness above all, even above performance and usability. We don't care if other databases have lower standards concerning correctness. PostgreSQL doesn't want to compete with "fast and loose" solutions. Some people love PostgreSQL for that.
  That said, I think everybody would be happy if you could point the way to a correct solution. How do MySQL and Microsoft SQL Server do it?
  
  Reply
twiggy79 says:

November 3, 2022 at 8:30 pm

I think ultimately the end user of postgres should be able to decide if they want to accept these limitations and say I think LIKE should work in the mysqlmssql way. 99.99% of the time that is acceptable and less clunky the citext, lower() indexes, etc.

In the case of FUSSBALL I'd create a searchable column and a "real" column. I've seen this a lot in french systems where they store the name in plain ASCII and use that column for sorting and searching. Since when folks show up and are asked their name the employee really doesn't want to be bother to ask about accents. In that case they store it upper case and then LIKE is pretty straightforward already.

However, I don't really want/need this since I'm simply storing ascii. I don't even allow unicode and have no interest in doing so. the rest of the world can hate me, but it's just not worth investment given the systems I integrate with wouldn't except it anyways and I end up throwing it away or losing data when I'm given Unicode.

I would say the in 2022 the biggest thing SQL engines could do is just interoperate with each other even if it means holding your nose and doing it. Would it be so bad for Postgres to support select top 1 * from table; Given the parser can't really be extended by extensions it makes a lot of app conversions harder then they need to be.... in the long run database freedom will lead to more folks on postgres, leading to more contribution, and a better world. I'd also argue having to add collate to every column is more than inconvenient and really just a missing feature of specifying database or schema level defaults.

Reply
- laurenz says:
  
  November 4, 2022 at 8:03 am
  
  You are ranting, and you might have a different opinion if you ever learned a foreign language.
  Concerning interoperability: there is an SQL standard, which is all about interoperability. The solution is not for PostgreSQL to implement random syntax introduced by database systems that don't care about the standard, it is the other way around.
  
  Reply
Mahfoud Bouabdallah says:

January 28, 2023 at 3:18 pm

Thanks for the great article.
Can I apply case insensitive in my current database (postgresql 15) I mean at database level

Reply
- laurenz says:
  
  January 30, 2023 at 7:50 am
  
  Would you care to elaborate what you mean?
  
  Reply
Jonathan Brune says:

April 12, 2023 at 9:53 pm

Case-insensitive search is a much-requested feature, partly (I suspect) to maintain compatibility with Microsoft SQL Server

It could be because I am coming from an MS-SQL background, but I don't understand why anyone would want, for example, Last_Name = 'DeRoSalia' to not return 'deroSalia', and 'DEROSALIA'. I am at a loss for when I would not want 'a' to equal 'A', other than a few edge cases. Better to program for those rather than the 99.9% where case doesn't matter. (I'm putting the German ß in the .01%. I would think that generally persons would want it to match on either ss or SS.)

Thank you for the article, very informative and a good summary of the options and pros and cons.

Edit: clarity and add thanks.

Reply
connector7177 says:

April 4, 2025 at 8:29 am

great to see the table comparing "query performance".

for completeness, I would like to see a similar table comparing "insert performance", how expensive to maintain the indexes.

Reply
- Laurenz Albe says:
  
  April 4, 2025 at 3:47 pm
  
  True! Join the club and write a blog!
  
  Reply

Case-insensitive pattern matching in PostgreSQL

Alternatives for case-insensitive search

Explicit conversion with `lower()` or `upper()`

Using the `citext` extension

Using case-insensitive ICU collations

The trouble with pattern matching and case-insensitive collations

The difficult case of German soccer

Case insensitive pattern matching in PostgreSQL v18

A semiotic aside

A solution for case-insensitive pattern matching

A performance test for case-insensitive comparisons

Conclusion

9 responses to “Case-insensitive pattern matching in PostgreSQL”

Leave a Reply Cancel reply

Laurenz Albe

Blog Tags

NEWSLETTER

Articles by our PostgreSQL Experts

Case-insensitive pattern matching in PostgreSQL

Alternatives for case-insensitive search

Explicit conversion with lower() or upper()

Using the citext extension

Using case-insensitive ICU collations

The trouble with pattern matching and case-insensitive collations

The difficult case of German soccer

Case insensitive pattern matching in PostgreSQL v18

A semiotic aside

A solution for case-insensitive pattern matching

A performance test for case-insensitive comparisons

Conclusion

9 responses to “Case-insensitive pattern matching in PostgreSQL”

Leave a Reply Cancel reply

Laurenz Albe

Blog Tags

NEWSLETTER

Articles by our PostgreSQL Experts

Explicit conversion with `lower()` or `upper()`

Using the `citext` extension