Have you ever heard about cursors in PostgreSQL, or in SQL in general? If not, you should definitely read this article in depth and learn how to easily reduce memory consumption in PostgreSQL. Cursors have been around for many years and are, in my judgement, one of the most underappreciated features of all time. It therefore makes sense to take a closer look at cursors and see what they can be used for.

The purpose of a cursor in PostgreSQL

Consider the following example:

test=# CREATE TABLE t_large (id int);
CREATE TABLE
test=# INSERT INTO t_large 
	SELECT * FROM generate_series(1, 10000000);
INSERT 0 10000000

I have created a table containing 10 million rows so that we can play with the data. Let us run a simple query now:

test=# SELECT * FROM t_large;
    id    
----------
        1
        2
        3
…

The first thing you will notice is that the query does not return immediately. There is a reason for that: PostgreSQL will send all the data to the client, and the client will only return once ALL the data has been received. If you happen to select a couple of thousand rows, life is good and everything will be just fine. However, what happens if you run "SELECT * …" on a table containing 10 billion rows? Usually the client will die with an "out of memory" error, taking your application with it. There is no way to keep such a large table in memory, and throwing ever more RAM at the problem is not feasible either (and pretty stupid too).
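The same problem is easy to reproduce on the client side. Here is a minimal Python analogy (no database involved; the function names are mine): materializing every row at once needs memory for the whole result set, while streaming keeps only one row in memory at a time.

```python
# Analogy only: a list materializes every "row" at once, a generator streams them.
def all_rows(n):
    return list(range(1, n + 1))      # needs memory for all n rows at once

def streamed_rows(n):
    for i in range(1, n + 1):         # yields one row at a time
        yield i

# Summing 10 million streamed rows never holds more than one row in memory:
total = sum(streamed_rows(10_000_000))
```

Cursors give you exactly this streaming behavior on the database side.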

Using DECLARE CURSOR and FETCH

DECLARE CURSOR and FETCH can come to the rescue. What is the core idea? We can fetch data in small chunks and only prepare the data at the time it is fetched – not earlier. Here is how it works:

test=# BEGIN;
BEGIN
test=# DECLARE mycur CURSOR FOR 
	SELECT * FROM t_large WHERE id > 0;
DECLARE CURSOR
test=# FETCH NEXT FROM mycur;
 id 
----
  1
(1 row)

test=# FETCH 4 FROM mycur;
 id 
----
  2
  3
  4
  5
(4 rows)

test=# COMMIT;
COMMIT

The first important thing to notice is that a cursor can only be declared inside a transaction. The second important thing is that DECLARE CURSOR itself is lightning fast: it does not compute the data yet, but only prepares the query so that the data can be produced when you call FETCH. To gather all the data from the server you simply run FETCH until the result set is empty. At the end you can simply commit the transaction.
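In application code this FETCH loop usually looks like the sketch below. It is plain Python with a stand-in for the database round trip; in real code `fetch_batch` would send something like `FETCH 1000 FROM mycur` through your driver (the helper names here are illustrative, not part of PostgreSQL):

```python
def drain_cursor(fetch_batch, handle_rows):
    """Call fetch_batch() until it returns an empty batch, mirroring
    'run FETCH until the result set is empty'. Returns the total row count."""
    total = 0
    while True:
        rows = fetch_batch()
        if not rows:              # empty result set: the cursor is exhausted
            break
        handle_rows(rows)
        total += len(rows)
    return total

# Stand-in for the server side: hands out rows in chunks of 4, like FETCH 4.
def make_fake_cursor(data, chunk=4):
    it = iter(data)
    def fetch_batch():
        batch = []
        for _ in range(chunk):
            try:
                batch.append(next(it))
            except StopIteration:
                break
        return batch
    return fetch_batch

seen = []
n = drain_cursor(make_fake_cursor(range(1, 11)), seen.extend)
```

The loop terminates precisely because an exhausted cursor returns an empty result set, which is the condition the article describes.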

Note that a cursor is closed on commit as you can see in the next listing:

test=# FETCH 4 FROM mycur;
ERROR:  cursor "mycur" does not exist
test=#

The FETCH command is way more powerful than most people think. It allows you to navigate your result set and fetch rows as desired:

test=# \h FETCH
Command:     FETCH
Description: retrieve rows from a query using a cursor
Syntax:
FETCH [ direction [ FROM | IN ] ] cursor_name

where direction can be empty or one of:

    NEXT
    PRIOR
    FIRST
    LAST
    ABSOLUTE count
    RELATIVE count
    count
    ALL
    FORWARD
    FORWARD count
    FORWARD ALL
    BACKWARD
    BACKWARD count
    BACKWARD ALL

Cursors are an easy and efficient way to retrieve data from the server. However, you have to keep one thing in mind: latency. Asking the network for one row at a time adds considerable overhead. It therefore makes sense to fetch data in reasonably large chunks. I found it useful to fetch 10,000 rows at a time: 10,000 rows easily fit in memory while still keeping the networking overhead reasonably low. Of course, I highly encourage you to run your own experiments and see what works best in your specific case.
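The trade-off is easy to quantify. Assuming the 10-million-row table from above, the number of FETCH round trips is simply the row count divided by the chunk size, rounded up (a back-of-the-envelope sketch, not a measurement):

```python
import math

def round_trips(total_rows, chunk_size):
    """Number of FETCH commands needed to drain total_rows."""
    return math.ceil(total_rows / chunk_size)

rows = 10_000_000
per_row = round_trips(rows, 1)        # one row per FETCH: latency dominates
chunked = round_trips(rows, 10_000)   # 10,000 rows per FETCH: far fewer trips
```

Going from chunk size 1 to 10,000 cuts the number of round trips by a factor of 10,000, which is why per-row fetching feels so slow over a network.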


Cursors and the PostgreSQL optimizer

Cursors are treated by the optimizer in a special way. If you are running a "normal" statement, PostgreSQL will optimize for total runtime: it assumes that you really want all the data and optimizes accordingly. However, in the case of a cursor it assumes that only a fraction of the data will actually be consumed by the client. The following example shows how this works:

test=# CREATE TABLE t_random AS 
	SELECT random() AS r 
	FROM generate_series(1, 1000000);
SELECT 1000000
test=# CREATE INDEX idx_random ON t_random (r);
CREATE INDEX
test=# ANALYZE ;
ANALYZE

I have created a table containing 1 million random rows and a simple index on top of it. To make sure the example works, I have told the optimizer that index scans are super expensive (random_page_cost):

test=# SET random_page_cost TO 100;
SET

Let us take a look at an example now: if the query is executed as a cursor, you will notice that PostgreSQL goes for an index scan to speed up the creation of the first 10% of the data. If the entire result set is fetched at once, PostgreSQL opts for a sequential scan and a sort instead, because the index scan is considered too expensive:

test=# BEGIN;
BEGIN
test=# explain DECLARE cur CURSOR FOR SELECT * FROM t_random ORDER BY r;
                                        QUERY PLAN                                         
-------------------------------------------------------------------------------------------
 Index Only Scan using idx_random on t_random  (cost=0.42..732000.04 rows=1000000 width=8)
(1 row)

test=# explain SELECT * FROM t_random ORDER BY r;
                                     QUERY PLAN                                      
-------------------------------------------------------------------------------------
 Gather Merge  (cost=132326.50..229555.59 rows=833334 width=8)
   Workers Planned: 2
   ->  Sort  (cost=131326.48..132368.15 rows=416667 width=8)
         Sort Key: r
         ->  Parallel Seq Scan on t_random  (cost=0.00..8591.67 rows=416667 width=8)
(5 rows)

test=# COMMIT;
COMMIT

The main question arising now is: how does the optimizer know that the first 10% should be fast and that we are not looking for the entire result set? A runtime setting, cursor_tuple_fraction, controls this behavior:

test=# SHOW cursor_tuple_fraction;
 cursor_tuple_fraction 
-----------------------
 0.1
(1 row)

The default value is 0.1, which means that PostgreSQL optimizes for the first 10% of the result set. The parameter can easily be changed in postgresql.conf, or just for your current session using SET.

Using cursors across transactions

So far you have seen that a cursor can only be used inside a transaction: COMMIT or ROLLBACK will destroy the cursor. However, in some (usually rare) cases it can be necessary to have cursors that survive the end of a transaction. Fortunately, PostgreSQL has a solution to the problem: WITH HOLD cursors.

Here is how it works:

test=# BEGIN;
BEGIN
test=# DECLARE cur CURSOR WITH HOLD FOR 
	SELECT * FROM t_random ORDER BY r;
DECLARE CURSOR
test=# \timing
Timing is on.
test=# COMMIT;
COMMIT
Time: 651.211 ms

As you can see, the WITH HOLD cursor has been declared just like a normal cursor. The interesting part is the COMMIT: to make sure that the data can survive the transaction, PostgreSQL has to materialize the result, which is why the COMMIT takes quite some time. In return, FETCH can now happen after the COMMIT:

test=# FETCH 4 FROM cur;
          r           
----------------------
 2.76602804660797e-07
 6.17466866970062e-07
 3.60095873475075e-06
 4.77954745292664e-06
(4 rows)

If you are making use of WITH HOLD cursors, keep in mind that such a cursor has to be closed explicitly. Otherwise your connection will keep accumulating cursors and their materialized results:

test=# \h CLOSE
Command:     CLOSE
Description: close a cursor
Syntax:
CLOSE { name | ALL }
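A defensive pattern on the client side is to tie CLOSE to a try/finally block, so the cursor is released even if processing fails mid-way. Here is a hedged sketch where a generic `execute` callback stands in for your driver's statement execution (the function names are mine, not PostgreSQL's):

```python
def with_hold_cursor(execute, name, query, consume):
    """Declare a WITH HOLD cursor, hand it to `consume`, and always CLOSE it."""
    execute(f"DECLARE {name} CURSOR WITH HOLD FOR {query}")
    try:
        return consume(name)
    finally:
        execute(f"CLOSE {name}")   # runs even if consume() raises

# Demonstration with a recording stub instead of a real connection:
log = []
with_hold_cursor(log.append, "cur", "SELECT * FROM t_random ORDER BY r",
                 lambda name: log.append(f"FETCH 4 FROM {name}"))
```

Because CLOSE sits in the finally clause, the materialized result is freed no matter how the fetch phase ends.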

Do you want to learn more about PostgreSQL and the optimizer in general? Check out one of our older posts right now.