
Connection pooling

One does not hire a new servant for every errand. Reusing database connections instead of opening a new one for every request — the difference between 20 PostgreSQL backends and 2,000.

Concept · March 21, 2026 · 8 min read

Connection pooling maintains a set of open database connections and lends them to application requests on demand. When a request finishes, the connection returns to the pool rather than being closed — a small act of reuse that compounds into extraordinary savings. This avoids the overhead of establishing a new PostgreSQL connection for every query, an operation that involves process forking, authentication, and memory allocation. Tools like PgBouncer provide external pooling, while libraries like SQLAlchemy and HikariCP pool connections within the application itself.

What connection pooling is

Every connection to PostgreSQL is a process. When a client connects, the postmaster process forks a new backend process dedicated to that session. This backend allocates memory for work_mem, temp_buffers, catalog caches, and other per-session state. When the client disconnects, the process exits and all of that memory is freed.

For a web application handling hundreds or thousands of concurrent requests, creating and destroying a process per request is — and I say this with all due courtesy — a remarkable waste of resources. Most requests need a database connection for only a few milliseconds — the time it takes to run one or two queries. The connection setup and teardown often takes longer than the queries themselves.

A connection pool solves this by keeping a fixed number of connections open and reusing them. Application requests check out a connection, use it, and return it. The PostgreSQL backend never knows the difference — from its perspective, the connection has been open the whole time. Proper resource management at its finest.
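The checkout/return cycle is simple enough to sketch in a few lines of Python. This is a toy illustration, not how SQLAlchemy or HikariCP are implemented; real pools add health checks, overflow, timeouts, and connection recycling. The connect_fn parameter here stands in for any real connection factory.

Python
```python
import queue

class SimplePool:
    """Toy pool: a fixed set of connections, checked out and returned, never closed."""

    def __init__(self, connect_fn, size):
        self._connections = queue.Queue(maxsize=size)
        for _ in range(size):
            self._connections.put(connect_fn())  # open everything up front

    def checkout(self, timeout=30):
        # Block until a connection is free, like SQLAlchemy's pool_timeout
        return self._connections.get(timeout=timeout)

    def checkin(self, conn):
        # Hand the still-open connection back for the next request
        self._connections.put(conn)

# 100 simulated requests served by a pool of 2 stand-in "connections"
pool = SimplePool(connect_fn=object, size=2)
conns = set()
for _ in range(100):
    conn = pool.checkout()
    conns.add(id(conn))
    pool.checkin(conn)
assert len(conns) == 2  # two connections did all the work; none were reopened
```

The assertion at the end is the whole point: a hundred requests, two connections, zero setup and teardown after startup.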

Why it matters

PostgreSQL's process-per-connection model has a cost that scales poorly. The damage, if you'll permit a brief inventory.

Memory. Each backend process consumes roughly 5-10 MB of resident memory even when idle. At 100 connections, that is 0.5-1 GB dedicated to connection overhead. At 500 connections, you are spending 2.5-5 GB before a single query runs. This memory competes with shared_buffers and the OS page cache — the very resources that make your queries fast. It is the infrastructural equivalent of heating every room in the manor while the guests are all in the drawing room.
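The arithmetic is worth running with your own numbers. A quick back-of-the-envelope using the 5-10 MB per-backend range above:

Python
```python
def connection_overhead_mb(connections, mb_per_backend=(5, 10)):
    """Rough idle-memory cost of N PostgreSQL backends, as a (low, high) range."""
    low, high = mb_per_backend
    return connections * low, connections * high

# The figures from the text
assert connection_overhead_mb(100) == (500, 1000)    # 0.5-1 GB
assert connection_overhead_mb(500) == (2500, 5000)   # 2.5-5 GB
```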

Connection latency. Establishing a new PostgreSQL connection involves a TCP handshake, SSL negotiation (if enabled), authentication, and process forking. On a local connection, this takes 2-5 ms. Over a network with SSL, it can reach 20-50 ms. For a web request that runs a 1 ms query, spending 20 ms on connection setup is not a rounding error.
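To make "not a rounding error" concrete, here is the fraction of a request spent purely on connection setup, using the figures above:

Python
```python
def setup_overhead_fraction(setup_ms, query_ms):
    """Share of total request time spent just establishing the connection."""
    return setup_ms / (setup_ms + query_ms)

# A 1 ms query behind a 20 ms SSL connection setup: ~95% of the time is setup
assert round(setup_overhead_fraction(20, 1), 2) == 0.95
```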

max_connections. PostgreSQL defaults max_connections to 100. Raising it sounds simple — and it is the first thing everyone tries — but each additional connection adds memory overhead and increases contention on internal locks like ProcArrayLock. PostgreSQL was not designed for thousands of direct connections, and performance degrades well before you hit the configured limit.

SQL
-- Check current connection limit
SHOW max_connections;

-- See how many connections are currently open
SELECT count(*) FROM pg_stat_activity;

-- Breakdown by state
SELECT state, count(*)
FROM pg_stat_activity
GROUP BY state
ORDER BY count DESC;

SQL
-- Each PostgreSQL connection is a forked process
-- List the client backends (per-process memory is visible from the OS, not SQL)
SELECT
  pid,
  usename,
  application_name,
  state,
  backend_start,
  query_start
FROM pg_stat_activity
WHERE backend_type = 'client backend'
ORDER BY backend_start;

-- On Linux, each idle backend uses ~5-10 MB of RSS
-- 200 idle connections = 1-2 GB of memory doing nothing

Types of pooling

Connection pooling can happen at different layers, and the right choice depends on your architecture. If I may, a brief tour of the options.

Application-level pooling

Most database drivers and ORMs include a built-in connection pool. SQLAlchemy (Python), HikariCP (Java), node-postgres (Node.js), and ActiveRecord (Ruby) all maintain a pool of connections per application process. This is the simplest approach — no additional infrastructure — and handles the common case where a single application server needs to reuse connections efficiently.

The limitation is scope. Each application process maintains its own pool. If you have 50 containers each holding a pool of 20 connections, that is 1,000 connections to PostgreSQL — even if most sit idle. A well-run household does not employ fifty doormen when one suffices. Application-level pooling does not help with cross-process connection sharing.
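The fan-out math is easy to underestimate, so it is worth writing down. The numbers below mirror the 50-container example above:

Python
```python
def total_db_connections(app_processes, pool_size_per_process):
    """Without an external pooler, every process-local pool hits PostgreSQL directly."""
    return app_processes * pool_size_per_process

assert total_db_connections(50, 20) == 1000  # ten times a default max_connections of 100
```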

Python / Java
# SQLAlchemy connection pool configuration
from sqlalchemy import create_engine

engine = create_engine(
    "postgresql://user:pass@localhost/mydb",
    pool_size=20,          # maintained connections
    max_overflow=10,       # temporary connections above pool_size
    pool_timeout=30,       # seconds to wait for a connection
    pool_recycle=1800,     # recycle connections after 30 minutes
    pool_pre_ping=True,    # verify connections before checkout
)

# HikariCP (Java) — similar concept
# maximumPoolSize=20
# minimumIdle=5
# connectionTimeout=30000
# idleTimeout=600000

External pooling (PgBouncer, Pgpool-II)

An external pooler sits between your application and PostgreSQL as a lightweight proxy. PgBouncer is the most widely used — it is a single-threaded, event-driven process that can handle thousands of client connections and multiplex them onto a much smaller number of PostgreSQL connections.

Pgpool-II offers connection pooling alongside load balancing, replication management, and query caching. It is more feature-rich but heavier. For pure connection pooling, PgBouncer is the standard choice due to its simplicity and low overhead.

Cloud-native pooling

Managed PostgreSQL services increasingly ship built-in poolers. Supabase offers Supavisor (a multi-tenant pooler written in Elixir). Neon provides a serverless connection proxy that pools automatically. AWS RDS Proxy handles pooling for RDS and Aurora. These remove the operational burden of running PgBouncer yourself, though they vary considerably in configurability and pooling mode support. As with all managed services, convenience and control are in tension.

PgBouncer modes

PgBouncer supports three pooling modes, each with a different trade-off between connection reuse and session feature compatibility.

pgbouncer.ini
# pgbouncer.ini — minimal configuration

[databases]
mydb = host=127.0.0.1 port=5432 dbname=mydb

[pgbouncer]
listen_addr = 0.0.0.0
listen_port = 6432
auth_type = md5
auth_file = /etc/pgbouncer/userlist.txt

# Pool mode: session, transaction, or statement
pool_mode = transaction

# Pool sizing
default_pool_size = 20        # connections per user/database pair
min_pool_size = 5             # keep this many open even when idle
max_client_conn = 1000        # max application connections accepted
max_db_connections = 50       # max connections to PostgreSQL

Session mode

A client gets a dedicated server connection for the entire duration of its session. The connection is returned to the pool only when the client disconnects. This is the safest mode — every PostgreSQL feature works exactly as it would with a direct connection — but it provides the least multiplexing benefit. If your clients hold sessions open for long periods, session mode offers little advantage over connecting directly.

Transaction mode

A server connection is assigned when a client begins a transaction and returned when the transaction completes. Between transactions, the client has no dedicated server connection. This is the most commonly used mode in production because it provides excellent connection reuse while still supporting multi-statement transactions.

The trade-off: any feature that depends on session-level state does not carry across transactions. This includes SET commands, advisory locks, LISTEN/NOTIFY, and — historically — prepared statements. I should note that this last omission caused a great deal of consternation for years. PgBouncer 1.21+ added transparent prepared statement support in transaction mode, addressing the most common pain point. Better late than never, though one wishes the discovery had not required quite so many production incidents.
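The session-state pitfall can be demonstrated without a database. The sketch below is a simulation of the failure mode, not PgBouncer's implementation: a toy transaction-mode pooler hands each transaction whichever server connection comes up next, and session settings (a dict standing in for SET parameters) live on the server connection rather than following the client.

Python
```python
import itertools

class ServerConn:
    """Stand-in for a PostgreSQL backend with per-session state (SET parameters)."""
    def __init__(self, name):
        self.name = name
        self.session_settings = {}

class TransactionModePooler:
    """Toy pooler: each transaction may land on a different server connection."""
    def __init__(self, size):
        self._conns = itertools.cycle([ServerConn(f"backend-{i}") for i in range(size)])

    def run_transaction(self, fn):
        conn = next(self._conns)  # whichever backend is "free" this time
        return fn(conn)

pooler = TransactionModePooler(size=2)
# Transaction 1: SET a session parameter (lands on backend-0)
pooler.run_transaction(lambda c: c.session_settings.update(statement_timeout="5s"))
# Transaction 2: read it back (lands on backend-1)
seen = pooler.run_transaction(lambda c: c.session_settings.get("statement_timeout"))
assert seen is None  # the SET stayed behind on the other backend
```

The same mechanism is why advisory locks and LISTEN/NOTIFY misbehave in transaction mode: they attach to a server connection the client no longer holds.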

Statement mode

The server connection is returned after every individual statement. This provides maximum connection reuse but breaks multi-statement transactions entirely — each statement may execute on a different server connection. Only suitable for simple, autocommit workloads with no transaction boundaries.

Pool sizing

Pool sizing is about finding the balance between two failure modes, and most teams err in the same direction. Too few connections and requests queue up waiting for a free slot, adding latency. Too many and PostgreSQL spends resources managing backends that are mostly idle, while contention on internal locks increases.

SQL
-- A starting formula for pool sizing:
-- connections = (core_count * 2) + effective_spindle_count
--
-- For a 4-core server with SSDs:
-- connections = (4 * 2) + 1 = 9, so start around 9-10
--
-- This is a floor, not a ceiling. Adjust based on:
-- - Workload mix (read-heavy vs write-heavy)
-- - Average query duration
-- - Acceptable queuing latency

-- Monitor pool utilization with pg_stat_activity
SELECT
  count(*) FILTER (WHERE state = 'active') AS active,
  count(*) FILTER (WHERE state = 'idle') AS idle,
  count(*) FILTER (WHERE state = 'idle in transaction') AS idle_in_tx,
  count(*) FILTER (WHERE wait_event_type = 'Client') AS waiting_for_client
FROM pg_stat_activity
WHERE backend_type = 'client backend';

A common starting formula comes from the PostgreSQL wiki: connections = (core_count * 2) + effective_spindle_count. For a 4-core server with SSDs, this suggests roughly 9-10 connections. This sounds low — I have seen the look on people's faces when they hear it — but PostgreSQL is most efficient when active connections roughly match CPU cores.

The key distinction is between connections to PostgreSQL and connections from your application. With PgBouncer in front, your application can open 1,000 connections to the pooler while PostgreSQL sees only 20. The pooler absorbs the connection fan-out. Size the PostgreSQL-facing pool for throughput; size the application-facing limit for concurrency.
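Another way to reason about the PostgreSQL-facing side is Little's law: average concurrent connections needed equals request rate times the average time each request holds a connection. This framing is mine, not from the wiki formula above, but it gives a quick sanity check on whether a small pool can carry your throughput:

Python
```python
import math

def pool_size_lower_bound(requests_per_sec, avg_hold_time_sec):
    """Little's law: concurrent connections = arrival rate * hold time.
    A floor for the PostgreSQL-facing pool; add headroom for bursts."""
    return math.ceil(requests_per_sec * avg_hold_time_sec)

# 2,000 req/s, each holding a connection for 5 ms, needs ~10 concurrent connections
assert pool_size_lower_bound(2000, 0.005) == 10
```

Which is why a pool of 20 behind PgBouncer can comfortably serve 1,000 application connections, provided they hold connections briefly.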

Signs your pool is too small: rising queue depth, connection timeout errors, and latency spikes uncorrelated with query complexity. Signs your pool is too large: many idle connections in pg_stat_activity, memory pressure on the database server, and no improvement in throughput when adding connections. The right number is, like most things in PostgreSQL, discovered through measurement rather than intuition.

How Gold Lapel relates

Gold Lapel is a PostgreSQL proxy, and connection multiplexing is built into its architecture. Your application connections terminate at Gold Lapel, which maintains its own carefully sized pool of connections to PostgreSQL. Many application connections share a smaller number of database connections — the same multiplexing benefit as PgBouncer, without a separate process to operate and monitor.

I would not suggest this replaces an understanding of pool sizing and configuration. The principles on this page apply regardless of which component manages the pool. But if you are already running Gold Lapel for query optimization, connection pooling comes as part of the same deployment. One fewer piece of infrastructure to maintain is, in my experience, one fewer thing to troubleshoot at 3 AM.

Frequently asked questions