Vacuum
PostgreSQL's housekeeping process — and the single most neglected duty in the entire manor.
PostgreSQL never updates a row in place. Every UPDATE creates a new version of the row and leaves the old one behind. Every DELETE marks a row as dead but does not remove it. Left unattended, these dead tuples accumulate — consuming space, slowing scans, and generally making the place unfit for guests. VACUUM is the process that cleans them up: reclaiming space for reuse, updating the visibility map, and freezing old transaction IDs to prevent wraparound. Autovacuum handles this automatically for most workloads, but write-heavy tables can outpace it if the defaults are not tuned. A household that does not clean is not a household. It is a storage facility.
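You can watch this happen on any scratch table you are free to write to (the table name `scratch` here is purely illustrative):

```sql
-- Hypothetical scratch table to observe dead tuples accumulating
CREATE TABLE scratch (id int, payload text);
INSERT INTO scratch SELECT g, 'row' FROM generate_series(1, 1000) g;

-- Each UPDATE leaves the old row version behind as a dead tuple
UPDATE scratch SET payload = 'updated';

-- Statistics are gathered asynchronously; after a moment,
-- n_dead_tup should be roughly equal to n_live_tup
SELECT n_live_tup, n_dead_tup
FROM pg_stat_user_tables
WHERE relname = 'scratch';
```

One UPDATE of every row roughly doubles the table's tuple count until VACUUM reclaims the old versions.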
What VACUUM does
Allow me to outline the duties. VACUUM performs three distinct jobs in a single pass:
- Reclaims dead tuples — scans the table for row versions that are no longer visible to any active transaction and marks that space as available for reuse by future inserts and updates.
- Updates the visibility map — tracks which pages contain only universally-visible rows. This enables index-only scans (the executor can skip heap fetches for pages the visibility map has marked all-visible) and lets future VACUUM runs skip pages that are already clean.
- Freezes old transaction IDs — replaces aging transaction IDs on old rows with a special "frozen" marker. This is what prevents transaction ID wraparound, a failure mode unique to PostgreSQL's MVCC implementation.
-- Standard VACUUM on a single table
VACUUM my_table;
-- VACUUM with verbose output (shows what it did)
VACUUM VERBOSE my_table;
-- VACUUM and update planner statistics in one pass
VACUUM ANALYZE my_table;
Standard VACUUM does not return disk space to the operating system. It marks dead space as reusable within the table file. The file stays the same size on disk, but PostgreSQL will fill those gaps with new data rather than appending to the end. Think of it as clearing rooms for the next guest rather than demolishing the wing.
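You can verify this behaviour yourself. pg_relation_size reports the on-disk size of the table's main file, so measuring before and after a standard VACUUM shows the file holding steady (continuing the `my_table` example):

```sql
-- On-disk size before: dead tuples are still occupying pages
SELECT pg_size_pretty(pg_relation_size('my_table'));

VACUUM my_table;

-- On-disk size after: unchanged. The space is now reusable
-- internally, but the file has not shrunk.
SELECT pg_size_pretty(pg_relation_size('my_table'));
```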
VACUUM vs VACUUM FULL
This distinction trips up nearly everyone at least once. They share a name, but that is where the resemblance ends.
VACUUM (standard) runs concurrently with normal queries. It acquires only a lightweight lock (SHARE UPDATE EXCLUSIVE) that does not block reads or writes. It marks dead space as reusable but does not physically compact the table. It is safe to run at any time, including production hours.
VACUUM FULL rewrites the entire table to a new file, eliminating all dead space and physically shrinking the table on disk. This requires an ACCESS EXCLUSIVE lock — no other session can read or write the table until the rewrite completes. On a large table, this can mean minutes or hours of downtime.
-- VACUUM FULL rewrites the entire table to reclaim disk space
-- WARNING: acquires an ACCESS EXCLUSIVE lock — no reads or writes
VACUUM FULL my_table;
-- For most cases, prefer pg_repack or pg_squeeze instead
-- They achieve the same result without the heavy lock
The rule of thumb: standard VACUUM is daily housekeeping that should happen continuously. VACUUM FULL is the equivalent of gutting a room and refurnishing it from scratch — an emergency measure for tables that have already bloated beyond what routine cleaning can manage. If you find yourself reaching for VACUUM FULL regularly, the better answer is to fix whatever is preventing standard VACUUM from keeping up. One does not renovate the parlour every week. One tidies it properly the first time.
Autovacuum
PostgreSQL ships with an autovacuum daemon — a member of staff who vacuums the floors on a schedule, without being asked. For most tables, autovacuum keeps dead tuple counts low without any manual intervention. It is, if you'll permit me, one of PostgreSQL's better hires.
The trigger formula is straightforward: autovacuum fires when the number of dead tuples in a table exceeds autovacuum_vacuum_threshold + (autovacuum_vacuum_scale_factor * number of live tuples). With the defaults (threshold 50, scale factor 0.2), a table with 1 million rows gets vacuumed after accumulating roughly 200,050 dead tuples.
-- Key autovacuum parameters (shown with defaults)
-- Minimum dead tuples before a table is vacuumed
autovacuum_vacuum_threshold = 50
-- Fraction of table size added to threshold
autovacuum_vacuum_scale_factor = 0.2
-- How often autovacuum checks for work (seconds)
autovacuum_naptime = 60
-- Max concurrent autovacuum workers
autovacuum_max_workers = 3
-- Cost-based throttling (higher = more aggressive)
autovacuum_vacuum_cost_delay = 2 -- ms pause between batches
autovacuum_vacuum_cost_limit = 200 -- cost units per batch
-- The effective trigger formula:
-- vacuum when dead tuples > threshold + (scale_factor * table rows)
-- For a 1M row table with defaults: 50 + (0.2 * 1,000,000) = 200,050
Autovacuum is also throttled by cost-based delay settings to avoid overwhelming disk I/O. Each vacuum operation accumulates "cost" as it reads and writes pages, and pauses for autovacuum_vacuum_cost_delay milliseconds after reaching the autovacuum_vacuum_cost_limit. A thoughtful arrangement — the cleaning staff pauses when guests are about, so as not to disrupt the proceedings.
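To see the values actually in effect on a running server (a read-only check, safe to run anywhere), query pg_settings:

```sql
-- All autovacuum-related settings as the server currently sees them.
-- Note: autovacuum_vacuum_cost_limit defaults to -1, meaning
-- "fall back to vacuum_cost_limit" (which defaults to 200).
SELECT name, setting, unit
FROM pg_settings
WHERE name LIKE 'autovacuum%'
ORDER BY name;
```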
When autovacuum falls behind
On write-heavy tables, the default autovacuum settings are too conservative. The staff is diligent, but they have been given a mop for a job that requires a power washer. The symptoms are recognizable:
- Growing dead tuple counts — n_dead_tup in pg_stat_user_tables climbs faster than autovacuum can clean it
- Table bloat — the table is significantly larger on disk than its live data warrants
- Slower sequential scans — dead tuples consume pages that sequential scans still have to read through
- Index bloat — indexes accumulate entries pointing to dead tuples
-- Check vacuum health per table
SELECT
schemaname,
relname,
n_dead_tup,
n_live_tup,
round(n_dead_tup::numeric / nullif(n_live_tup, 0) * 100, 2) AS dead_pct,
last_vacuum,
last_autovacuum,
autovacuum_count
FROM pg_stat_user_tables
WHERE n_dead_tup > 10000
ORDER BY n_dead_tup DESC;
The fix is almost always per-table tuning rather than changing global settings. Lower the scale factor so vacuum triggers sooner, and optionally raise the cost limit so each vacuum run does more work per wake cycle. You are not changing the staff — you are giving them better instructions.
-- For a write-heavy table, lower the scale factor
ALTER TABLE hot_table SET (
autovacuum_vacuum_scale_factor = 0.01,
autovacuum_vacuum_threshold = 1000
);
-- More aggressive freeze to prevent wraparound
ALTER TABLE hot_table SET (
autovacuum_freeze_max_age = 100000000
);
Key indicators to watch in pg_stat_user_tables: n_dead_tup (current dead tuple count), last_autovacuum (when it last ran), and autovacuum_count (how many times it has run total). If n_dead_tup is consistently high and last_autovacuum is recent, vacuum is running but not keeping up — lower the scale factor. If last_autovacuum is old or null, the threshold may never be reached — lower the threshold.
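To see whether a vacuum is running at this very moment, and how far along it is, pg_stat_progress_vacuum (available since PostgreSQL 9.6) shows one row per active vacuum worker, manual or automatic:

```sql
-- One row per VACUUM currently in progress
SELECT
    pid,
    relid::regclass AS table_name,
    phase,                -- e.g. 'scanning heap', 'vacuuming indexes'
    heap_blks_scanned,
    heap_blks_total
FROM pg_stat_progress_vacuum;
```

An empty result means no vacuum is running; a worker stuck in the same phase across repeated checks suggests throttling or a very large index pass.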
Transaction ID wraparound
I must be direct about this matter, because it is grave. PostgreSQL uses 32-bit transaction IDs. With a limit of approximately 2.1 billion usable IDs, a busy database can exhaust the range in weeks or months. When the counter wraps around, rows that were committed in the past would suddenly appear to be from the future — effectively invisible. This is not performance degradation. This is catastrophic data loss.
PostgreSQL prevents this through freezing. VACUUM replaces old transaction IDs with a special FrozenTransactionId that is always considered "in the past" regardless of the current counter position. Autovacuum triggers aggressive freezing when a table's oldest unfrozen transaction ID approaches the danger zone (controlled by autovacuum_freeze_max_age, default 200 million).
-- Check how close each database is to wraparound
SELECT
datname,
age(datfrozenxid) AS xid_age,
round(age(datfrozenxid)::numeric / 2147483647 * 100, 2) AS pct_to_wraparound
FROM pg_database
ORDER BY age(datfrozenxid) DESC;
-- Check per-table freeze age
SELECT
schemaname,
relname,
age(relfrozenxid) AS xid_age
FROM pg_stat_user_tables
ORDER BY age(relfrozenxid) DESC
LIMIT 20;
When age(datfrozenxid) approaches 2 billion, you are in serious trouble. PostgreSQL will begin emitting warnings at approximately 40 million transactions remaining. If those warnings go unheeded, it refuses to start new transactions entirely — a protective shutdown that requires manual single-user-mode vacuuming to resolve. The database, in effect, locks the doors and turns away all callers until someone attends to the mess.
Preventing wraparound is simple in principle: make sure autovacuum is running and not blocked by long-running transactions. Long-running transactions hold back the freeze horizon — VACUUM cannot freeze rows that might still be visible to an open transaction. A forgotten BEGIN without a COMMIT in a monitoring session or a stuck replication slot can silently prevent freezing across the entire database. I have seen it happen. A single idle transaction, left open for days by someone who forgot about it, quietly preventing the entire household from being cleaned.
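Both culprits are straightforward to check for with read-only queries against the system views:

```sql
-- Sessions holding transactions open. 'idle in transaction'
-- is the classic forgotten BEGIN without a COMMIT.
SELECT
    pid,
    state,
    xact_start,
    now() - xact_start AS xact_duration,
    query
FROM pg_stat_activity
WHERE xact_start IS NOT NULL
ORDER BY xact_start
LIMIT 10;

-- Replication slots that are inactive but still pinning
-- the freeze horizon via their xmin values
SELECT slot_name, active, xmin, catalog_xmin
FROM pg_replication_slots
WHERE NOT active;
```

Any session with a multi-day xact_duration, or any inactive slot with a non-null xmin, deserves immediate attention.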
How Gold Lapel relates
Gold Lapel monitors vacuum health as part of its continuous database analysis. It tracks dead tuple ratios, bloat estimates, and transaction ID age across all tables — flagging tables where autovacuum is falling behind before the symptoms reach your queries.
When Gold Lapel detects that a slow query is scanning a bloated table — one where dead tuples are inflating the number of pages the query must read — it factors that into its optimization decisions. A query that looks slow because of a missing index may actually be slow because the table is three times larger than it should be. The distinction matters enormously for choosing the right fix, and it is the sort of thing one only notices if one is paying close attention to the state of every room.
Gold Lapel does not run VACUUM itself. That remains PostgreSQL's responsibility. But it ensures you know which rooms need attention before the dust becomes structural.