GiST index

Q: Is GiST slower than B-tree for simple equality lookups?

Yes, and without hesitation. B-tree is purpose-built for equality and range comparisons on scalar values, and it will be faster for those operations. GiST is not a replacement for B-tree — it is a complement. Use B-tree for standard equality and ordering, and GiST for spatial, range, and containment queries that B-tree cannot support. They are colleagues, not competitors.

Q: Can a GiST index support ORDER BY?

Only with distance operators. GiST supports index-ordered scans via operators like (distance), which enables efficient nearest-neighbor queries. It does not support general ORDER BY on scalar columns the way a B-tree does. If your query is ORDER BY some_column ASC, you need a B-tree. If your query is ORDER BY geom point, GiST handles it natively.

Q: Should I use GiST or GIN for full-text search?

GIN is generally faster for full-text reads and should be your default for tsvector indexing. GiST is smaller on disk, faster to update, and supports a ranking-aware scan via the distance operator. In practice, choose GIN for read-heavy full-text search workloads. Choose GiST when write performance matters more than read speed, or when you need ranked nearest-match queries that benefit from the distance operator. The answer, as with most staffing decisions, depends on the duties you are assigning.

Q: Does GiST support multicolumn indexes?

Yes. You can create a GiST index on multiple columns, and each column can use a different operator class. This is useful when queries filter on more than one spatial or range dimension — for example, a 2D spatial column and a time range together. The planner can use the multicolumn GiST index to satisfy combined filters in a single scan.

Q: How big is a GiST index compared to GIN?

GiST indexes are typically smaller than GIN indexes for the same data. GIN stores a posting list for every distinct key (every unique word in a tsvector, every element in an array), which can get large. GiST stores bounding representations at internal nodes and is more compact. The trade-off is that GiST reads are slower because it must traverse the tree and check lossy matches against the heap.

Q: What does "lossy" mean in the context of GiST?

GiST internal nodes store simplified representations of the data — bounding boxes for geometry, bounding ranges for other types. These representations can produce false positives: the index may say a row matches when it does not. PostgreSQL handles this transparently by rechecking candidate rows against the actual data in the heap. You do not need to do anything special — but it means GiST scans involve more heap lookups than an exact index like B-tree or GIN. The approximation is the price of versatility, and PostgreSQL pays it quietly on your behalf.

Generalized Search Tree — the versatile specialist on PostgreSQL's indexing staff. If B-tree is the head waiter, GiST is the member of staff who handles everything the head waiter politely declines.

The Waiter of Gold Lapel · Updated Mar 30, 2026 Published Mar 21, 2026 · 8 min read

A GiST (Generalized Search Tree) index is a balanced tree structure that supports arbitrary indexing schemes through operator classes. Where B-tree handles sortable scalar data with admirable efficiency, GiST extends the household's reach to geometric shapes, IP ranges, text search vectors, hierarchical paths, and any data type that defines a consistent decomposition into bounding regions. It is the index type behind PostGIS spatial queries, range overlap checks, and nearest-neighbor searches — the work that no other index on staff is equipped to perform.

What a GiST index is

GiST stands for Generalized Search Tree, and the name is earned rather than aspirational. The GiST documentation describes it not as a single data structure but a framework — an API that allows different data types to define how they are indexed. The framework provides the tree mechanics (balancing, splitting, page management), and each operator class plugs in the logic for how to decompose data into bounding regions, how to measure distance, and how to determine containment.

At each internal node, a GiST index stores a predicate that describes the set of values reachable through that branch. For geometry, these predicates are bounding boxes. For ranges, they are bounding ranges. For text search vectors, they are lossy signature bitmaps. The search algorithm descends the tree by checking which branches could contain matching values, pruning entire subtrees when the predicate rules them out.

This bounding-box decomposition is what gives GiST its power and its limitations. It can answer questions like "which shapes overlap this region?" or "which ranges contain this point?" — questions that B-tree cannot express. But the bounding representations are approximate, so GiST must recheck candidate rows against the actual data. This makes reads slower than an exact index but enables an entire class of queries that would otherwise require a sequential scan. A trade-off, to be sure — but one that opens doors no other index can.

When to use GiST

GiST is the right index type when your queries involve containment, overlap, proximity, or multi-dimensional data — the specialized work that a generalist simply cannot handle. The most common use cases:

PostGIS spatial queries — finding points within a polygon, polygons that intersect a region, or the nearest features to a location. GiST is required for PostGIS geometry and geography types. This is, if I may say so, the assignment that made GiST indispensable — PostGIS without GiST would be a map without a compass. For practical spatial query patterns and GiST tuning parameters, see the PostGIS for application developers guide.
Range types — tsrange, int4range, daterange, and custom range types. GiST indexes support overlap (&&), containment (@>, <@), and adjacency operators on ranges.
Nearest-neighbor searches — the <-> distance operator enables index-ordered scans that return results sorted by proximity without computing all distances first.
ltree hierarchical data — indexing and querying tree-structured label paths.
Full-text search — GiST can index tsvector columns, though GIN is typically preferred for read-heavy workloads.
Trigram similarity — with pg_trgm, GiST indexes support fuzzy string matching via the % (similarity) and <-> (distance) operators.

PostGIS nearest-neighbor query

-- Find the 10 nearest coffee shops to a point (PostGIS)
SELECT name, ST_Distance(geom, ST_MakePoint(-73.985, 40.748)) AS dist
FROM coffee_shops
ORDER BY geom <-> ST_MakePoint(-73.985, 40.748)
LIMIT 10;

-- The <-> operator triggers an index-ordered nearest-neighbor scan.
-- Without a GiST index, PostgreSQL must compute every distance and sort.

Range overlap and containment queries

-- Find all reservations that overlap a given period
SELECT *
FROM reservations
WHERE during && tsrange('2026-03-21 14:00', '2026-03-21 16:00');

-- Find reservations that contain a specific timestamp
SELECT *
FROM reservations
WHERE during @> '2026-03-21 15:00'::timestamp;

-- GiST handles &&, @>, <@, and << / >> on range types.

GiST vs GIN

GiST and GIN both handle composite data types, but they take different approaches and have different performance characteristics. Knowing which to assign to a given task is the sort of decision that separates a well-run household from one that merely has staff.

	GiST	GIN
Index size	Smaller	Larger (stores full posting lists)
Build time	Faster	Slower
Write overhead	Lower	Higher (pending list helps, but still more work)
Read speed	Slower (lossy, requires rechecks)	Faster (exact matches)
Distance operators	Supported (`<->`)	Not supported
Best for	Spatial, ranges, nearest-neighbor, write-heavy FTS	Arrays, JSONB, read-heavy FTS

For full-text search, GIN is the default recommendation because reads are faster and most workloads are read-heavy. Choose GiST when write throughput is the bottleneck, when you need distance-based ranking, or when index size is a concern. For spatial and range data, GiST is typically the only option — GIN does not support those types. Each has its strengths; the error is deploying one where the other belongs.

GiST vs B-tree

B-tree and GiST solve different problems, and neither has cause to envy the other. B-tree handles equality, range comparisons, and sorting on scalar values — integers, timestamps, text. It is faster for those operations and should remain your default.

GiST handles multi-dimensional and containment queries that B-tree cannot express:

B-tree can answer "find rows where created_at > '2026-01-01'" — a one-dimensional range on a sortable column.
GiST can answer "find rows where geom && ST_MakeEnvelope(...)" — a two-dimensional overlap on a spatial column.
B-tree can answer "find rows where id = 42" — equality on a scalar.
GiST can answer "find rows where time_range @> now()" — containment on a range type.

The rule is pleasantly clear: if your column is a standard scalar type and your queries use =, <, >, BETWEEN, or ORDER BY, use B-tree. If your column holds geometry, ranges, network addresses, tsvectors, or hierarchical paths, use GiST. Assigning the right member of staff to the right duty is not a luxury. It is the foundation of a well-ordered household.

Creating GiST indexes

GiST indexes are created with the USING gist clause, as documented in the CREATE INDEX reference. Each indexed column must have an operator class that implements the GiST interface for its data type.

SQL

-- GiST index on a geometric column
CREATE INDEX idx_locations_point ON locations USING gist (coordinates);

-- GiST index on a range column
CREATE INDEX idx_reservations_during ON reservations USING gist (during);

-- GiST index on a tsvector column (full-text search)
CREATE INDEX idx_articles_search ON articles USING gist (search_vector);

-- GiST index on an ltree column (hierarchical data)
CREATE INDEX idx_categories_path ON categories USING gist (path);

Some data types have multiple operator classes. You can specify one explicitly when the default is not what you need:

Operator classes

-- Specify an operator class for non-default behavior
CREATE INDEX idx_inet_range ON access_log USING gist (client_ip inet_ops);

-- For full-text search with tsvector
CREATE INDEX idx_docs_tsv ON documents USING gist (body_tsv);

-- For trigram similarity searches (requires pg_trgm)
CREATE INDEX idx_names_trgm ON people USING gist (name gist_trgm_ops);

Starting with PostgreSQL 12, GiST indexes support the INCLUDE clause, which adds non-key columns to the index leaf pages. This enables index-only scans for queries that filter on the GiST-indexed column but return other columns.

INCLUDE (PostgreSQL 12+)

-- GiST with INCLUDE columns (PostgreSQL 12+)
CREATE INDEX idx_reservations_during_room ON reservations
  USING gist (during) INCLUDE (room_id);

-- The INCLUDE column is stored in the index leaf pages
-- but is not part of the search key. This enables index-only scans
-- for queries that filter on 'during' and return 'room_id'.

For large tables, use CREATE INDEX CONCURRENTLY to avoid blocking writes during the build. GiST index builds are typically faster than GIN builds for the same data — one of the quieter advantages of a specialist who travels light.

How Gold Lapel relates

Gold Lapel analyzes every query that passes through its proxy layer, including spatial and range queries that rely on GiST indexes. When it detects query patterns that would benefit from a GiST index — repeated range overlap checks on unindexed columns, spatial filters triggering sequential scans, or nearest-neighbor queries sorting without an index — it surfaces the recommendation with the specific operator class and rationale.

GiST recommendations require more context than B-tree recommendations, and I should note that this is precisely the kind of nuance that matters. The operator class matters, the query patterns are more varied, and the trade-offs between GiST and GIN depend on the read/write ratio. Gold Lapel considers these factors rather than defaulting to one index type for every situation. Recommending the wrong specialist is worse than recommending no one at all.

GiST index

What a GiST index is

When to use GiST

GiST vs GIN

GiST vs B-tree

Creating GiST indexes

How Gold Lapel relates

Frequently asked questions

Is GiST slower than B-tree for simple equality lookups?

Can a GiST index support ORDER BY?

Should I use GiST or GIN for full-text search?

Does GiST support multicolumn indexes?

How big is a GiST index compared to GIN?

What does "lossy" mean in the context of GiST?

Related content