PostgreSQL Index Usage Analysis

Question

Is there a tool or method to analyze Postgres, and determine what missing indexes should be created, and which unused indexes should be removed? I have a little experience doing this with the "profiler" tool for SQLServer, but I'm not aware of a similar tool included with Postgres.

trinchet · Accepted Answer · 2021-01-09 15:06:30Z

208

I like this to find missing indexes:

SELECT
  relname                                               AS TableName,
  to_char(seq_scan, '999,999,999,999')                  AS TotalSeqScan,
  to_char(idx_scan, '999,999,999,999')                  AS TotalIndexScan,
  to_char(n_live_tup, '999,999,999,999')                AS TableRows,
  pg_size_pretty(pg_relation_size(relname :: regclass)) AS TableSize
FROM pg_stat_all_tables
WHERE schemaname = 'public'
      AND 50 * seq_scan > idx_scan -- more than 2%
      AND n_live_tup > 10000
      AND pg_relation_size(relname :: regclass) > 5000000
ORDER BY relname ASC;

This checks if there are more sequence scans than index scans. If the table is small, it gets ignored, since Postgres seems to prefer sequence scans for them.

Above query does reveal missing indexes.

The next step would be to detect missing combined indexes. I guess this is not easy, but doable. Maybe analyzing the slow queries ... I heard pg_stat_statements could help...

edited Jan 9, 2021 at 15:06

trinchet

6,8734 gold badges39 silver badges60 bronze badges

answered Oct 10, 2012 at 11:23

guettli

27.8k95 gold badges391 silver badges723 bronze badges

27

To make this work with quoted identifiers change the query to: SELECT relname, seq_scan-idx_scan AS too_much_seq, case when seq_scan-idx_scan>0 THEN 'Missing Index?' ELSE 'OK' END, pg_relation_size(relid::regclass) AS rel_size, seq_scan, idx_scan FROM pg_stat_all_tables WHERE schemaname='public' AND pg_relation_size(relid::regclass)>80000 ORDER BY too_much_seq DESC;
– Mr. Muskrat
Commented Jan 20, 2016 at 16:30
1

To @cen 's point, when too_much_seq is positive and large you should be concerned.
– mountainclimber11
Commented Sep 27, 2017 at 18:11
1

@KishoreKumar I guess the stats in postgres still contain the queries which were executed before you updated your index. Depending on your traffic the stats will be ok again after some hours.
– guettli
Commented Sep 10, 2019 at 8:31
1

::regclass won't work on uppercase identifiers, @Mr. Muskrat has a good solution, it is also possible to use ('"' || relname || '"')::regclass instead.
– Adrien
Commented Sep 18, 2020 at 7:16
1

If you want NULL too_much_seq to come last, add NULLS LAST after ORDER BY too_much_seq DESC
– Rafs
Commented Mar 15, 2022 at 10:02

| Show 1 more comment

the Tin Man · Accepted Answer · 2015-12-10 17:15:36Z

27

Check the statistics. pg_stat_user_tables and pg_stat_user_indexes are the ones to start with.

See "The Statistics Collector".

edited Dec 10, 2015 at 17:15

the Tin Man

160k44 gold badges218 silver badges306 bronze badges

answered Jul 23, 2010 at 14:16

Frank Heikens

124k26 gold badges150 silver badges144 bronze badges

Add a comment |

the Tin Man · Accepted Answer · 2015-12-10 17:11:28Z

20

On the determine missing indexes approach....Nope. But there's some plans to make this easier in future release, like pseudo-indexes and machine readable EXPLAIN.

Currently, you'll need to EXPLAIN ANALYZE poor performing queries and then manually determine the best route. Some log analyzers like pgFouine can help determine the queries.

As far as an unused index, you can use something like the following to help identify them:

select * from pg_stat_all_indexes where schemaname <> 'pg_catalog';

This will help identify tuples read, scanned, fetched.

edited Dec 10, 2015 at 17:11

the Tin Man

160k44 gold badges218 silver badges306 bronze badges

answered Jul 23, 2010 at 14:03

rfusca

7,5852 gold badges31 silver badges34 bronze badges

Add a comment |

n1000 · Accepted Answer · 2015-12-30 12:08:19Z

19

Another new and interesting tool for analyzing PostgreSQL is PgHero. It is more focused on tuning the database and makes numerous analysis and suggestions.

answered Dec 30, 2015 at 12:08

n1000

5,25411 gold badges39 silver badges65 bronze badges

Add a comment |

David Dehghan · Accepted Answer · 2018-07-11 16:42:41Z

You can use below query to find Index usage and Index size:

Reference is taken from this blog.

SELECT
    pt.tablename AS TableName
    ,t.indexname AS IndexName
    ,to_char(pc.reltuples, '999,999,999,999') AS TotalRows
    ,pg_size_pretty(pg_relation_size(quote_ident(pt.tablename)::text)) AS TableSize
    ,pg_size_pretty(pg_relation_size(quote_ident(t.indexrelname)::text)) AS IndexSize
    ,to_char(t.idx_scan, '999,999,999,999') AS TotalNumberOfScan
    ,to_char(t.idx_tup_read, '999,999,999,999') AS TotalTupleRead
    ,to_char(t.idx_tup_fetch, '999,999,999,999') AS TotalTupleFetched
FROM pg_tables AS pt
LEFT OUTER JOIN pg_class AS pc 
    ON pt.tablename=pc.relname
LEFT OUTER JOIN
( 
    SELECT 
        pc.relname AS TableName
        ,pc2.relname AS IndexName
        ,psai.idx_scan
        ,psai.idx_tup_read
        ,psai.idx_tup_fetch
        ,psai.indexrelname 
    FROM pg_index AS pi
    JOIN pg_class AS pc 
        ON pc.oid = pi.indrelid
    JOIN pg_class AS pc2 
        ON pc2.oid = pi.indexrelid
    JOIN pg_stat_all_indexes AS psai 
        ON pi.indexrelid = psai.indexrelid 
)AS T
    ON pt.tablename = T.TableName
WHERE pt.schemaname='public'
ORDER BY 1;

Shree Prakash · Accepted Answer · 2019-09-11 07:29:52Z

14

It can be found by using following query in postgres console

use db_name
select * from pg_stat_user_indexes;
select * from pg_statio_user_indexes;

For More Details https://www.postgresql.org/docs/current/monitoring-stats.html

answered Sep 11, 2019 at 7:29

Shree Prakash

2,1683 gold badges23 silver badges34 bronze badges

Add a comment |

the Tin Man · Accepted Answer · 2015-12-10 17:09:50Z

There are multiple links to scripts that will help you find unused indexes at the PostgreSQL wiki. The basic technique is to look at pg_stat_user_indexes and look for ones where idx_scan, the count of how many times that index has been used to answer queries, is zero, or at least very low. If the application has changed and a formerly used index probably isn't now, you sometimes have to run pg_stat_reset() to get all the statistics back to 0 and then collect new data; you might save the current values for everything and compute a delta instead to figure that out.

There isn't any good tools available yet to suggest missing indexes. One approach is to log the queries you're running and analyze which ones are taking a long time to run using a query log analysis tool like pgFouine or pqa. See "Logging Difficult Queries" for more info.

The other approach is to look at pg_stat_user_tables and look for tables that have large numbers of sequential scans against them, where seq_tup_fetch is large. When an index is used the idx_fetch_tup count is increased instead. That can clue you into when a table is not indexed well enough to answer queries against it.

Actually figuring out which columns you should then index on? That usually leads back to the query log analysis stuff again.

n1000 · Accepted Answer · 2015-12-30 11:56:34Z

1

PoWA seems like an interesting tool for PostgreSQL 9.4+. It collects statistics, visualizes them, and suggests indexes. It uses the pg_stat_statements extension.

PoWA is PostgreSQL Workload Analyzer that gathers performance stats and provides real-time charts and graphs to help monitor and tune your PostgreSQL servers. It is similar to Oracle AWR or SQL Server MDW.

answered Dec 30, 2015 at 11:56

n1000

5,25411 gold badges39 silver badges65 bronze badges

Add a comment |

madjardi · Accepted Answer · 2018-10-10 09:36:52Z

0

CREATE EXTENSION pgstattuple; 
CREATE TABLE test(t INT); 
INSERT INTO test VALUES(generate_series(1, 100000)); 
SELECT * FROM pgstatindex('test_idx'); 

version            | 2 
tree_level         | 2 
index_size         | 105332736 
root_block_no      | 412 
internal_pages     | 40 
leaf_pages         | 12804 
empty_pages        | 0 
deleted_pages      | 13 
avg_leaf_density   | 9.84 
leaf_fragmentation | 21.42

edited Oct 10, 2018 at 9:36

answered Oct 10, 2018 at 9:26

madjardi

5,8692 gold badges38 silver badges39 bronze badges

Add a comment |

Collectives™ on Stack Overflow

PostgreSQL Index Usage Analysis

9 Answers 9

Not the answer you're looking for? Browse other questions tagged
sql
database-design
postgresql
or ask your own question.

Linked

Hot Network Questions

Collectives™ on Stack Overflow

9 Answers 9

Not the answer you're looking for? Browse other questions tagged sqldatabase-designpostgresql or ask your own question.

Linked

Related

Not the answer you're looking for? Browse other questions tagged
sql
database-design
postgresql
or ask your own question.