30x size used by index vs tables (pgvector, closed)

ncoder commented on May 21, 2024

30x size used by index vs tables

from pgvector.

Comments (7)

ncoder commented on May 21, 2024

FYI, I used this query to see disk usage:

SELECT pg_relation_filepath(oid), relpages*8 as kb, relname FROM pg_class order by kb desc;

ankane commented on May 21, 2024

Hey @ncoder, use pg_table_size to get the table size.
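
For context, a hedged sketch of why the two measurements can disagree: `relpages` in `pg_class` is an estimate for a single relation's pages, while `pg_table_size` also counts the table's TOAST relation (where large values such as 1500-dimension vectors are typically stored), plus the free space map and visibility map. The query below assumes a table named `qa`, as used later in this thread; swap in your own table name.

```sql
-- Break one table's disk usage into its parts.
-- 'qa' is the table name from this thread; replace it as needed.
SELECT
    pg_size_pretty(pg_relation_size('qa'))       AS main_heap,          -- main data fork only
    pg_size_pretty(pg_table_size('qa'))          AS table_size,         -- heap + TOAST + FSM + VM, no indexes
    pg_size_pretty(pg_indexes_size('qa'))        AS all_indexes,        -- every index on the table
    pg_size_pretty(pg_total_relation_size('qa')) AS table_plus_indexes; -- table_size + all_indexes
```

If `main_heap` is much smaller than `table_size`, most of the data lives in TOAST, which would explain a large gap between a `relpages`-based figure for the table and the size of an index on it.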

ankane commented on May 21, 2024

FWIW, here is a test script:

CREATE TABLE items (embedding vector(1500));
INSERT INTO items (embedding)
    SELECT (
        SELECT array_agg(i) FROM generate_series(1, 1500) i
    ) FROM generate_series(1, 100000) n;
SELECT pg_size_pretty(pg_table_size('items')) AS table_size;
SET maintenance_work_mem = '500MB';
CREATE INDEX my_index ON items USING ivfflat (embedding) WITH (lists = 1000);
SELECT pg_size_pretty(pg_total_relation_size('my_index')) AS index_size;

and output:

CREATE TABLE
INSERT 0 100000
 table_size 
------------
 795 MB
(1 row)

SET
CREATE INDEX
 index_size 
------------
 797 MB
(1 row)

ncoder commented on May 21, 2024

db=# SELECT pg_size_pretty(pg_table_size('qa')) as pg_table_size, pg_size_pretty(pg_total_relation_size('qa')) as pg_total_relation_size;
 pg_table_size | pg_total_relation_size
---------------+------------------------
 25 GB         | 48 GB

ncoder commented on May 21, 2024

Wait, I made a second index to test on this one... hold up.

(Good, now edited.)

ncoder commented on May 21, 2024

Using your script, I replicated your results exactly.

I corrected my query so that it measures the same thing on my data (table size vs. the size of the one index):

SELECT pg_size_pretty(pg_table_size('qa')) as pg_table_size, pg_size_pretty(pg_total_relation_size('qa_embedding_idx')) as pg_total_relation_size;
 pg_table_size | pg_total_relation_size
---------------+------------------------
 25 GB         | 24 GB
(1 row)
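
When a table has more than one index (as happened above with the extra test index), it can help to list each index separately instead of relying on `pg_total_relation_size` of the table, which adds all of them together. A minimal sketch, again assuming the table is named `qa`:

```sql
-- List every index on the table with its own on-disk size, largest first.
-- 'qa' is the table name from this thread; replace it as needed.
SELECT
    i.indexrelid::regclass                         AS index_name,
    pg_size_pretty(pg_relation_size(i.indexrelid)) AS index_size
FROM pg_index AS i
WHERE i.indrelid = 'qa'::regclass
ORDER BY pg_relation_size(i.indexrelid) DESC;
```

Each row can then be compared against `pg_table_size('qa')` on its own, which is the like-for-like comparison made in the corrected query above.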