Двунаправленный индекс

Question

Двунаправленный индекс

Есть ли способ для двунаправленного индекса (для эффективного заказа ASC/DESC)?

Вот таблица:

CREATE TABLE t1(
   id VARCHAR NOT NULL PRIMARY KEY, 
   d TIMESTAMP)

и есть DESC индекс для d поле:

CREATE INDEX d_index ON t1 (d DESC);

Так как d_index является DESC тогда упорядочение по desc будет более эффективным, чем упорядочение по asc.

ОБНОВЛЕНИЕ Выше приведен абстрактный пример. Настоящая схема:

CREATE TABLE user (id VARCHAR NOT NULL PRIMARY KEY);

CREATE TABLE event(
   id VARCHAR NOT NULL PRIMARY KEY,
   author VARCHAR REFERENCES user(id),
   created TIMESTAMP without time zone,
   param1 VARCHAR,
   param2 VARCHAR,
   param3 VARCHAR);
CREATE INDEX event_author_index ON event (author);
CREATE INDEX event_created_index ON event (created);

CREATE TABLE subscribe (
   id VARCHAR NOT NULL PRIMARY KEY,
   uid VARCHAR REFERENCES user(id),
   target_uid VARCHAR REFERENCES user(id));

CREATE INDEX subscribe_uid_index ON subscribe (uid, target_uid);

количество пользователей ~ 1,5 миллиона
количество событий ~ 1,0 млн
количество подписчиков ~ 1,2 миллиона

Запросы генерировались типизированным пятном 3 (scala).

DESC заказ (очень медленно):

 explain analyze 
   select x2.x3, x2.x4 from 
    (select x5."id" as x3, x5."created" as x4 from "subscribe" x6, "event" x5 where 
       (x6."uid" = 'u1') and (x5."author" = x6."target_uid") 
   order by x5."created" desc limit 10) x2;

Limit  (cost=0.85..30.08 rows=10 width=28) (actual time=11629.178..11629.289 rows=10 loops=1)
   Output: x5.id, x5.created
   ->  Nested Loop  (cost=0.85..529307.30 rows=181083 width=28) (actual time=11629.177..11629.284 rows=10 loops=1)
         Output: x5.id, x5.created
         ->  Index Scan Backward using event_created_index on public.event x5  (cost=0.42..39295.00 rows=1002105 width=40) (actual time=38.574..8828.120 rows=923101 loops=1)
               Output: x5.id, x5.created, x5.author, x5.param1, x5.param2, x5.param3   
         ->  Index Only Scan using subscribe_uid_index on public.subscribe x6  (cost=0.43..0.48 rows=1 width=14) (actual time=0.002..0.002 rows=0 loops=923101)
               Output: x6.uid, x6.target_uid
               Index Cond: ((x6.uid = 'u1'::text) AND (x6.target_uid = (x5.author)::text))
               Heap Fetches: 0
 Planning time: 121.017 ms
 Execution time: 11629.749 ms

Порядок ASC (тот же запрос):

explain analyze 
  select x2.x3, x2.x4 from 
    (select x5."id" as x3, x5."created" as x4 from "subscribe" x6, "event" x5 where 
      (x6."uid" = 'u1') and (x5."author" = x6."target_uid") 
  order by x5."created" limit 10) x2;

 Limit  (cost=0.85..30.08 rows=10 width=28) (actual time=453.712..453.813 rows=10 loops=1)
   ->  Nested Loop  (cost=0.85..529307.30 rows=181083 width=28) (actual time=453.710..453.807 rows=10 loops=1)
         ->  Index Scan using event_created_index on event x5  (cost=0.42..39295.00 rows=1002105 width=40) (actual time=31.938..214.687 rows=79015 loops=1)
         ->  Index Only Scan using subscribe_uid_index on subscribe x6  (cost=0.43..0.48 rows=1 width=14) (actual time=0.003..0.003 rows=0 loops=79015)
               Index Cond: ((uid = 'u1'::text) AND (target_uid = (x5.author)::text))
               Heap Fetches: 0
 Planning time: 121.426 ms
 Execution time: 454.235 ms

UPD 2 (результаты из ответа):

DROP INDEX event_author_index;
DROP INDEX event_created_index;
CREATE INDEX event_c1_index ON event (author, created);
REINDEX TABLE event;

DESC план заказа (для ASC та же):

 Limit  (cost=36782.56..36782.58 rows=10 width=28) (actual time=2186.408..2186.412 rows=10 loops=1)
   ->  Sort  (cost=36782.56..37235.26 rows=181083 width=28) (actual time=2186.407..2186.408 rows=10 loops=1)
         Sort Key: x5.created
         Sort Method: quicksort  Memory: 25kB
         ->  Merge Join  (cost=30037.41..32869.42 rows=181083 width=28) (actual time=2186.352..2186.374 rows=10 loops=1)
               Merge Cond: ((x5.author)::text = (x6.target_uid)::text)
               ->  Index Scan using event_c1_index on event x5  (cost=0.42..65573.44 rows=1002105 width=40) (actual time=38.211..112.868 rows=2101 loops=1)
               ->  Sort  (cost=30036.99..30037.57 rows=233 width=14) (actual time=2072.850..2072.852 rows=6 loops=1)
                     Sort Key: x6.target_uid
                     Sort Method: quicksort  Memory: 25kB
                     ->  Seq Scan on subscribe x6  (cost=0.00..30027.83 rows=233 width=14) (actual time=0.010..2072.823 rows=2 loops=1)
                           Filter: ((uid)::text = 'u1'::text)
                           Rows Removed by Filter: 1214224
 Planning time: 118.962 ms
 Execution time: 2186.460 ms

Производительность увеличена. Но стоимость резко увеличивается. Это нормально?

1

sql postgresql indexing slick postgresql-performance

Источник

user2921380 31 мар '16 в 18:22

1 ответ

Решение

Другие вопросы по тегам sql postgresql indexing slick postgresql-performance

user2115135 31 мар '16 в 20:07 2016-03-31 20:07 · Accepted Answer · 2016-03-31 20:07

Порядок индекса в вашем запросе не имеет значения, если вы добавите индекс с порядком DSEC, результат будет таким же. Что важно loops=79,015 против loops=923,101, Ваши данные таковы, что pg должен сделать в 10 раз больше проверок на subscribe_uid_index чтобы получить желаемое количество результатов.

Попробуйте вместо этого:

CREATE INDEX subscribe_target_uid_index ON subscribe (target_uid, uid);

Или даже:

CREATE INDEX subscribe_target_uid_index_f ON subscribe (target_uid) WHERE "uid" = 'u1'