Другой планировщик и время запроса на разработку postgres против производственной базы данных

Я сталкиваюсь со странной проблемой. Разрабатывая новые возможности моего приложения, я создаю дампы производственной базы данных, загружая их в свою среду разработки. Если я пытаюсь выполнить один и тот же запрос в процессе разработки и производства, я получаю 2 разных плана, возвращая действительно разные времена запроса:

Запрос следующий:

 SELECT     film_occupations.id,
           Upper(theaters.NAME) AS theater_name,
           Upper(films.title)   AS film_title,
           film_occupations.start,
           To_char(film_occupations.start at time zone 'UTC' at time zone 'Europe/Rome',    'HH24MI')                                                   AS projections_day,
           count_seats.seats_count - COALESCE(count_tickets.counter, 0) - COALESCE(count_reservations.counter, 0) - COALESCE(count_locks.counter,     0) AS available_seats
FROM       "film_occupations"
INNER JOIN "theaters"
ON         "theaters"."id" = "film_occupations"."theater_id"
INNER JOIN "films"
ON         "films"."id" = "film_occupations"."film_id"
LEFT JOIN
           (
                  SELECT seats_count,
                         id
                  FROM   theaters) AS count_seats
ON         count_seats.id = film_occupations.theater_id
LEFT JOIN
           (
                    SELECT   Count(id) AS counter,
                             film_occupation_id
                    FROM     film_tickets
                    WHERE    film_tickets.void = 'f'
                    GROUP BY film_occupation_id) AS count_tickets
ON         count_tickets.film_occupation_id = film_occupations.id
LEFT JOIN
           (
                    SELECT   Count(id) AS counter,
                             film_occupation_id
                    FROM     film_reservations
                    WHERE    film_reservations.archived = 'f'
                    AND      film_reservations.film_ticket_id IS NULL
                    GROUP BY film_occupation_id) AS count_reservations
ON         count_reservations.film_occupation_id = film_occupations.id
LEFT JOIN
           (
                    SELECT   Count(id) AS counter,
                             film_occupation_id
                    FROM     seat_locks
                    WHERE    seat_locks.persistent = 't'
                    GROUP BY film_occupation_id) AS count_locks
ON         count_locks.film_occupation_id = film_occupations.id
WHERE      "film_occupations"."subsystem_id" IN (1,
                                                 3)
AND        "theaters"."cinema_id" = 7
AND        "film_occupations"."hidden" = 'f'
AND        ((
                                 NOT disable_online)
           OR         (
                                 NOT disable_counter))
AND        (
                      film_occupations.start > '2016-08-09 11:02:45.010316'
           AND        film_occupations.start < '2016-08-10 02:59:59.999999')
AND        ((
                                 film_occupations.disable_online IS NULL
                      OR         film_occupations.disable_online = 'f')
           OR         (
                                 film_occupations.disable_online_booking IS NULL
                      OR         film_occupations.disable_online_booking = 'f')
           OR         (
                                 film_occupations.disable_counter IS NULL
                      OR         film_occupations.disable_counter = 'f')
           OR         (
                                 online_visibility_expiration >= '2016-08-09'
                      OR         online_visibility_expiration IS NULL))
GROUP BY   film_occupations.id,
           theater_name,
           film_title,
           film_occupations.start,
           projections_day,
           theaters.seats_count,
           count_seats.seats_count,
           count_tickets.counter,
           count_reservations.counter,
           count_locks.counter
ORDER BY   theater_name,
           film_title,
           film_occupations.start ASC,
           film_title 

Он генерирует следующий план в запущенном в разработку:

  QUERY PLAN
  --------------------------------------------------------------------------------------------------------------------------------------------------    --------------------------------------------------------------------------------------------------------------------------------------------------    --------------------------------------------------------------------------------------------------------------------------------------------------      ------------------------------------------------------------------------      -----------------------------------------------------------------------------------------------------------------
   Group  (cost=41835.73..41835.78 rows=1 width=583)
     Group Key: (upper((theaters.name)::text)), (upper((films.title)::text)), film_occupations.start, film_occupations.id, (to_char(timezone('Europe    /Rome'::text, (film_occupations.start)::timestamp with time zone), 'HH24MI'::text)), theaters.seats_count, theaters_1.seats_count, (count(   film_tickets.id)), (count(film_reservations.id)), (count(seat_locks.id))
     ->  Sort  (cost=41835.73..41835.73 rows=1 width=583)
           Sort Key: (upper((theaters.name)::text)), (upper((films.title)::text)), film_occupations.start, film_occupations.id, (to_char(      timezone('Europe/Rome'::text, (film_occupations.start)::timestamp with time zone), 'HH24MI'::text)), theaters.seats_count,       theaters_1.seats_count, (count(film_tickets.id)), (count(film_reservations.id)), (count(seat_locks.id))
           ->  Nested Loop Left Join  (cost=40790.60..41835.72 rows=1 width=583)
                 Join Filter: (seat_locks.film_occupation_id = film_occupations.id)
                 ->  Nested Loop Left Join  (cost=40782.32..41826.56 rows=1 width=575)
                       Join Filter: (film_reservations.film_occupation_id = film_occupations.id)
                       ->  Nested Loop Left Join  (cost=40325.03..41369.14 rows=1 width=567)
                             Join Filter: (film_tickets.film_occupation_id = film_occupations.id)
                             ->  Nested Loop Left Join  (cost=19.99..544.35 rows=1 width=559)
                                   Join Filter: (theaters_1.id = film_occupations.theater_id)
                                   ->  Nested Loop  (cost=19.99..542.68 rows=1 width=559)
                                         ->  Nested Loop  (cost=19.71..534.37 rows=1 width=540)
                                               ->  Seq Scan on theaters  (cost=0.00..1.38 rows=1 width=524)
                                                     Filter: (cinema_id = 7)
                                               ->  Bitmap Heap Scan on film_occupations  (cost=19.71..532.99 rows=1 width=20)
                                                     Recheck Cond: (theater_id = theaters.id)
                                                     Filter: ((NOT hidden) AND ((NOT disable_online) OR (NOT disable_counter)) AND (subsystem_id =      ANY ('{1,3}'::integer[])) AND (start > '2016-08-09 11:02:45.010316'::timestamp without time     zone) AND (start < '2016-08-10 02:59:59.999999'::timestamp without time zone) AND ((    disable_online IS NULL) OR (NOT disable_online) OR (disable_online_booking IS NULL) OR (NOT   disable_online_booking) OR (disable_counter IS NULL) OR (NOT disable_counter) OR (      online_visibility_expiration >= '2016-08-09 00:00:00'::timestamp without time zone) OR (     online_visibility_expiration IS NULL)))
                                                     ->  Bitmap Index Scan on index_film_occupations_on_theater_id  (cost=0.00..19.71 rows=990       width=0)
                                                           Index Cond: (theater_id = theaters.id)
                                         ->  Index Scan using films_pkey on films  (cost=0.28..8.29 rows=1 width=27)
                                               Index Cond: (id = film_occupations.film_id)
                                   ->  Seq Scan on theaters theaters_1  (cost=0.00..1.30 rows=30 width=8)
                             ->  HashAggregate  (cost=40305.04..40464.96 rows=15992 width=8)
                                   Group Key: film_tickets.film_occupation_id
                                   ->  Seq Scan on film_tickets  (cost=0.00..36098.88 rows=841233 width=8)
                                         Filter: (NOT void)
                       ->  HashAggregate  (cost=457.29..457.33 rows=4 width=8)
                             Group Key: film_reservations.film_occupation_id
                             ->  Seq Scan on film_reservations  (cost=0.00..457.17 rows=24 width=8)
                                   Filter: ((NOT archived) AND (film_ticket_id IS NULL))
                 ->  HashAggregate  (cost=8.28..8.54 rows=27 width=8)
                       Group Key: seat_locks.film_occupation_id
                       ->  Seq Scan on seat_locks  (cost=0.00..6.85 rows=285 width=8)
                             Filter: persistent
  (36 rows)

с общим пробегом всегда менее 500 мс, пока он планирует это:

QUERY PLAN
-------------------------------------------------------------------------------------------------------------------------------------------------   -------------------------------------------------------------------------------------------------------------------------------------------------    -------------------------------------------------------------------------------------------------------------------------------------------------   -----------------------------------------------------------------------    ----------------------------------------------------------------------------------------------------------------------------------
Group  (cost=149898.67..149898.72 rows=1 width=73)
  ->  Sort  (cost=149898.67..149898.67 rows=1 width=73)
        Sort Key: (upper((theaters.name)::text)), (upper((films.title)::text)), film_occupations.start, film_occupations.id, (to_char(timezone('Europe/Rome'::text, timezone('UTC'::text, film_occupations.start)), 'HH24MI'::text)), theaters.seats_count, theaters_1.seats_count, (count(film_tickets.id)), (count(film_reservations.id)), (count(seat_locks.id))
        ->  Merge Right Join  (cost=143225.72..149898.66 rows=1 width=73)
              Merge Cond: (film_tickets.film_occupation_id = film_occupations.id)
              ->  GroupAggregate  (cost=141806.56..148269.62 rows=15939 width=8)
                    ->  Sort  (cost=141806.56..143907.78 rows=840489 width=8)
                          Sort Key: film_tickets.film_occupation_id
                          ->  Seq Scan on film_tickets  (cost=0.00..36110.79 rows=840489 width=8)
                                Filter: (NOT void)
              ->  Materialize  (cost=1419.16..1429.77 rows=1 width=65)
                    ->  Nested Loop Left Join  (cost=1419.16..1429.77 rows=1 width=65)
                          Join Filter: (seat_locks.film_occupation_id = film_occupations.id)
                          ->  Nested Loop Left Join  (cost=1410.88..1420.62 rows=1 width=57)
                                Join Filter: (theaters_1.id = film_occupations.theater_id)
                                ->  Nested Loop  (cost=1410.88..1418.94 rows=1 width=57)
                                      ->  Merge Left Join  (cost=1410.61..1410.64 rows=1 width=38)
                                            Merge Cond: (film_occupations.id = film_reservations.film_occupation_id)
                                            ->  Sort  (cost=953.39..953.40 rows=1 width=30)
                                                  Sort Key: film_occupations.id
                                                  ->  Merge Join  (cost=953.34..953.38 rows=1 width=30)
                                                        Merge Cond: (film_occupations.theater_id = theaters.id)
                                                        ->  Sort  (cost=951.91..951.92 rows=2 width=20)
                                                              Sort Key: film_occupations.theater_id
                                                              ->  Seq Scan on film_occupations  (cost=0.00..951.90 rows=2 width=20)
                                                                    Filter: ((NOT hidden) AND ((NOT disable_online) OR (NOT disable_counter)) AND (subsystem_id = ANY ('{1,3}'::integer[])) AND (start > '2016-08-09 11:24:23.934692'::timestamp without time zone) AND (start < '2016-08-10 02:59:59.999999'::timestamp without time zone) AND ((disable_online IS NULL) OR (NOT disable_online) OR (disable_online_booking IS NULL) OR (NOT disable_online_booking) OR (disable_counter IS NULL) OR (NOT disable_counter) OR (online_visibility_expiration >= '2016-08-09 00:00:00'::timestamp without time zone) OR (online_visibility_expiration IS NULL)))
                                                        ->  Sort  (cost=1.43..1.45 rows=5 width=14)
                                                              Sort Key: theaters.id
                                                              ->  Seq Scan on theaters  (cost=0.00..1.38 rows=5 width=14)
                                                                    Filter: (cinema_id = 7)
                                            ->  Sort  (cost=457.21..457.22 rows=3 width=12)
                                                  Sort Key: film_reservations.film_occupation_id
                                                  ->  HashAggregate  (cost=457.13..457.16 rows=3 width=8)
                                                        ->  Seq Scan on film_reservations  (cost=0.00..457.04 rows=18 width=8)
                                                              Filter: ((NOT archived) AND (film_ticket_id IS NULL))
                                      ->  Index Scan using films_pkey on films  (cost=0.28..8.29 rows=1 width=27)
                                            Index Cond: (id = film_occupations.film_id)
                                ->  Seq Scan on theaters theaters_1  (cost=0.00..1.30 rows=30 width=8)
                          ->  HashAggregate  (cost=8.28..8.54 rows=27 width=8)
                                ->  Seq Scan on seat_locks  (cost=0.00..6.85 rows=285 width=8)
                                      Filter: persistent
41 rows)

С временем работы свыше 1,5 с в производстве. Пожалуйста, учтите, что производственная база данных регулярно очищается пылесосом (я только что сделал подробный анализ вакуума, также, но ничего не изменилось).

Я думаю, что первое, что пойдет не так, это оценка стоимости (149898,67 в производстве против 41835,73 в dev). Как я могу улучшить планировщик в производстве?

Заранее спасибо!

Alberto

ОБНОВЛЕНИЕ здесь вы можете найти различия между 2 postgres.conf. пожалуйста, учтите, что все обнаруженные различия прокомментированы, так что я думаю, что оба pg используют значения по умолчанию (что я думаю, что они прокомментированы. Также обратите внимание, что dev работает под управлением v9.5.3, в то время как prod работает под управлением v9.3.4

value                 production development
work_mem              1MB        4MB
maintenance_work_mem  16MB       64MB
max_replication_slots missing    0
effective_cache_size  128MB      4GB

0 ответов

Другие вопросы по тегам