资讯

历史

科技

环境与自然

成长

游戏

财经

文学与艺术

美食

健康

家居

文化

情感

汽车

三农

军事

旅行

运动

教育

生活

星座命理

PostgreSQL vs MySQL：谁的执行计划更胜一筹？

创作时间:

2025-01-22 19:47:49

作者:

@小白创作中心

PostgreSQL vs MySQL：谁的执行计划更胜一筹？

在数据库性能优化领域，PostgreSQL和MySQL一直是备受关注的焦点。本文将对比这两种主流数据库的执行计划优化策略，从生成执行计划的方法、核心要素解析到实战案例，全面剖析它们的优势和适用场景。无论你是数据库新手还是资深专家，都能从中获得有价值的见解，提升你的数据库管理技巧。

执行计划生成方法

PostgreSQL的执行计划生成

PostgreSQL使用EXPLAIN ANALYZE命令来生成和分析执行计划。这个命令会实际执行SQL语句，并提供详细的执行统计信息，包括每个操作的执行时间、行数等。通过分析这些信息，可以深入了解查询的执行过程，识别性能瓶颈。

例如，以下是一个简单的查询及其执行计划：

EXPLAIN ANALYZE SELECT *
FROM tenk1 t1, tenk2 t2
WHERE t1.unique1 < 100 AND t1.unique2 = t2.unique2
ORDER BY t1.fivethous;

执行计划如下：

QUERY PLAN
--------------------------------------------------------------------------------------------------------------------------------------------
Sort  (cost=717.34..717.59 rows=101 width=488) (actual time=7.761..7.774 rows=100 loops=1)
  Sort Key: t1.fivethous
  Sort Method: quicksort  Memory: 77kB
  ->  Hash Join  (cost=230.47..713.98 rows=101 width=488) (actual time=0.711..7.427 rows=100 loops=1)
        Hash Cond: (t2.unique2 = t1.unique2)
        ->  Seq Scan on tenk2 t2  (cost=0.00..445.00 rows=10000 width=244) (actual time=0.007..2.583 rows=10000 loops=1)
        ->  Hash  (cost=229.20..229.20 rows=101 width=244) (actual time=0.659..0.659 rows=100 loops=1)
              Buckets: 1024  Batches: 1  Memory Usage: 28kB
              ->  Bitmap Heap Scan on tenk1 t1  (cost=5.07..229.20 rows=101 width=244) (actual time=0.080..0.526 rows=100 loops=1)
                    Recheck Cond: (unique1 < 100)
                    ->  Bitmap Index Scan on tenk1_unique1  (cost=0.00..5.04 rows=101 width=0) (actual time=0.049..0.049 rows=100 loops=1)
                          Index Cond: (unique1 < 100)
Planning time: 0.194 ms
Execution time: 8.008 ms

从执行计划中可以看出，PostgreSQL使用了哈希连接（Hash Join）和位图堆扫描（Bitmap Heap Scan）等优化策略，最终实现了高效的查询执行。

MySQL的执行计划生成

MySQL的执行计划生成主要依赖于EXPLAIN命令。与PostgreSQL类似，EXPLAIN命令可以显示查询的执行计划，帮助开发者理解查询的执行过程。但是，MySQL的EXPLAIN命令不会实际执行查询，而是仅显示可能的执行计划。

例如，以下是一个简单的查询及其执行计划：

EXPLAIN SELECT *
FROM tenk1 t1, tenk2 t2
WHERE t1.unique1 < 100 AND t1.unique2 = t2.unique2
ORDER BY t1.fivethous;

执行计划如下：

+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+----------------+
| id | select_type | table | partitions | type | possible_keys | key  | key_len | ref  | rows | filtered | Extra          |
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+----------------+
|  1 | SIMPLE      | t1    | NULL       | ALL  | NULL          | NULL | NULL    | NULL | 10000|   100.00 | Using where    |
|  1 | SIMPLE      | t2    | NULL       | ALL  | NULL          | NULL | NULL    | NULL | 10000|   100.00 | Using where; Using join buffer (Block Nested Loop) |
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+----------------+

从执行计划中可以看出，MySQL选择了全表扫描（ALL）的策略，并使用了块嵌套循环连接（Block Nested Loop）。

执行计划对比分析

优化策略对比

PostgreSQL和MySQL在执行计划的优化策略上存在一些差异。PostgreSQL更倾向于使用位图扫描、哈希连接等高级优化策略，而MySQL则更多地依赖于索引和嵌套循环连接。

例如，在处理大量数据时，PostgreSQL的哈希连接通常比MySQL的嵌套循环连接更高效。但是，在某些情况下，MySQL的索引优化策略可能更优。

性能对比

在性能方面，PostgreSQL和MySQL各有优劣。PostgreSQL在复杂查询和大数据量处理方面通常表现更优，而MySQL在简单查询和高并发场景下可能更具优势。

例如，PostgreSQL的执行计划通常能更好地利用多核CPU的并行处理能力，而MySQL在单线程执行效率方面可能更优。

实战案例分析

为了更好地理解两种数据库的执行计划差异，我们来看一个具体的案例。假设我们有两个表：orders和order_items，我们需要查询每个订单的总金额。

PostgreSQL的执行计划

EXPLAIN ANALYZE
SELECT o.order_id, SUM(oi.quantity * oi.price) as total_amount
FROM orders o
JOIN order_items oi ON o.order_id = oi.order_id
GROUP BY o.order_id;

执行计划如下：

HashAggregate  (cost=37.50..37.52 rows=1 width=12) (actual time=0.204..0.207 rows=10 loops=1)
  Group Key: o.order_id
  ->  Hash Join  (cost=1.13..37.49 rows=1 width=12) (actual time=0.037..0.170 rows=100 loops=1)
        Hash Cond: (oi.order_id = o.order_id)
        ->  Seq Scan on order_items oi  (cost=0.00..32.00 rows=1000 width=8) (actual time=0.005..0.064 rows=1000 loops=1)
        ->  Hash  (cost=1.05..1.05 rows=5 width=4) (actual time=0.017..0.017 rows=10 loops=1)
              Buckets: 1024  Batches: 1  Memory Usage: 9kB
              ->  Seq Scan on orders o  (cost=0.00..1.05 rows=5 width=4) (actual time=0.003..0.005 rows=10 loops=1)
Planning time: 0.103 ms
Execution time: 0.244 ms

MySQL的执行计划

EXPLAIN
SELECT o.order_id, SUM(oi.quantity * oi.price) as total_amount
FROM orders o
JOIN order_items oi ON o.order_id = oi.order_id
GROUP BY o.order_id;

执行计划如下：

+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+----------------+
| id | select_type | table | partitions | type | possible_keys | key  | key_len | ref  | rows | filtered | Extra          |
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+----------------+
|  1 | SIMPLE      | o     | NULL       | ALL  | NULL          | NULL | NULL    | NULL | 10   |   100.00 | Using temporary; Using filesort |
|  1 | SIMPLE      | oi    | NULL       | ALL  | NULL          | NULL | NULL    | NULL | 1000 |   100.00 | Using where; Using join buffer (Block Nested Loop) |
+----+-------------+-------+------------+------+---------------+------+---------+------+------+----------+----------------+

从这个案例中可以看出，PostgreSQL使用了哈希连接和聚合优化策略，而MySQL则使用了全表扫描和临时表排序。在处理大量数据时，PostgreSQL的执行计划通常更优。