PostgreSQL索引失效会发生什么?_F11 - 专业站长和开发者的学习网站

网站首页收藏本站

广告位联系

APP正在开发中...

返回顶部

分享到

PostgreSQL索引失效会发生什么?

数据库其他 来源：互联网作者：佚名发布时间：2022-09-22 08:59:46 人浏览

摘要

前段时间碰到个奇怪的索引失效的问题，实际情况类似下面这样： 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 bill=# begin; BEGIN bill=*# create index idx_t1 on t1(id); CREATE INDEX bill=*# explain select * f

前段时间碰到个奇怪的索引失效的问题，实际情况类似下面这样：

bill=# begin;

BEGIN

bill=*# create index idx_t1 on t1(id);

CREATE INDEX

bill=*# explain select * from t1 where id = 1;

QUERY PLAN

----------------------------------------------------

Seq Scan on t1 (cost=0.00..25.88 rows=6 width=36)

Filter: (id = 1)

(2 rows)

bill=*# end;

COMMIT

bill=# explain select * from t1 where id = 1;

QUERY PLAN

---------------------------------------------------------------------

Bitmap Heap Scan on t1 (cost=1.50..7.01 rows=6 width=36)

Recheck Cond: (id = 1)

-> Bitmap Index Scan on idx_t1 (cost=0.00..1.50 rows=6 width=0)

Index Cond: (id = 1)

(4 rows)

很显然的问题就是，我在事务中创建了索引，却没办法使用。但是当事务提交了后便可以正常使用了，这是什么情况呢？

这个其实和pg_index中indcheckxmin属性有关，关于这个字段的解释如下：

If true, queries must not use the index until the xmin of this pg_index row is below their TransactionXmin event horizon, because the table may contain broken HOT chains with incompatible rows that they can see

经检查也确实如此:

bill=*# select indcheckxmin from pg_index where indexrelid = 'idx_t1'::regclass;

indcheckxmin

--------------

(1 row)

那么问题来了，什么情况下创建索引时会将索引的该属性设置为true呢？

从前面官方文档对于该字段的解释，如果表中包含broken HOT chains 则会为true，那什么是broken HOT chains ？似乎和HOT机制有关。那是不是只有存在broken HOT chains 才会设置为true呢？

这里就不卖关子了，直接给出结论，然后我们再去一一验证。

经测试发现，以下两种情况会导致索引的indcheckxmin设置为true：

当前事务中表上存在broken HOT chains，即官方文档中所说；
当old_snapshot_threshold被设置时。

场景一：broken HOT chains

这种情况，只要在当前事务中表中存在HOT更新的行时就会存在。那么什么时候会进行HOT更新呢？两个前提：

新的元组和旧元组必须在同一个page中；
索引字段不能进行更新。

既然如此，实际中常见的两种情况就是：

对表上最后一个page进行更新；
表设置了fillfactor，即每个page上有预留的空闲空间。

例子：

表中插入10条数据，自然只有1个page：

1 2	bill=# insert into t1 select generate_series(1,10),md5(random()::text); INSERT 0 10

进行更新：

1 2	bill=# update t1 set info = 'bill' where id = 10; UPDATE 1

查看发现的确是HOT更新：

关于t_infomask2字段的解释这里就不再赘述。

接下来我们创建索引：

可以发现indcheckxmin被设置为true，在当前事务中索引不可用。

经过验证，在index_build阶段，判断到BrokenHotChain，便将indcheckxmin修改为true。

具体的修改代码如下：

/*此时indexInfo->ii_BrokenHotChain已被修改为true */

if ((indexInfo->ii_BrokenHotChain || EarlyPruningEnabled(heapRelation)) &&

!isreindex &&

!indexInfo->ii_Concurrent)

{

Oid indexId = RelationGetRelid(indexRelation);

Relation pg_index;

HeapTuple indexTuple;

Form_pg_index indexForm;

pg_index = table_open(IndexRelationId, RowExclusiveLock);

indexTuple = SearchSysCacheCopy1(INDEXRELID,

ObjectIdGetDatum(indexId));

if (!HeapTupleIsValid(indexTuple))

elog(ERROR, "cache lookup failed for index %u", indexId);

indexForm = (Form_pg_index) GETSTRUCT(indexTuple);

/* If it's a new index, indcheckxmin shouldn't be set ... */

Assert(!indexForm->indcheckxmin);

/*将indcheckxmin修改为true */

indexForm->indcheckxmin = true;

CatalogTupleUpdate(pg_index, &indexTuple->t_self, indexTuple);

heap_freetuple(indexTuple);

table_close(pg_index, RowExclusiveLock);

}

同样我们也可以验证得知，的确是因为brokenhotchains导致的indcheckxmin被设置为true。

场景二：old_snapshot_threshold

先来看例子：

最简单的场景，完全的一张空表，在事务中创建索引indcheckxmin就会被设置为true，果然索引也是不可用。

bill=# drop table t1;

DROP TABLE

bill=# create table t1(id int,info text);

CREATE TABLE

bill=# begin;

BEGIN

bill=*# create index idx_t1 on t1(id);

CREATE INDEX

bill=*# select indcheckxmin from pg_index where indexrelid = 'idx_t1'::regclass;

indcheckxmin

--------------

(1 row)

bill=*# explain select * from t1 where id = 1;

QUERY PLAN

----------------------------------------------------

Seq Scan on t1 (cost=0.00..25.88 rows=6 width=36)

Filter: (id = 1)

(2 rows)

那么为什么old_snapshot_threshold会产生这样的影响呢？

经过跟踪发现，当开启该参数时，在事务中创建索引的snapshotdata结构如下：

(SnapshotData) $6 = {
snapshot_type = SNAPSHOT_MVCC
xmin = 856
xmax = 856
xip = 0x00007fd55c804fc0
xcnt = 0
subxip = 0x00007fd55ad5d000
subxcnt = 0
suboverflowed = false
takenDuringRecovery = false
copied = false
curcid = 1
speculativeToken = 0
vistest = NULL
active_count = 0
regd_count = 0
ph_node = {
first_child = NULL
next_sibling = NULL
prev_or_parent = NULL
}
whenTaken = 691752041261069
lsn = 208079736
}

而禁用该参数呢？

(SnapshotData) $7 = {
snapshot_type = SNAPSHOT_MVCC
xmin = 828
xmax = 828
xip = 0x00007fad31704780
xcnt = 0
subxip = 0x00007fad3155d000
subxcnt = 0
suboverflowed = false
takenDuringRecovery = false
copied = false
curcid = 1
speculativeToken = 0
active_count = 0
regd_count = 0
ph_node = {
first_child = NULL
next_sibling = NULL
prev_or_parent = NULL
}
whenTaken = 0
lsn = 0
}

可以看到，区别在于不使用该参数时，创建snapshotdata不会设置whenTaken和lsn，那么这两个参数是干嘛的呢？

先来看看snapshotdata的结构：

typedef struct SnapshotData

{

SnapshotType snapshot_type; /* type of snapshot */

* The remaining fields are used only for MVCC snapshots, and are normally

* just zeroes in special snapshots. (But xmin and xmax are used

* specially by HeapTupleSatisfiesDirty, and xmin is used specially by

* HeapTupleSatisfiesNonVacuumable.)

* An MVCC snapshot can never see the effects of XIDs >= xmax. It can see

* the effects of all older XIDs except those listed in the snapshot. xmin

* is stored as an optimization to avoid needing to search the XID arrays

* for most tuples.

TransactionId xmin; /* all XID < xmin are visible to me */

TransactionId xmax; /* all XID >= xmax are invisible to me */

* For normal MVCC snapshot this contains the all xact IDs that are in

* progress, unless the snapshot was taken during recovery in which case

* it's empty. For historic MVCC snapshots, the meaning is inverted, i.e.

* it contains *committed* transactions between xmin and xmax.

* note: all ids in xip[] satisfy xmin <= xip[i] < xmax

TransactionId *xip;

uint32 xcnt; /* # of xact ids in xip[] */

* For non-historic MVCC snapshots, this contains subxact IDs that are in

* progress (and other transactions that are in progress if taken during

* recovery). For historic snapshot it contains *all* xids assigned to the

* replayed transaction, including the toplevel xid.

* note: all ids in subxip[] are >= xmin, but we don't bother filtering

* out any that are >= xmax

TransactionId *subxip;

int32 subxcnt; /* # of xact ids in subxip[] */

bool suboverflowed; /* has the subxip array overflowed? */

bool takenDuringRecovery; /* recovery-shaped snapshot? */

bool copied; /* false if it's a static snapshot */

CommandId curcid; /* in my xact, CID < curcid are visible */

* An extra return value for HeapTupleSatisfiesDirty, not used in MVCC

* snapshots.

uint32 speculativeToken;

* For SNAPSHOT_NON_VACUUMABLE (and hopefully more in the future) this is

* used to determine whether row could be vacuumed.

struct GlobalVisState *vistest;

* Book-keeping information, used by the snapshot manager

uint32 active_count; /* refcount on ActiveSnapshot stack */

uint32 regd_count; /* refcount on RegisteredSnapshots */

pairingheap_node ph_node; /* link in the RegisteredSnapshots heap */

TimestampTz whenTaken; /* timestamp when snapshot was taken */

XLogRecPtr lsn; /* position in the WAL stream when taken */

* The transaction completion count at the time GetSnapshotData() built

* this snapshot. Allows to avoid re-computing static snapshots when no

* transactions completed since the last GetSnapshotData().

uint64 snapXactCompletionCount;

} SnapshotData;

如上所示，TimestampTz表示snapshot何时产生的，为什么启用old_snapshot_threshold时会设置该值呢？

因为该值正是用来判断快照是否过旧的：

* Implement slower/larger portions of TestForOldSnapshot

* Smaller/faster portions are put inline, but the entire set of logic is too

* big for that.

void

TestForOldSnapshot_impl(Snapshot snapshot, Relation relation)

{

if (RelationAllowsEarlyPruning(relation)

&& (snapshot)->whenTaken < GetOldSnapshotThresholdTimestamp())

ereport(ERROR,

(errcode(ERRCODE_SNAPSHOT_TOO_OLD),

errmsg("snapshot too old")));

}

这样我们也比较好理解为什么设置了该参数时创建的索引在当前事务中不可用：

因为我们不设置该参数时，在事务中创建索引是可以保证MVCC的一致性，那么索引便是安全可用的。

而使用参数时，由于TimestampTz被设置，数据库会对其进行判断该行数据是否已经过期，如果过期了那便会被清理掉，这样对于索引来说便是不安全的，没法保证数据的一致性，对于不是hot-safe的索引，自然要将其indcheckxmin设置为true，防止在事务中创建索引后数据实际已经过期被删除的情况。

* At this moment we are sure that there are no transactions with the

* table open for write that don't have this new index in their list of

* indexes. We have waited out all the existing transactions and any new

* transaction will have the new index in its list, but the index is still

* marked as "not-ready-for-inserts". The index is consulted while

* deciding HOT-safety though. This arrangement ensures that no new HOT

* chains can be created where the new tuple and the old tuple in the

* chain have different index keys.

* We now take a new snapshot, and build the index using all tuples that

* are visible in this snapshot. We can be sure that any HOT updates to

* these tuples will be compatible with the index, since any updates made

* by transactions that didn't know about the index are now committed or

* rolled back. Thus, each visible tuple is either the end of its

* HOT-chain or the extension of the chain is HOT-safe for this index.

总结

当pg_index的indcheckxmin字段被设置为true时，直到此pg_index行的xmin低于查询的TransactionXmin视界之前，查询都不能使用此索引。

而产生这种现象主要有两种情况：

1. 表上在当前事务中存在broken HOT chains；

2. old_snapshot_threshold被设置时。

您可能感兴趣的文章 :

原文链接 : https://foucus.blog.csdn.net/article/details/121701242

Tag : 索引(20)PostgreSQL(22)

六大主流数据同步工具对比：DataX、Airbyte、Cana

当数据量变大、数据源复杂、实时需求提高，很多团队在选数据同步工具时犯难。本文对 DataX、Airbyte、Canal、Debezium、Fivetran 与 Apache SeaTun
解读SQL生成工具

SQL 生成工具可用于测试Parser与其他数据库产品的兼容性，通过解析YACC语法文件中的产生式，生成对应的SQL语句，再使用数据库执行该SQL，根
SQLite3在嵌入式C环境中存储音频/视频文件的最优

SQLite3 在嵌入式C环境中存储音频/视频文件的专业方案在嵌入式系统中存储大型媒体文件需要平衡存储效率、访问速度和资源限制。以下是针
嵌入式数据库SQLite 3配置使用

0、惨痛教训随着管理开发的项目体积越来越庞大，产品系统涉及的数据量也越来越多，并且伴随着项目不久就要交付给甲方了。如果项目的
Sqlite3基本语句及安装过程

SQLite3简介 SQLite3是一款轻量级的、基于文件的开源关系型数据库引擎，由 D. Richard Hipp 于 2000 年首次发布。它遵循 SQL 标准，但与传统的数据
在SQLite中进行批量操作的有效实现方法

SQLite 是一个轻量级的关系型数据库管理系统，因其高效性和易用性而广受欢迎。在许多应用场景中，批量操作的需求是不可避免的，例如在
一文介绍在Hive中NULL的理解

在 Hive 中，NULL 是一个特殊的值，表示未知或缺失。任何与NULL的比较操作（如=,,,=,=,）都会返回NULL，而不是TRUE或FALSE。 1.NULL 的比较规则在
Navicat Premium 12数据库管理解决方案

Navicat Premium 12是一款全面的数据库管理工具，支持多种数据库系统如MySQL、MariaDB、Oracle、SQL Server、PostgreSQL等。它提供了多数据库连接、数据
sqlite3命令行工具使用介绍

一、启动与退出启动数据库连接 1 2 3 sqlite3 [database_file] # 打开/创建数据库文件（如 test.db） sqlite3 # 启动临时内存数据库 (:memory:) sqlite3 :m
StarRocks简介与搭建使用介绍

StarRocks简介 StarRocks 是一款高速、实时、全场景的MPP（大规模并行处理）分析型数据库系统，专为现代数据分析场景设计，强调亚秒级查询性