MariaDB 實現函數索引

原創

2020-02-22 21:36

我們知道MySQL 暫時不支持函數索引。目前大部分數據庫包括PostgreSQL,Oracle等都支持。什麼是函數索引呢？
函數索引就是說用某固定的函數來對列生成一個基於此函數結果集的索引樹。好處是開發人員寫SQL變得隨意而且簡單了，但是不好的一點也是如此，必須寫按照固定條件進行的讀取過濾。

在之前呢，如果要實現這樣的功能，MySQL 得創建一個新的列，然後用前置觸發器來修改此列的值。現在呢，MariaDB有一個虛擬列的特性可以很方便的來實現這個目的。
先來看下在PostgreSQL中的表結構

t_girl=# \d email_list;
             Table "public.email_list"
  Column  |            Type             | Modifiers 
----------+-----------------------------+-----------
 id       | integer                     | 
 email    | character varying(200)      | 
 log_time | timestamp without time zone | 
Indexes:
    "idx_email_suffix" btree (substr(email::text, "position"(email::text, '@'::text) + 1))

這張表的EMAIL列屬性上有一個函數索引，目的是來查找次EMAIL屬性是屬於哪家提供商，比如163，GMAIL等等。

我們給張表產生了20W行記錄。

t_girl=# select count(*) from email_list;
 count  
--------
 200000
(1 row)


Time: 39.851 ms

現在來進行對應的查詢。如果不嚴格按照這個函數的創建規範，查詢就不走索引，所以一定要嚴格來寫SQL。

                                                              QUERY PLAN                                                              
--------------------------------------------------------------------------------------------------------------------------------------
 Aggregate  (cost=1607.19..1607.20 rows=1 width=12) (actual time=5.514..5.514 rows=1 loops=1)
   ->  Bitmap Heap Scan on email_list  (cost=48.29..1602.08 rows=2047 width=12) (actual time=1.126..4.806 rows=1960 loops=1)
         Recheck Cond: (substr((email)::text, ("position"((email)::text, '@'::text) + 1)) = '56.com'::text)
         ->  Bitmap Index Scan on idx_email_suffix  (cost=0.00..47.78 rows=2047 width=0) (actual time=0.802..0.802 rows=1960 loops=1)
               Index Cond: (substr((email)::text, ("position"((email)::text, '@'::text) + 1)) = '56.com'::text)
 Total runtime: 5.603 ms
(6 rows)


Time: 6.601 ms

從查詢分析計劃中看到，走這個函數索引，掃描了大概2K行記錄，生成結果集1960行。

t_girl=# select count(email) as num from email_list where substr(email,position('@' in email)+1)='56.com';
 num  
------
 1960
(1 row)


Time: 5.251 ms
t_girl=#

接下來，我們看看在MariaDB中如何來實現對應的功能。
表結構如下：

MariaDB [t_girl]> show create table email_list;
+------------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| Table      | Create Table                                                                                                                                                                                                                                                                                                |
+------------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
| email_list | CREATE TABLE `email_list` (
  `id` int(11) DEFAULT NULL,
  `email` varchar(200) DEFAULT NULL,
  `log_time` datetime(6) DEFAULT NULL,
  `email_suffix` varchar(100) AS (substr(email,position('@' in email)+1)) PERSISTENT,
  KEY `idx_email_suffix` (`email_suffix`)
) ENGINE=InnoDB DEFAULT CHARSET=latin1 |
+------------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+
1 row in set (0.01 sec)

這裏我們用到了MariaDB的虛擬列，並且對虛擬列指定persistent屬性，這樣就能當成真實的屬性來看待了。
行，下來我們利用這個虛擬列來進行查詢，不過這樣反而簡單點，查詢語句不需要那麼嚴格了，直接跟普通的語句一樣。

MariaDB [t_girl]> explain select count(email) from email_list where email_suffix = '56.com';
+------+-------------+------------+------+------------------+------------------+---------+-------+------+-----------------------+
| id   | select_type | table      | type | possible_keys    | key              | key_len | ref   | rows | Extra                 |
+------+-------------+------------+------+------------------+------------------+---------+-------+------+-----------------------+
|    1 | SIMPLE      | email_list | ref  | idx_email_suffix | idx_email_suffix | 103     | const | 1959 | Using index condition |
+------+-------------+------------+------+------------------+------------------+---------+-------+------+-----------------------+
1 row in set (0.02 sec)

查詢速度當然也就非常快了。

MariaDB [t_girl]> select count(email) from email_list where email_suffix = '56.com';         
+--------------+
| count(email) |
+--------------+
|         1960 |
+--------------+
1 row in set (0.02 sec)

懶得去死

發佈了97 篇原創文章 · 獲贊 2 · 訪問量 47萬+

私信關注

發表評論

所有評論

還沒有人評論，想成為第一個評論的人麼? 請在上方評論欄輸入並且點擊發布.

MariaDB 實現函數索引

如何使用 JS 判斷用戶是否處於活躍狀態

Mono 支持LoongArch架構

lightdb秒級增加列和刪除列（not null帶默認值）

lightdb數據庫超時相關控制參數

通過HPA+CronHPA組合應對業務複雜彈性伸縮場景

❤️‍🔥 Solon Cloud Event 新的事務特性與應用

lightdb mysql 8.0兼容之不可見主鍵

使用 JS 實現在瀏覽器控制檯打印圖片 console.image()

基於Ubuntu-22.04安裝K8s-v1.28.2實驗（四）使用域名訪問網站應用

POSTGRESQL 分區表初次體驗

狀態值在數據庫中的檢索

MySQL 存儲過程調試工具商業和免費

PostgreSQL 實現MySQL "insert ignore" 語法。

TokuDB和InnoDB的讀寫分析與比較

https://yachay.unat.edu.pe/blog/index.php?comment_area=format_blog&comment_component=blog&comment_co

linux以太網驅動總結