Changefeed Log Filters

TiCDC supports filtering data by tables and by events. This document describes how to use both types of filters.

Table filter

Table filter is a feature that allows you to keep or filter out specific databases and tables by specifying the following configurations:

    [filter]
    # Filter rules
    rules = ['*.*', '!test.*']

Common filter rules:

  • rules = ['*.*']
    • Replicate all tables (not including system tables)
  • rules = ['test1.*']
    • Replicate all tables in the test1 database
  • rules = ['*.*', '!scm1.tbl2']
    • Replicate all tables except for the scm1.tbl2 table
  • rules = ['scm1.tbl2', 'scm1.tbl3']
    • Only replicate tables scm1.tbl2 and scm1.tbl3
  • rules = ['scm1.tidb_*']
    • Replicate all tables in the scm1 database whose names start with tidb_

For more information, see Table filter syntax.
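
As a further illustration, inclusion and exclusion patterns like the ones listed above can be combined in a single list. The sketch below assumes that a `!` rule placed after a broader rule overrides it, in the same way that `'!scm1.tbl2'` overrides `'*.*'` in the list above. The database name `scm1` and the `tidb_` prefix are reused from those examples purely for illustration.

```toml
[filter]
# Replicate every table in scm1 except those whose names start with tidb_.
# Assumption: the later '!' rule takes precedence over the earlier 'scm1.*' rule.
rules = ['scm1.*', '!scm1.tidb_*']
```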

Event filter rules

Starting in v6.2.0, TiCDC supports event filters. You can configure event filter rules to filter out DML and DDL events that meet the specified conditions.

The following is an example of event filter rules:

    [filter]
    # The event filter rules must be under the `[filter]` configuration. You can configure multiple event filters at the same time.
    [[filter.event-filters]]
    matcher = ["test.worker"] # matcher is an allow list, which means this rule only applies to the worker table in the test database.
    ignore-event = ["insert"] # Ignore insert events.
    ignore-sql = ["^drop", "add column"] # Ignore DDLs that start with "drop" or contain "add column".
    ignore-delete-value-expr = "name = 'john'" # Ignore delete DMLs that contain the condition "name = 'john'".
    ignore-insert-value-expr = "id >= 100" # Ignore insert DMLs that contain the condition "id >= 100".
    ignore-update-old-value-expr = "age < 18 or name = 'lili'" # Ignore update DMLs whose old value contains "age < 18" or "name = 'lili'".
    ignore-update-new-value-expr = "gender = 'male' and age > 18" # Ignore update DMLs whose new value contains "gender = 'male'" and "age > 18".

Description of configuration parameters:

  • matcher: the database and table that this event filter rule applies to. The syntax is the same as table filter.
  • ignore-event: the event type to be ignored. This parameter accepts an array of strings. You can configure multiple event types. Currently, the following event types are supported:

    | Event                    | Type | Alias           | Description |
    | ------------------------ | ---- | --------------- | ----------- |
    | all dml                  |      |                 | Matches all DML events |
    | all ddl                  |      |                 | Matches all DDL events |
    | insert                   | DML  |                 | Matches insert DML event |
    | update                   | DML  |                 | Matches update DML event |
    | delete                   | DML  |                 | Matches delete DML event |
    | create schema            | DDL  | create database | Matches create database event |
    | drop schema              | DDL  | drop database   | Matches drop database event |
    | create table             | DDL  |                 | Matches create table event |
    | drop table               | DDL  |                 | Matches drop table event |
    | rename table             | DDL  |                 | Matches rename table event |
    | truncate table           | DDL  |                 | Matches truncate table event |
    | alter table              | DDL  |                 | Matches alter table event, including all clauses of alter table, create index and drop index |
    | add table partition      | DDL  |                 | Matches add table partition event |
    | drop table partition     | DDL  |                 | Matches drop table partition event |
    | truncate table partition | DDL  |                 | Matches truncate table partition event |
    | create view              | DDL  |                 | Matches create view event |
    | drop view                | DDL  |                 | Matches drop view event |

  • ignore-sql: the DDL statements to be ignored. This parameter accepts an array of strings, in which you can configure multiple regular expressions. This rule only applies to DDL events.
  • ignore-delete-value-expr: this parameter accepts a SQL expression. Delete DML events whose row values satisfy the expression are ignored.
  • ignore-insert-value-expr: this parameter accepts a SQL expression. Insert DML events whose row values satisfy the expression are ignored.
  • ignore-update-old-value-expr: this parameter accepts a SQL expression. Update DML events whose old values satisfy the expression are ignored.
  • ignore-update-new-value-expr: this parameter accepts a SQL expression. Update DML events whose new values satisfy the expression are ignored.
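
The parameters above can be combined across several rules. The following is a minimal sketch, not a recommended configuration: the table `test.audit_log` is hypothetical, `test.worker` and the column `id` are borrowed from the earlier example, and the event type names are taken from the list of supported event types above.

```toml
[filter]
rules = ['test.*']

# Rule 1: for test.worker, ignore drop table and truncate table DDLs,
# and ignore inserts whose row satisfies id >= 100.
[[filter.event-filters]]
matcher = ["test.worker"]
ignore-event = ["drop table", "truncate table"]
ignore-insert-value-expr = "id >= 100"

# Rule 2: for test.audit_log (hypothetical table), ignore all delete DML events.
[[filter.event-filters]]
matcher = ["test.audit_log"]
ignore-event = ["delete"]
```

Keeping one `[[filter.event-filters]]` entry per table, as sketched here, keeps each `matcher` narrow, so a value expression only needs to reference columns that exist in that table.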

Note

  • When TiDB updates a value in a clustered index column, TiDB splits the UPDATE event into a DELETE event and an INSERT event. TiCDC does not identify such events as UPDATE events and therefore cannot filter them out correctly.
  • When you configure a SQL expression, make sure that all tables that match matcher contain all the columns specified in the SQL expression. Otherwise, the replication task cannot be created. In addition, if the table schema changes during replication so that a table no longer contains a required column, the replication task fails and cannot be resumed automatically. In that situation, you must manually modify the configuration and resume the task.