Skip to main content

Integrating MySQL with ClickHouse

This page covers two options for integrating MySQL with ClickHouse:

  • using the MySQL table engine, for reading from a MySQL table
  • using the MaterializedMySQL database engine, for syncing a database in MySQL with a database in ClickHouse

Connecting ClickHouse to MySQL using the MySQL Table Engine

The MySQL table engine allows you to connect ClickHouse to MySQL. SELECT and INSERT statements can be made in either ClickHouse or in the MySQL table. This article illustrates the basic methods of how to use the MySQL table engine.

1. Configure MySQL

  1. Create a database in MySQL:

    CREATE DATABASE db1;
  2. Create a table:

    CREATE TABLE db1.table1 (
    id INT,
    column1 VARCHAR(255)
    );
  3. Insert sample rows:

    INSERT INTO db1.table1
    (id, column1)
    VALUES
    (1, 'abc'),
    (2, 'def'),
    (3, 'ghi');
  4. Create a user to connect from ClickHouse:

    CREATE USER 'mysql_clickhouse'@'%' IDENTIFIED BY 'Password123!';
  5. Grant privileges as needed. (For demonstration purposes, the mysql_clickhouse user is granted admin prvileges.)

    GRANT ALL PRIVILEGES ON *.* TO 'mysql_clickhouse'@'%';
note

If you are using this feaure in ClickHouse Cloud, you may need the to allow the ClickHouse Cloud IP addresses to access your MySQL instance. Check the ClickHouse Cloud Endpoints API for egress traffic details.

2. Define a Table in ClickHouse

  1. Now let's create a ClickHouse table that uses the MySQL table engine:

    CREATE TABLE mysql_table1 (
    id UInt64,
    column1 String
    )
    ENGINE = MySQL('mysql-host.domain.com','db1','table1','mysql_clickhouse','Password123!')

    The minimum parameters are:

    parameterDescriptionexample
    hosthostname or IPmysql-host.domain.com
    databasemysql database namedb1
    tablemysql table nametable1
    userusername to connect to mysqlmysql_clickhouse
    passwordpassword to connect to mysqlPassword123!
    note

    View the MySQL table engine doc page for a complete list of parameters.

3. Test the Integration

  1. In MySQL, insert a sample row:

    INSERT INTO db1.table1
    (id, column1)
    VALUES
    (4, 'jkl');
  2. Notice the existing rows from the MySQL table are in the ClickHouse table, along with the new row you just added:

    SELECT
    id,
    column1
    FROM mysql_table1

    You should see 4 rows:

    Query id: 6d590083-841e-4e95-8715-ef37d3e95197

    ┌─id─┬─column1─┐
    │ 1 │ abc │
    │ 2 │ def │
    │ 3 │ ghi │
    │ 4 │ jkl │
    └────┴─────────┘

    4 rows in set. Elapsed: 0.044 sec.
  3. Let's add a row to the ClickHouse table:

    INSERT INTO mysql_table1
    (id, column1)
    VALUES
    (5,'mno')
  4. Notice the new row appears in MySQL:

    mysql> select id,column1 from db1.table1;

    You should see the new row:

    +------+---------+
    | id | column1 |
    +------+---------+
    | 1 | abc |
    | 2 | def |
    | 3 | ghi |
    | 4 | jkl |
    | 5 | mno |
    +------+---------+
    5 rows in set (0.01 sec)

Summary

The MySQL table engine allows you to connect ClickHouse to MySQL to exchange data back and forth. For more details, be sure to check out the documentation page for the MySQL table engine.

Replicate a MySQL Database in ClickHouse

note

This page is not applicable to ClickHouse Cloud. The feature documented here is not yet available in ClickHouse Cloud services. See the ClickHouse Cloud Compatibility guide for more information.

The MaterializedMySQL database engine allows you to define a database in ClickHouse that contains all the existing tables in a MySQL database, along with all the data in those tables. On the MySQL side, DDL and DML operations can continue to made and ClickHouse detects the changes and acts as a replica to MySQL database.

This article demonstrates how to configure MySQL and ClickHouse to implement this replication.

1. Configure MySQL

  1. Configure the MySQL database to allow for replication and native authentication. ClickHouse only works with native password authentication. Add the following entries to /etc/my.cnf:

    default-authentication-plugin = mysql_native_password
    gtid-mode = ON
    enforce-gtid-consistency = ON
  2. Create a user to connect from ClickHouse:

    CREATE USER clickhouse_user IDENTIFIED BY 'ClickHouse_123';
  3. Grant the needed permissions to the new user. For demonstration purposes, full admin rights have been granted here:

    GRANT ALL PRIVILEGES ON *.* TO 'clickhouse_user'@'%';
    note

    The minimal permissions needed for the MySQL user are RELOAD, REPLICATION SLAVE, REPLICATION CLIENT and SELECT PRIVILEGE.

  4. Create a database in MySQL:

    CREATE DATABASE db1;
  5. Create a table:

    CREATE TABLE db1.table_1 (
    id INT,
    column1 VARCHAR(10),
    PRIMARY KEY (`id`)
    ) ENGINE = InnoDB;
  6. Insert a few sample rows:

    INSERT INTO db1.table_1
    (id, column1)
    VALUES
    (1, 'abc'),
    (2, 'def'),
    (3, 'ghi');

2. Configure ClickHouse

  1. Set parameter to allow use of experimental feature:

    set allow_experimental_database_materialized_mysql = 1;
  2. Create a database that uses the MaterializedMySQL database engine:

    CREATE DATABASE db1_mysql
    ENGINE = MaterializedMySQL(
    'mysql-host.domain.com:3306',
    'db1',
    'clickhouse_user',
    'ClickHouse_123'
    );

    The minimum parameters are:

    parameterDescriptionexample
    host:porthostname or IP and portmysql-host.domain.com
    databasemysql database namedb1
    userusername to connect to mysqlclickhouse_user
    passwordpassword to connect to mysqlClickHouse_123
    note

    View the MaterializedMySQL database engine doc page for a complete list of parameters.

3. Test the Integration

  1. In MySQL, insert a sample row:

    INSERT INTO db1.table_1
    (id, column1)
    VALUES
    (4, 'jkl');
  2. Notice the new row appears in the ClickHouse table:

    SELECT
    id,
    column1
    FROM db1_mysql.table_1

    The response looks like:

    Query id: d61a5840-63ca-4a3d-8fac-c93235985654

    ┌─id─┬─column1─┐
    │ 1 │ abc │
    └────┴─────────┘
    ┌─id─┬─column1─┐
    │ 4 │ jkl │
    └────┴─────────┘
    ┌─id─┬─column1─┐
    │ 2 │ def │
    └────┴─────────┘
    ┌─id─┬─column1─┐
    │ 3 │ ghi │
    └────┴─────────┘

    4 rows in set. Elapsed: 0.030 sec.
  3. Suppose the table in MySQL is modified. Let's a column to db1.table_1 in MySQL:

    alter table db1.table_1 add column column2 varchar(10) after column1;
  4. Now let's insert a row to the modified table:

    INSERT INTO db1.table_1
    (id, column1, column2)
    VALUES
    (5, 'mno', 'pqr');
  5. Notice the table in ClickHouse now has the new column and the new row:

    SELECT
    id,
    column1,
    column2
    FROM db1_mysql.table_1

    The previous rows will have NULL for column2:

    Query id: 2c32fd15-3c83-480b-9bfc-cba5d932d674

    Connecting to localhost:9000 as user default.
    Connected to ClickHouse server version 22.2.2 revision 54455.

    ┌─id─┬─column1─┬─column2─┐
    │ 3 │ ghi │ ᴺᵁᴸᴸ │
    └────┴─────────┴─────────┘
    ┌─id─┬─column1─┬─column2─┐
    │ 2 │ def │ ᴺᵁᴸᴸ │
    └────┴─────────┴─────────┘
    ┌─id─┬─column1─┬─column2─┐
    │ 1 │ abc │ ᴺᵁᴸᴸ │
    │ 5 │ mno │ pqr │
    └────┴─────────┴─────────┘
    ┌─id─┬─column1─┬─column2─┐
    │ 4 │ jkl │ ᴺᵁᴸᴸ │
    └────┴─────────┴─────────┘

    5 rows in set. Elapsed: 0.017 sec.

Summary

That's it! The MaterializedMySQL database engine will keep the MySQL database synced on ClickHouse. There are a few details and limitations, so be sure to read the doc page for MaterializedMySQL for more details.