Prerequisites
- MySQL version ≥ 5.7
- MySQL binlog enabled
- A database user with sufficient privileges to configure the database, including enabling binary logging and creating users
MySQL Setup
1. Grant Database Access
- Configure one of the Connection Options to ensure Streamkap can reach your database.
2. Configure Binary Logging
Binary logging records all changes to your database tables. The Connector relies on MySQL’s implementation of this.
- Open the Amazon RDS console at https://console.aws.amazon.com/rds/.
- In the navigation pane, choose Parameter groups.
Default parameter groups can’t be modified. If the DB instance is using a default parameter group, create a new one:
- Choose Create parameter group.
- Enter a Parameter group name and Description.
- For Engine type, choose your database engine.
- For Parameter group family, choose a DB parameter group family.
- Choose Create.
- Select the parameter group to edit.
- Choose Edit from Parameter group actions.
- Set `binlog_format` to `ROW`.
- Set `binlog_row_image` to `Full`.
- Choose Save changes.
- In the navigation pane, choose Databases and select the target DB instance.
- Choose Modify.
- Change the DB parameter group setting.
- Choose Continue and review modifications.
- On the confirmation page, choose Modify DB Instance.
Configuring RDS for MySQL binary logging
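Once the new parameter group is attached to the instance, you can confirm the settings took effect by querying the server directly from any MySQL client. A quick check:

```sql
-- Both values should reflect the parameter group:
-- binlog_format = ROW, binlog_row_image = FULL
SHOW GLOBAL VARIABLES LIKE 'binlog_format';
SHOW GLOBAL VARIABLES LIKE 'binlog_row_image';
```

If either value is unchanged, the instance may still need a reboot for the parameter group change to apply.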
3. Set Binary Log Retention Period
- Connect to your master database with your SQL tool.
- View current settings with `CALL mysql.rds_show_configuration;`
- If the retention period is less than 24 hours or NULL, run `CALL mysql.rds_set_configuration('binlog retention hours', 72);`
4. Verify Binary Logs Are Enabled
You can verify using any of these methods:
- Check the parameter group for the DB instance and confirm the `log_bin` parameter is `ON`.
- Run `SHOW VARIABLES LIKE '%log_bin%';` on the DB instance. The result should be `ON`.
- Run `SHOW BINARY LOGS;`
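The SQL checks from step 4 can be run together in a single client session:

```sql
-- Should report log_bin = ON (plus related variables such as
-- log_bin_basename with the binlog file path).
SHOW VARIABLES LIKE '%log_bin%';

-- Lists the binary log files currently on the server; an error or
-- empty result here means binary logging is not active.
SHOW BINARY LOGS;
```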
5. Create Database User
It’s recommended to create a separate user and role for the Connector to access your MySQL database. Below is an example script that does that.
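A sketch of such a script, assuming the user is named `streamkap_user` (the name Streamkap’s defaults expect) and granting the privileges a MySQL CDC connector typically needs; replace the password placeholder before running:

```sql
-- Create a dedicated user for the Connector.
CREATE USER 'streamkap_user'@'%' IDENTIFIED BY '<password>';

-- Privileges typically required for change data capture:
-- SELECT for snapshots, REPLICATION SLAVE/CLIENT to read the binlog,
-- RELOAD and SHOW DATABASES for consistent snapshot setup.
GRANT SELECT, RELOAD, SHOW DATABASES, REPLICATION SLAVE, REPLICATION CLIENT
  ON *.* TO 'streamkap_user'@'%';

FLUSH PRIVILEGES;
```

Scope the grants more narrowly if your security policy requires it; the Connector only needs `SELECT` on the databases it captures.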
6. Enable Snapshots
To backfill your data, the Connector needs to be able to perform snapshots. See Snapshots & Backfilling for more information. To enable this feature, there are two methods available.

Method 1: Enable GTID (Recommended)
This method is recommended if you cannot create a ‘signal’ table and grant Streamkap read/write privileges on it (Method 2) for any reason. It is the equivalent of a ‘read-only’ connection.
Ensure GTID mode is ON.
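On Amazon RDS, GTID is controlled through the DB parameter group (the same Edit flow as step 2): set `gtid_mode` to `ON` and `enforce_gtid_consistency` to `ON`. You can then verify from a SQL client:

```sql
-- Both should return ON once the parameter group change has applied.
SELECT @@gtid_mode, @@enforce_gtid_consistency;
```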
Method 2: Create a table in the source database
If you cannot enable GTID mode, you will need to create the table and grant permissions on it to the `streamkap_user`. The Connector will use this table for managing snapshots.
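A sketch of the signal table, following the common CDC signal-table convention; the database name `streamkap` and the column sizes are assumptions, so use the database you will later select as the Signal Table Database:

```sql
-- Table the Connector reads and writes to coordinate snapshots.
CREATE TABLE streamkap.streamkap_signal (
  id   VARCHAR(42) PRIMARY KEY,  -- unique id for each signal
  type VARCHAR(32) NOT NULL,     -- signal type, e.g. a snapshot request
  data VARCHAR(2048)             -- signal payload (JSON)
);

-- The Connector needs read/write access to the table.
GRANT SELECT, INSERT, UPDATE, DELETE
  ON streamkap.streamkap_signal TO 'streamkap_user'@'%';
```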
7. Heartbeats
Heartbeats ensure the Connector stays active and continues capturing changes, which is especially important for low-traffic or intermittent databases. Connectors use “offsets” (like bookmarks) to track their position in the database’s log or change stream. When no changes occur for long periods, these offsets may become outdated, and the Connector might lose its place or stop capturing changes. There are two layers of heartbeat protection:

Layer 1: Connector heartbeats (enabled by default)
The Connector periodically emits heartbeat messages to an internal topic, even when no actual data changes are detected. This keeps offsets fresh and prevents staleness. No configuration is necessary for this layer; it is enabled automatically.

Layer 2: Source database heartbeats (requires configuration)
For read-write connections (when Read only is No during Streamkap Setup, if applicable), you can configure the Connector to update a dedicated heartbeat table in the source database at regular intervals. This simulates activity, ensuring change events are generated consistently, maintaining log progress and preventing staleness. It is especially useful for databases like PostgreSQL, where a lack of regular changes can cause the write-ahead log (WAL) to grow rapidly during inactivity. This layer requires you to set up a heartbeat table in your source database. See Configure source database heartbeats.
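A Layer 2 heartbeat table might be sketched as follows; the table name `streamkap_heartbeat` and its shape are assumptions, so follow Configure source database heartbeats for the exact definition Streamkap expects:

```sql
-- Minimal heartbeat table: a single row the Connector updates
-- periodically so that change events keep flowing through the binlog.
CREATE TABLE streamkap.streamkap_heartbeat (
  id        INT PRIMARY KEY,
  last_beat TIMESTAMP NOT NULL DEFAULT CURRENT_TIMESTAMP
                      ON UPDATE CURRENT_TIMESTAMP
);
INSERT INTO streamkap.streamkap_heartbeat (id) VALUES (1);

-- The Connector needs permission to update the table.
GRANT SELECT, INSERT, UPDATE
  ON streamkap.streamkap_heartbeat TO 'streamkap_user'@'%';
```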
Streamkap Setup
Follow these steps to configure your new connector:

1. Create the Source
- Navigate to Add Connectors.
- Choose MySQL.
2. Connection Settings
- Name: Enter a name for your connector.
- Hostname: Specify the hostname.
- Port: Default is `3306`.
- Connect via SSH Tunnel: The Connector will connect to an SSH server in your network which has access to your database. This is necessary if the Connector cannot connect directly to your database.
- See SSH Tunnel for setup instructions.
- Username: Username to access the database. By default, Streamkap scripts use `streamkap_user`.
- Password: Password to access the database.
- Heartbeats: Crucial for low and intermittent traffic databases. Enabled by default.
- See Heartbeats for setup instructions.
- Connection Timezone: The timezone of your database.
Timezone conversion
MySQL converts `TIMESTAMP` values from the current time zone to UTC for storage, and back from UTC to the current time zone for retrieval. By default, the current time zone for each connection is the database server’s time zone, but this option allows you to override that. As long as the time zones remain the same, you get back the same value you store.

We recommend using the default SERVER option, which attempts to detect the session time zone from the MySQL server session variables `time_zone` and `system_time_zone`. It also reduces the chance of problems with daylight saving ‘fall back’ and ‘spring forward’ adjustments.

If either time zone changes, an ad-hoc snapshot is recommended so your source and destination timestamps are consistent.

3. Snapshot Settings
- Use GTID: If your database is using GTID, leave this as ‘Yes’. See Enable GTID for more information.
- If ‘No’, please ensure you create the signal table as described in Method 2.
- Signal Table Database: Streamkap will use a table in this database to manage snapshots. See Enable Snapshots for more information.
4. Advanced Parameters
- Represent binary data as: Specifies how the data for binary columns (e.g. `blob`, `binary`, `varbinary`) should be interpreted. Your destination for this data can impact which option you choose. Default is `bytes`.
- Capture Only Captured Databases DDL: Controls whether the connector records schema structures from all databases on the server (the default) or only those databases for which you’ve explicitly configured the connector. Specify `true` to capture schema history only for the specific databases you’ve configured. This is particularly valuable when databases are large, as it reduces the volume of DDL stored in the schema history topic and improves startup times when the connector restarts or recovers from failures. Default is `false`. See Schema History Optimization for details.
- Capture Only Captured Tables DDL: Controls whether the connector records the schema structure for all tables in the configured databases (the default) or only the tables whose changes the connector captures. Specify `true` to capture schema history only for the specific tables you’ve configured. This is particularly valuable when tables are large, as it reduces the volume of DDL statements stored in the schema history topic and improves startup times when the connector restarts or recovers from failures. Default is `false`. See Schema History Optimization for details.
5. Schema and Table Capture
- Add Schemas/Tables: Specify the schema(s) and table(s) for capture.
- You can bulk upload here. The format is a simple list of schemas and tables, with each entry on a new row. Save as a `.csv` file without a header.
Have questions? See the MySQL Source FAQ for answers to common questions about MySQL sources, troubleshooting, and best practices.