Prerequisites
- MongoDB version ≥ 5.0
- A MongoDB user with sufficient privileges to create database users and collections
MongoDB Setup
1. Grant Database Access
- Configure one of the Connection Options to ensure Streamkap can reach your database.
2. Create Database User
MongoDB Shell
- Using MongoDB Shell, connect to your primary node or replica set.
- Create a user for Streamkap using the script below. Replace the password with one of your choice.
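As a minimal sketch of what such a script might look like in mongosh: the user name streamkap_user comes from this guide, while the roles shown (readAnyDatabase, plus read on local) and the choice of the admin database are assumptions for illustration and should be checked against your own security requirements. The helper name createStreamkapUser is hypothetical.

```javascript
// Sketch only: creates the Streamkap user in the `admin` database.
// The roles below are assumptions, not a confirmed requirement.
function createStreamkapUser(database, password) {
  const adminDb = database.getSiblingDB("admin");
  return adminDb.createUser({
    user: "streamkap_user",
    pwd: password,
    roles: [
      { role: "readAnyDatabase", db: "admin" }, // assumed: read access to captured databases
      { role: "read", db: "local" },            // assumed: read access to the local database
    ],
  });
}

// In mongosh, after connecting as an admin user:
//   createStreamkapUser(db, "<your password>")
```

Wrapping the call in a function keeps the password out of the script body, so it can be supplied at call time.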
3. Enable Snapshots
To backfill your data, the Connector needs to be able to perform snapshots. See Snapshots & Backfilling for more information. You will need to create a collection and grant the necessary permissions on it to the streamkap_user. The Connector will use this collection for managing snapshots. Below is an example script that does that.
This collection can exist in a different database (on the same MongoDB cluster) from the database Streamkap captures data from.
MongoDB Shell
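A rough sketch of the snapshot setup described above, written for mongosh. The database name streamkap and the collection name streamkap_signal are assumptions chosen for illustration, as is the readWrite role; the sketch also assumes streamkap_user was created in the admin database. The helper name enableSnapshots is hypothetical.

```javascript
// Sketch only: creates the signal collection and lets streamkap_user write to it.
function enableSnapshots(database, signalDbName, signalCollection) {
  const signalDb = database.getSiblingDB(signalDbName);
  signalDb.createCollection(signalCollection);

  // Assumes the user lives in `admin`, so the grant is issued from there.
  database.getSiblingDB("admin").grantRolesToUser("streamkap_user", [
    { role: "readWrite", db: signalDbName }, // assumed role
  ]);

  return `${signalDbName}.${signalCollection}`;
}

// In mongosh:
//   enableSnapshots(db, "streamkap", "streamkap_signal")
```

Whichever database name you choose here is the value to enter later as the Signal Table Database.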
4. Heartbeats
MongoDB uses change streams to track changes. While change streams use resume tokens to track position, these tokens can expire or become invalidated, particularly on clusters with high write activity or when using custom aggregation pipelines that filter events. Heartbeats ensure the Connector receives regular change events, keeping resume tokens fresh and providing liveness monitoring. There are two layers of heartbeat protection:
Layer 1: Connector heartbeats (enabled by default)
The Connector periodically emits heartbeat messages to an internal topic, even when no actual data changes are detected. This keeps offsets fresh and prevents staleness. No configuration is necessary for this layer; it is automatically enabled. We recommend keeping this layer enabled for all deployments.
Layer 2: Source database heartbeats (recommended)
Why we recommend configuring Layer 2
Layer 2 is especially important when:
- Your database has low or intermittent traffic
- You use custom aggregation pipelines that filter out many events
- You need reliable liveness monitoring
Create the heartbeat collection
Connect to your MongoDB instance and create the heartbeat collection:
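One possible way to create the heartbeat collection from mongosh is sketched below. The namespace streamkap.streamkap_heartbeat matches the example used later in this guide; the seed document shape (an _id of 1 and a last_update timestamp) and the helper name createHeartbeatCollection are assumptions for illustration.

```javascript
// Sketch only: creates the heartbeat collection and seeds one document.
function createHeartbeatCollection(database) {
  const heartbeatDb = database.getSiblingDB("streamkap");
  heartbeatDb.createCollection("streamkap_heartbeat");

  // A single document is enough; the heartbeat script updates it in place.
  return heartbeatDb
    .getCollection("streamkap_heartbeat")
    .insertOne({ _id: 1, last_update: new Date() }); // assumed document shape
}

// In mongosh:
//   createHeartbeatCollection(db)
```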
Create a heartbeat script
Create a script that updates the heartbeat document:
Make the script executable:
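As a hedged illustration, the heartbeat update could be written as a mongosh JavaScript file rather than a shell wrapper. The sketch below assumes the streamkap.streamkap_heartbeat namespace and a document with an _id of 1 and a last_update field; the function name sendHeartbeat and the file name heartbeat.js are hypothetical.

```javascript
// Sketch only: touches the heartbeat document so a change event is produced.
function sendHeartbeat(database, now = new Date()) {
  return database
    .getSiblingDB("streamkap")
    .getCollection("streamkap_heartbeat")
    .updateOne(
      { _id: 1 },                        // assumed heartbeat document
      { $set: { last_update: now } },    // a new timestamp produces a change event
      { upsert: true }                   // create the document if it is missing
    );
}

// `db` is only defined inside mongosh, so the call is skipped elsewhere.
if (typeof db !== "undefined") {
  sendHeartbeat(db);
}
```

Saved as heartbeat.js, a file like this could be run on a schedule (for example from cron) with mongosh "<connection string>" --file heartbeat.js. The interval is up to you; it only needs to be frequent enough to keep change events flowing on a quiet database.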
5. Obtain Connection String
You’ll need the connection string for setting up the Connector in Streamkap.
MongoDB Shell
- Connect to your replica set or primary node using the MongoDB shell as an Admin user.
- Run the db.getMongo() method to return your connection string.
- We recommend the connection string have the following parameters. They will be added automatically if not included:
  - w=majority
  - readPreference=primaryPreferred
For information on accepted connection string formats, please see MongoDB - Connection String Formats
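To make the recommended parameters concrete, the sketch below shows what a connection string looks like once w=majority and readPreference=primaryPreferred are present. It is an illustration only, not Streamkap's own logic; the function name withRecommendedParams and the example hosts are hypothetical.

```javascript
// Sketch only: appends the recommended options when they are not already set.
function withRecommendedParams(connectionString) {
  const defaults = [
    ["w", "majority"],
    ["readPreference", "primaryPreferred"],
  ];

  const queryStart = connectionString.indexOf("?");
  let base = queryStart === -1 ? connectionString : connectionString.slice(0, queryStart);
  const query = queryStart === -1 ? "" : connectionString.slice(queryStart + 1);

  // Option names are compared case-insensitively.
  const present = new Set(
    query.split("&").filter(Boolean).map((pair) => pair.split("=")[0].toLowerCase())
  );
  const missing = defaults
    .filter(([key]) => !present.has(key.toLowerCase()))
    .map(([key, value]) => `${key}=${value}`);
  if (missing.length === 0) return connectionString;

  // Options follow a "/" after the host list, so add one if it is absent.
  const schemeEnd = base.indexOf("://") + 3;
  if (!base.slice(schemeEnd).includes("/")) base += "/";

  return `${base}?${[query, ...missing].filter(Boolean).join("&")}`;
}

console.log(withRecommendedParams("mongodb://host1:27017,host2:27017/admin?replicaSet=rs0"));
```

Options you have already set are left untouched, so an explicit w or readPreference value in your string is preserved by this sketch.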
Streamkap Setup
Follow these steps to configure your new connector:
1. Create the Source
- Navigate to Add Connectors.
- Choose MongoDB.
2. Connection Settings
- Name: Enter a name for your connector
- Connection String: Copy the connection string from the earlier steps, but replace the username and password in the string with the credentials of the user you created earlier.
- Array Encoding: Specify how Streamkap should encode MongoDB array types.
  - Array encodes them as a JSON array but requires all elements in an array to be of the same type, e.g. an array of integers.
  - Array_String encodes them as a JSON string and must be used if the MongoDB arrays have mixed types.
- Nested Document Encoding: Specify how Streamkap should encode nested documents.
  - Document encodes them as JSON objects but may be problematic for complex documents (e.g. multiple levels of nested sub documents and arrays, or sub arrays of nested documents).
  - String encodes them as a JSON string and we recommend it if the MongoDB nested documents are complex.
- Connect via SSH Tunnel: The Connector will connect to an SSH server in your network which has access to your database. This is necessary if the Connector cannot connect directly to your database.
  - See SSH Tunnel for setup instructions.
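When choosing an Array Encoding, it can help to check whether sampled arrays are uniformly typed. The sketch below is a rough heuristic in plain JavaScript, not part of Streamkap; the function names are hypothetical, and it cannot distinguish BSON numeric types (for example int versus double), which all appear as "number" here.

```javascript
// Sketch only: coarse type label for a sampled array element.
function roughType(value) {
  if (value === null) return "null";
  if (Array.isArray(value)) return "array";
  if (value instanceof Date) return "date";
  return typeof value;
}

// Suggests "Array" when every sampled array holds a single element type,
// and "Array_String" as soon as any array mixes types.
function suggestArrayEncoding(sampledArrays) {
  const allUniform = sampledArrays.every(
    (arr) => new Set(arr.map(roughType)).size <= 1
  );
  return allUniform ? "Array" : "Array_String";
}

console.log(suggestArrayEncoding([[1, 2, 3], ["a", "b"]]));
console.log(suggestArrayEncoding([[1, "two", { three: 3 }]]));
```

A sample that looks uniform does not guarantee that every document in the collection is, so Array_String remains the safer option when in doubt.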
3. Snapshot Settings
- Signal Table Database: Streamkap will use a collection in this database to manage snapshots, e.g. public. See Enable Snapshots for more information.
4. Database and Collection Capture
- Add Database/Collections: Specify the database(s) and collection(s) for capture.
- You can bulk upload here. The format is a simple list of databases and collections, with each entry on a new row. Save as a .csv file without a header.
- If you configured Layer 2 heartbeats, include the heartbeat collection (e.g., streamkap.streamkap_heartbeat). See Heartbeats for setup instructions.
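For the bulk upload, a headerless file with one entry per row could be generated as sketched below. The database.collection row format is an assumption based on the streamkap.streamkap_heartbeat example above, so confirm the expected format in the upload dialog. The function name toCaptureCsv and the sales.orders entry are hypothetical.

```javascript
// Sketch only: builds headerless CSV content, one "database.collection" per row.
function toCaptureCsv(entries) {
  return entries
    .map(({ database, collection }) => `${database}.${collection}`)
    .join("\n") + "\n";
}

const csv = toCaptureCsv([
  { database: "sales", collection: "orders" },                   // hypothetical capture target
  { database: "streamkap", collection: "streamkap_heartbeat" },  // Layer 2 heartbeat collection
]);
console.log(csv);
```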