Prerequisites
A Snowflake account granted the `ACCOUNTADMIN` system-defined role, or a custom role with privileges to:
- CREATE WAREHOUSE, DATABASE, SCHEMA
- CREATE ROLE, USER
- CREATE NETWORK POLICY
Snowflake Setup
It’s recommended to create a separate user and role for Streamkap to access your Snowflake database. Below is an example script that does that.

We do not use `CREATE OR REPLACE` in our scripts. This is to avoid destroying something by mistake that already exists in your Snowflake account.
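A minimal sketch of such a script is shown below. The object names (STREAMKAP_WH, STREAMKAP_DB, STREAMKAP_ROLE, STREAMKAP_USER and so on) are placeholders, and the exact grants your environment needs may differ, so treat this as a starting point rather than the definitive script:

```sql
-- Placeholder object names; adjust to suit your naming and security policies
USE ROLE ACCOUNTADMIN;

CREATE WAREHOUSE IF NOT EXISTS STREAMKAP_WH
  WAREHOUSE_SIZE = 'XSMALL'
  AUTO_SUSPEND = 60
  AUTO_RESUME = TRUE
  INITIALLY_SUSPENDED = TRUE;

CREATE DATABASE IF NOT EXISTS STREAMKAP_DB;
CREATE SCHEMA IF NOT EXISTS STREAMKAP_DB.STREAMKAP_SCHEMA;

-- Dedicated role and user for the connector
CREATE ROLE IF NOT EXISTS STREAMKAP_ROLE;
CREATE USER IF NOT EXISTS STREAMKAP_USER
  DEFAULT_WAREHOUSE = STREAMKAP_WH
  DEFAULT_ROLE = STREAMKAP_ROLE;
GRANT ROLE STREAMKAP_ROLE TO USER STREAMKAP_USER;

-- Privileges the connector needs to create and load tables
GRANT USAGE ON WAREHOUSE STREAMKAP_WH TO ROLE STREAMKAP_ROLE;
GRANT USAGE ON DATABASE STREAMKAP_DB TO ROLE STREAMKAP_ROLE;
GRANT USAGE, CREATE TABLE, CREATE DYNAMIC TABLE, CREATE TASK
  ON SCHEMA STREAMKAP_DB.STREAMKAP_SCHEMA TO ROLE STREAMKAP_ROLE;
```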
Key Pair Authentication
The connector relies on an RSA key pair for authentication, which you can generate using OpenSSL. Below are example scripts that do that. You can modify them to suit your security policies, but please ensure the key pair meets these minimum requirements:
- RSA 2048-bit
- PKCS#8 key format
SSH key generation on Windows
Snowflake does not support keys generated by PuTTY Key Generator. One of the easiest and quickest ways to generate a valid key with OpenSSL is via Git Bash, which is installed by default with Git for Windows. After installation, you can open a Git Bash prompt by holding Shift and right-clicking on your Desktop, choosing “Open Git Bash here”, and then executing the OpenSSL commands below. If you have any issues following these instructions or are unable to install Git for Windows, please contact us.
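The commands below are a typical way to generate such a key pair with OpenSSL; the file name streamkap_key is only an example:

```bash
# Generate an encrypted 2048-bit RSA private key in PKCS#8 format (you will be prompted for a passphrase)
openssl genrsa 2048 | openssl pkcs8 -topk8 -v2 aes256 -inform PEM -out streamkap_key.p8

# Derive the matching public key from the private key
openssl rsa -in streamkap_key.p8 -pubout -out streamkap_key.pub
```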
These commands generate two key files: one private (usually with the extension `.p8`) and the other public (usually with the extension `.pub`). Store both files in a secure place.
Once generated, the public key needs to be assigned to the Snowflake database user created for Streamkap earlier.
The commands below will copy the public key you generated to your clipboard.
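For example, assuming the file name used above (use the variant that matches your operating system):

```bash
# macOS
pbcopy < streamkap_key.pub

# Windows (Git Bash or Command Prompt)
clip < streamkap_key.pub
```

You can then paste the key into an `ALTER USER` statement run as `ACCOUNTADMIN`. The user name and key value below are placeholders; exclude the `-----BEGIN/END PUBLIC KEY-----` delimiter lines when pasting:

```sql
ALTER USER STREAMKAP_USER SET RSA_PUBLIC_KEY = 'MIIBIjANBgkqhki...';
```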
Streamkap Setup
Follow these steps to configure your new connector:

1. Create the Destination
- Navigate to Add Connectors.
- Choose Snowflake.
2. Connection Settings
- Name: Enter a name for your connector.
- Snowflake URL: The URL for accessing your Snowflake account. This URL must include your account identifier. Note that the protocol (`https://`) and port number are optional.
- Username: User login name for the Snowflake account (Case sensitive).
- Private Key: Provide the private key you generated; you can copy its contents using the command shown after these connection settings.
- Key secured with passphrase?: If checked (default), provide your SSH key’s passphrase; otherwise, uncheck it for SSH keys without a passphrase.
- Private Key Passphrase: The passphrase is used to decrypt the private key.
- Database Name: The name of the database to use (Case sensitive).
- Schema Name: The name of the schema where tables will be created (Case sensitive).
- Snowflake Role: The name of an existing role with necessary privileges (for Streamkap) assigned to the user specified by Username (Case sensitive).
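For the Private Key field, a command like the one below (file name from the earlier example) copies the full contents of the private key file to your clipboard:

```bash
# macOS
pbcopy < streamkap_key.p8

# Windows (Git Bash or Command Prompt)
clip < streamkap_key.p8
```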
3. Ingestion Settings
- Ingestion Mode: How the Connector loads data into the Snowflake tables. See Upsert mode for further details.
Changing ingestion mode
`append` and `upsert` modes use different, incompatible methods for loading data into the Snowflake tables. If - for whatever reason - you want to change modes for an existing Snowflake Connector, please create a new Snowflake Destination instead, i.e. a separate destination for `append` and another for `upsert`.
- `append` mode:
  - Use Dynamic Tables: Specifies whether the connector should create Dynamic Tables & Cleanup Tasks. See Dynamic Tables.
  - Custom SQL Template - Dynamic Table Creation: These template queries run for each table the first time a record is streamed for them.
  - Custom SQL Template - Dynamic Table Name: Can be used as `{{dynamicTableName}}` in dynamic table creation SQL. It can use input JSON data for more complex mappings and logic.
  - Custom SQL Template - Input JSON data: Use `{"TABLE_DATA": {"{table_name}": {"{key}": "{value}"}, ...}, ...}` to set table-specific data. This data will be available in the custom SQL templates, e.g. `SELECT {{key}}`.
  - Auto QA Deduplication Table Mapping: Mapping between the tables that store append-only data and the deduplicated tables. The dedupeTable in the mapping will be used for QA scripts. If dedupeSchema is not specified, the deduplicated table will be created in the same schema as the raw table.
- `upsert` mode:
  - Delete Mode: Specifies whether the connector processes deletions (or tombstone events) and removes the corresponding row from the database.
  - Use Hybrid Tables: Specifies whether the connector should create Hybrid Tables.
Troubleshooting
Dynamic Tables
Snowflake Dynamic Tables are materialized views that contain the latest records inserted into Snowflake. Streamkap’s Snowflake Connector creates them (if enabled) for each table the first time a record is streamed for it. A Snowflake Task is also created for each dynamic table to clean up older entries periodically. Below is the default template, shown in the Streamkap UI. You can modify it there to suit your requirements.
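The authoritative default template is the one shown in the Streamkap UI; the sketch below only illustrates the general shape of such a dynamic table, with placeholder warehouse, source table, key column (ID) and timestamp column (_STREAMKAP_TS_MS) names:

```sql
-- Illustrative sketch only; see the Streamkap UI for the actual default template
CREATE DYNAMIC TABLE IF NOT EXISTS {{dynamicTableName}}
  TARGET_LAG = '15 minutes'      -- how far the materialized rows may lag the raw table
  WAREHOUSE  = STREAMKAP_WH      -- placeholder warehouse name
AS
SELECT * EXCLUDE rn
FROM (
  SELECT t.*,
         ROW_NUMBER() OVER (PARTITION BY ID ORDER BY _STREAMKAP_TS_MS DESC) AS rn
  FROM STREAMKAP_DB.STREAMKAP_SCHEMA.SOURCE_TABLE t
)
WHERE rn = 1;                    -- keep only the latest record per key
```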
Offset Management (Append Mode)
UI Coming Soon: The Streamkap app will soon expose consumer group and offset management directly in the UI. Currently, offset troubleshooting and resets require assistance from Streamkap support, who can coordinate both Kafka and Snowflake offset resets safely.
Understanding dual offset tracking
Two offset systems work together:
- Kafka Consumer Group Offsets (`connect-<connector-id>`)
  - Tracks which Kafka messages the connector has consumed from each topic partition
  - Managed by the Kafka Connect framework
  - Consumer lag = difference between the latest message in the topic and the last consumed offset
- Snowpipe Streaming Channel Offsets
  - Tracks which messages have been successfully ingested into Snowflake tables
  - Each topic partition creates a Snowflake channel (e.g., `TOPIC_0` for partition 0)
  - Managed within Snowflake using `SYSTEM$SNOWPIPE_STREAMING_UPDATE_CHANNEL_OFFSET_TOKEN`
  - View channels with `SHOW CHANNELS` in your Snowflake schema
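For example (the schema name is a placeholder):

```sql
-- List Snowpipe Streaming channels for the Streamkap schema
SHOW CHANNELS IN SCHEMA STREAMKAP_DB.STREAMKAP_SCHEMA;
```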
Common offset issues
Negative consumer lag scenarios:
- Kafka topic is deleted and recreated, but consumer group retains old offsets
- Consumer group “remembers” offsets from the old topic that no longer exist
- Connector appears “ahead” of the current topic messages
- Snowflake channels may retain offsets that reference deleted/recreated topics
- Channels can become “stuck” with offsets pointing to non-existent messages
- Data ingestion stops even though Kafka consumer group appears healthy
- Connector shows as running but no new data appears in Snowflake
- Negative or unexpectedly high consumer lag
- Channels showing stale offset positions in `SHOW CHANNELS`
Offset reset strategies
When offset resets are needed:
- After recreating Kafka topics
- Negative consumer lag that doesn’t resolve automatically
- Connector stuck and not processing new messages
- Need to reprocess historical data
Reset options:
- To earliest: Reprocess all available messages in the topic
- To latest: Skip to newest messages (ignore historical data)
- To timestamp: Start from a specific point in time
- To offset: Jump to exact message position
Coordination Required: Both Kafka consumer group AND Snowflake channel offsets typically need to be reset together to avoid data gaps or duplicates.
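For reference, the Snowflake side of such a reset uses the `SYSTEM$SNOWPIPE_STREAMING_UPDATE_CHANNEL_OFFSET_TOKEN` function mentioned above. The sketch below uses placeholder table, channel and offset values; any real reset should be coordinated with Streamkap support:

```sql
-- All values are placeholders; do not run a reset while the channel is actively ingesting
SELECT SYSTEM$SNOWPIPE_STREAMING_UPDATE_CHANNEL_OFFSET_TOKEN(
  'STREAMKAP_DB.STREAMKAP_SCHEMA.MY_TABLE',  -- fully qualified target table
  'TOPIC_0',                                 -- channel name (one per topic partition)
  '12345'                                    -- new offset token
);
```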
Impact and best practices
Before resetting offsets, consider:
- Data duplication: Resetting to earlier positions may cause duplicate records in Snowflake
- Processing time: Reprocessing large volumes of historical data takes time
- Snowflake costs: Additional data processing increases Snowpipe Streaming and storage costs
- Coordination complexity: Both Kafka and Snowflake offsets need proper alignment
Best practices:
- Use Dynamic Tables: Automatically handle deduplication from any offset resets
- Reset to latest: For new deployments where historical data isn’t needed
- Coordinate resets: Ensure both Kafka consumer group and Snowflake channel offsets are reset together
- Monitor channels: Regularly check `SHOW CHANNELS` output for channel health
- Test thoroughly: After offset resets, verify data flow and check for duplicates
Upsert mode
The Snowflake destination connector can run in upsert mode. This mode switches off Snowpipe Streaming, and the connector instead uses periodic `MERGE INTO` statements to upsert data into the target Snowflake tables. Dynamic tables or other de-duplication mechanisms are not necessary when using upsert mode.
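As a rough illustration of what an upsert looks like in Snowflake (table, column and staging names below are placeholders, not the SQL the connector actually generates):

```sql
MERGE INTO STREAMKAP_DB.STREAMKAP_SCHEMA.CUSTOMERS AS tgt
USING STREAMKAP_DB.STREAMKAP_SCHEMA.CUSTOMERS_STAGING AS src
  ON tgt.ID = src.ID
WHEN MATCHED AND src.__DELETED = 'true' THEN DELETE       -- honour delete/tombstone events
WHEN MATCHED THEN UPDATE SET tgt.NAME = src.NAME, tgt.EMAIL = src.EMAIL
WHEN NOT MATCHED THEN INSERT (ID, NAME, EMAIL) VALUES (src.ID, src.NAME, src.EMAIL);
```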
Snowflake costs
Currently, upsert mode requires a warehouse to be running, so overall costs will be higher compared to append mode, which uses Snowpipe Streaming.
Getting the Snowflake URL
You can also run the script below in a Snowflake worksheet to return the Snowflake URL. You need to be logged into Snowflake with an account granted the `ORGADMIN` system-defined role to run this script.
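A sketch of such a script (the exact query may differ from the one Streamkap provides):

```sql
USE ROLE ORGADMIN;
-- Returns one row per account in the organization; the account_url column is the Snowflake URL
SHOW ORGANIZATION ACCOUNTS;
```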
Snowflake Setup scripts failing
There can be many reasons for them to fail, but the scripts below can help you diagnose the issues. You need to be logged into Snowflake with an account granted the `ACCOUNTADMIN` system-defined role, or a custom role with equivalent privileges, to run these scripts.
Copy and paste the scripts below into Snowflake worksheets. Change the object names at the top as required and run all the queries.
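A sketch of the kind of checks those scripts run (object names are placeholders; each statement should return details for the corresponding object):

```sql
USE ROLE ACCOUNTADMIN;

-- Confirm each object exists and is visible to your role
DESC WAREHOUSE STREAMKAP_WH;
DESC DATABASE STREAMKAP_DB;
DESC SCHEMA STREAMKAP_DB.STREAMKAP_SCHEMA;
DESC USER STREAMKAP_USER;

-- Confirm the role exists, is granted to the user, and carries the expected privileges
SHOW GRANTS TO USER STREAMKAP_USER;
SHOW GRANTS TO ROLE STREAMKAP_ROLE;
```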
- Check in the top right corner of the Snowflake Worksheet (next to the Share and Run buttons) that the role is set to `ACCOUNTADMIN`, or a custom role with equivalent privileges
- Depending on which query failed or returned no results, check that the object names at the top of the script are correct
- If a query returns an "Object does not exist or is not authorized" error, go to the Snowsight UI Admin page and see if the object is showing there. For example, if `DESC WAREHOUSE ...` failed, go to the Admin -> Warehouses page and check if the Warehouse is shown on that page