Redshift Integration

Connecting Redshift to Correlated

Correlated reads data from Redshift to populate Accounts and Users in the product. How your tables are structured significantly affects database performance, load, and ultimately cost. Please consult your Correlated team member if you have questions about how to set up your data.

You can skip ahead to the section that is most relevant to you based on your setup:
  • Connecting Redshift to Correlated
  • Configuring Your Integration
  • Setting up a Redshift Datashare

IP Address

Before you get started, you will likely need to modify your network's firewall allowlist in order to connect Redshift to Correlated. Please allow access from Correlated's single, static IP address:

  • 34.122.97.66
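
If your allowlist is maintained as a set of CIDR ranges, the single address above corresponds to a /32 entry. The following is a minimal sketch using Python's standard ipaddress module to sanity-check an allowlist before attempting to connect. The ALLOWLIST contents and the helper name is_allowed are illustrative assumptions, not part of Correlated or AWS tooling.

```python
import ipaddress

# Correlated's static IP from this page, expressed as a single-host CIDR range.
CORRELATED_CIDR = ipaddress.ip_network("34.122.97.66/32")

# Hypothetical allowlist; replace with the ranges your firewall actually uses.
ALLOWLIST = [ipaddress.ip_network("10.0.0.0/8"), CORRELATED_CIDR]


def is_allowed(address: str) -> bool:
    """Return True if the address falls inside any allowlisted range."""
    ip = ipaddress.ip_address(address)
    return any(ip in network for network in ALLOWLIST)


if __name__ == "__main__":
    # Check Correlated's address and a neighbouring address that is not listed.
    print(is_allowed("34.122.97.66"))
    print(is_allowed("34.122.97.67"))
```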

Connecting Redshift to Correlated

  1. Log into the Correlated application
  2. Click on Integrations in the left navigation
  3. Go to your Redshift integration and click "Connect"
  4. You'll be prompted to select your Region and enter your AWS Account ID.
  5. Choose the type of connection you'd like to use. We recommend the "Direct" connection rather than a Datashare if possible.

For more information on how to set up a datashare, see "Setting up a Redshift Datashare" below.

Configuring Your Integration

  1. Once Redshift is successfully connected (the integration card will be green), you can click "Edit" on the Redshift integration to link your data in Redshift to Correlated
  2. You'll see three tabs at the top of the page: Connection, Connected Tables, and Dimensions
  • Connection summarizes which dataset is connected, the date and time it was last updated, and the available tables
  • Connected Tables is where you add the specific tables you'd like to connect to Correlated. This step is required because you need to tell Correlated how to interpret your table schema. Note: you can always come back and add more tables later
  • Dimensions allows you to view all of the Dimensions that Correlated has access to through this integration

Connecting Tables to Correlated

Now that you can see the available tables from your Redshift connection, it's time to connect tables to Correlated.

  1. Navigate to the "Connected Tables" tab
  2. Click "Connect Table"
  3. Select the table you are trying to connect
  4. Select a table type (Account or User)
  5. In the User ID dropdown, select the User ID that uniquely identifies a user. In the Account ID dropdown, select the Account ID that uniquely identifies an account.
    Note: You must use the same Account ID and User ID as you do in other data sources so that you have one single, global identifier. Failing to do so will result in duplicate Accounts and Users.
  6. The timestamp column is only needed if you are providing a "versioned" data source. Versioned data sources are updated incrementally, whereas un-versioned data sources are only updated daily. This column MUST record when the row was written into Redshift or populated in a view. It cannot be a timestamp from a downstream system.
  7. Finally, click "Add Table" and you will see it show up in the "Connected Tables" tab
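
To see why the timestamp column for a versioned source has to be the write time, consider a generic watermark-based incremental read. This is only a sketch of one common approach to incremental syncing. Correlated's actual sync logic is not described on this page, and the column names written_at and source_updated_at, the sample rows, and the helper names are hypothetical.

```python
from datetime import datetime
from typing import Any, Dict, List, Tuple

Row = Dict[str, Any]


def rows_since(rows: List[Row], watermark: datetime, ts_column: str) -> Tuple[List[Row], datetime]:
    """Return rows whose timestamp is newer than the watermark, plus the new watermark."""
    fresh = [r for r in rows if r[ts_column] > watermark]
    new_watermark = max((r[ts_column] for r in fresh), default=watermark)
    return fresh, new_watermark


# Hypothetical rows: ROW_B lands in the table after ROW_A (later written_at),
# but its downstream-system timestamp (source_updated_at) is earlier.
ROW_A = {
    "account_id": "a1",
    "written_at": datetime(2024, 1, 1, 10, 0),
    "source_updated_at": datetime(2024, 1, 1, 9, 45),
}
ROW_B = {
    "account_id": "a2",
    "written_at": datetime(2024, 1, 1, 10, 5),
    "source_updated_at": datetime(2024, 1, 1, 9, 30),
}


def simulate(ts_column: str) -> List[str]:
    """Run two syncs and return the account IDs picked up by the second one."""
    start = datetime(2024, 1, 1, 0, 0)
    # First sync: only ROW_A exists in the table.
    _, watermark = rows_since([ROW_A], start, ts_column)
    # Second sync: ROW_B has since been written.
    fresh, _ = rows_since([ROW_A, ROW_B], watermark, ts_column)
    return [r["account_id"] for r in fresh]


if __name__ == "__main__":
    print(simulate("written_at"))
    print(simulate("source_updated_at"))
```

In this sketch, keying the sync on the write time picks up the late-arriving row on the second pass, while keying it on the downstream timestamp silently skips it because that timestamp is older than the watermark.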

Adding Columns into Correlated as Dimensions

Any column from your connected tables can be added to Correlated as a dimension. You can then use these dimensions to filter Accounts and Users and to build Signals.

  1. To add dimensions, navigate to the "Connected Tables" tab and click the gear icon next to the table you want to import from
  2. You'll see a modal that lists all of your available columns. Select the columns you want to import
  3. You can either add the dimensions immediately or choose to customize the dimension names and descriptions.
    Note: Once you've added a column as a dimension, it will no longer be available as an option in the modal
  4. Lastly, navigate to the "Dimensions" tab to validate that your dimensions have been successfully created

📘

You may observe some time lag between adding a dimension and it being available in Correlated

After you add a dimension, Correlated will query Redshift to populate your Accounts and Users. This usually takes several minutes, so a delay is the most likely reason if values do not appear immediately. Values will populate over time as we finish querying your Redshift instance.

Setting up a Redshift Datashare

Correlated can also connect with Redshift's data sharing functionality to access your Redshift data. You'll want to identify which databases in Redshift you want to expose and associate datashares with those databases. Datashares are read-only and provide you with easy visibility into the data that you are sharing with Correlated.

You can create a datashare using the following methods:

Once your datashare is ready, repeat the steps in "Connecting Redshift to Correlated" above, this time choosing the "Datashare" option when prompted.
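
As an illustration of what creating a datashare can look like in SQL, the sketch below builds the Redshift statements (CREATE DATASHARE, ALTER DATASHARE ... ADD SCHEMA / ADD TABLE, and GRANT USAGE ON DATASHARE ... TO ACCOUNT) as strings so they can be reviewed before being run by a Redshift administrator. The share name, schema, table names, and the 12-digit consumer account ID are placeholders assumed for the example. The real account ID to grant to must come from Correlated, and the helper names are hypothetical.

```python
import re
from typing import List

# Conservative patterns so that only plain identifiers end up in the SQL text.
_IDENTIFIER = re.compile(r"[a-z_][a-z0-9_]*")
_ACCOUNT_ID = re.compile(r"\d{12}")


def _check_identifier(name: str) -> str:
    """Reject anything that is not a simple lowercase SQL identifier."""
    if not _IDENTIFIER.fullmatch(name):
        raise ValueError(f"unsafe identifier: {name!r}")
    return name


def datashare_statements(share: str, schema: str, tables: List[str], consumer_account: str) -> List[str]:
    """Build the SQL needed to create a datashare and grant it to another AWS account."""
    share = _check_identifier(share)
    schema = _check_identifier(schema)
    if not _ACCOUNT_ID.fullmatch(consumer_account):
        raise ValueError("AWS account IDs are 12 digits")
    statements = [
        f"CREATE DATASHARE {share};",
        f"ALTER DATASHARE {share} ADD SCHEMA {schema};",
    ]
    for table in tables:
        statements.append(f"ALTER DATASHARE {share} ADD TABLE {schema}.{_check_identifier(table)};")
    statements.append(f"GRANT USAGE ON DATASHARE {share} TO ACCOUNT '{consumer_account}';")
    return statements


if __name__ == "__main__":
    # Placeholder values only; "123456789012" is not Correlated's account ID.
    for statement in datashare_statements("correlated_share", "public", ["accounts", "users"], "123456789012"):
        print(statement)
```

Sharing only the schemas and tables Correlated needs keeps the datashare easy to audit, which fits the read-only, limited-visibility model described above.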

🚧

Your datashare needs to be approved by Correlated

Once you've connected Redshift, you'll notice that your Redshift setup modal is disabled and the connection is processing (the integration card will be yellow). This is because Correlated needs to accept your connection and confirm that the connection is valid. You'll typically see this updated within 24 hours of creating a connection. Please contact us if you are still seeing a processing status after 24 hours.