Redshift
Amazon Redshift is a fully managed data warehousing service provided by Amazon Web Services (AWS). It's designed to handle large-scale data analytics and complex querying, offering high-performance columnar storage and parallel processing capabilities to efficiently process and analyze large datasets for business insights.
Prerequisites
- You have a running, publicly accessible Redshift instance.
- You have configured Security Groups so that the cluster is accessible from Dataddo IPs.
- You have configured a Redshift user with CREATE, SELECT, UPDATE, INSERT, DELETE permissions.
Authorize the Connection to Redshift
In AWS Portal
Enable Public Accessibility of Redshift Cluster
- Go to the AWS Console and navigate to Redshift Console.
- In the navigation pane, choose Clusters and the name of the cluster that you want to connect to.
- In the cluster's configuration tab, make sure that the Publicly accessible option is set to Yes. If it is not, modify the cluster to make it publicly accessible.
Configure Security Group
Whitelist the Dataddo IP addresses in the inbound rules of the cluster's security group.
- Go to the AWS Console and navigate to Redshift Console.
- In the navigation pane, choose Clusters and the name of the cluster that you want to connect to.
- On the configuration tab for the cluster, find the VPC security groups section and select the attached security group.
- In the Inbound rules section, choose Edit inbound rules.
- For each Dataddo IP, add a Custom TCP Rule with the IP as the source in CIDR notation (e.g. 52.17.68.150/32) and the cluster's port as the port. The default port value for Redshift is 5439.
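The CIDR form of a single address can be derived mechanically. The following is a minimal Python sketch, assuming you hold the Dataddo IPs as plain strings. The helper names and the dictionary keys are illustrative only and do not correspond to any AWS schema; the address is the example from the step above.

```python
import ipaddress

REDSHIFT_DEFAULT_PORT = 5439  # default Redshift port, as stated above


def to_single_host_cidr(ip: str) -> str:
    """Return the CIDR block that covers exactly one address."""
    address = ipaddress.ip_address(ip)  # raises ValueError on malformed input
    # max_prefixlen is 32 for IPv4, so the block contains a single host.
    network = ipaddress.ip_network(f"{address}/{address.max_prefixlen}")
    return str(network)


def inbound_rule(ip: str, port: int = REDSHIFT_DEFAULT_PORT) -> dict:
    """Describe one inbound rule (hypothetical keys, for illustration only)."""
    return {"protocol": "tcp", "port": port, "source": to_single_host_cidr(ip)}


print(inbound_rule("52.17.68.150"))
```

Validating the address first means a typo in an IP is caught before it is pasted into the security group.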
In Redshift Shell
Create a Redshift user
- Log into Redshift shell.
- Run a command to create a new user.
CREATE USER dataddo PASSWORD '***';
Set permissions
- Log into Redshift shell.
- Grant the CREATE, SELECT, UPDATE, INSERT, DELETE permissions for the schema you plan to use (the example below presumes the public schema is used).
-- CREATE is a schema-level privilege, so it is granted on the schema rather than on tables.
GRANT USAGE, CREATE ON SCHEMA public TO dataddo;
GRANT SELECT, INSERT, UPDATE, DELETE ON ALL TABLES IN SCHEMA public TO dataddo;
ALTER DEFAULT PRIVILEGES IN SCHEMA public GRANT SELECT, INSERT, UPDATE, DELETE ON TABLES TO dataddo;
In Dataddo
- In the Authorizers tab, click on Authorize New Service and select Redshift.
- You will be asked to fill in the following fields:
- Server IP or Hostname: Public IP or hostname of your Redshift cluster.
- Database: Name of the database you will use for writing or reading the data.
- Username: Username as configured in the Create a Redshift user step.
- Password: Password as configured in the Create a Redshift user step.
- Port: Port to connect to Redshift. The default value is 5439.
- [Optional] Configure the connection via SSH tunnel. To do so, please follow these steps.
- Click on Save.
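Before entering these values in Dataddo, it can help to assemble them into a single connection string and try them with a PostgreSQL-compatible client. The sketch below only builds the string; it assumes such a client (for example psql) is what you would use to test the credentials. The function name, hostname, database name, and password are hypothetical.

```python
from urllib.parse import quote


def redshift_uri(host: str, database: str, username: str, password: str,
                 port: int = 5439) -> str:
    """Build a postgresql:// URI from the fields listed above."""
    # Percent-encode values so characters such as '@' or '/' in a password
    # cannot be mistaken for URI delimiters.
    user = quote(username, safe="")
    secret = quote(password, safe="")
    db = quote(database, safe="")
    return f"postgresql://{user}:{secret}@{host}:{port}/{db}"


# Hypothetical values for illustration only.
print(redshift_uri(
    host="example-cluster.abc123.eu-west-1.redshift.amazonaws.com",
    database="dev",
    username="dataddo",
    password="p@ss/word",
))
```

If the client can connect with this string from outside your network, the same host, port, database, and credentials should be the ones to enter in the form.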
Create a New Redshift Destination
- Under the Destinations tab, click on the Create Destination button and select the destination from the list.
- Select your account from the drop-down menu.
- Name your destination and click on Save to create your destination.
Need to authorize another connection?
Click on Add new Account in the drop-down menu during authorizer selection and follow the on-screen prompts. You can also go to the Authorizers tab and click on Add New Service.
Create a Flow to Redshift
- Navigate to Flows and click on Create Flow.
- Click on Connect Your Data to add your sources.
- Click on Connect Your Data Destination to add the destination.
- Choose the Write mode and fill in the other required information.
- Check the Data Preview to see if your configuration is correct.
- Name your flow, and click on Create Flow to finish the setup.
Table Naming Convention
When naming your table, please make sure the table name:
- Contains only
- Letters
- Numbers
- Underscores
- Does NOT contain whitespace.
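The naming rule above can be checked before a flow is created. This is a minimal Python sketch, assuming "letters" means the ASCII letters A-Z and a-z (the article does not say whether other alphabets are accepted); the function name is illustrative.

```python
import re

# One or more ASCII letters, digits, or underscores, and nothing else.
_TABLE_NAME = re.compile(r"[A-Za-z0-9_]+")


def is_valid_table_name(name: str) -> bool:
    """Return True if the name uses only letters, numbers, and underscores."""
    return _TABLE_NAME.fullmatch(name) is not None


for candidate in ("orders_2024", "my table", "sales-data"):
    print(candidate, is_valid_table_name(candidate))
```

Because whitespace is not in the allowed character set, the "no whitespace" rule is covered by the same pattern.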