- 2 Minutes to read
-
DarkLight
Data Blending
- 2 Minutes to read
-
DarkLight
Merge two data sources to load to one destination.
Create a New Data Flow
-
Sign in to your Dataddo account and click on Flows at the top of the page.
-
Click on Create Flow in the top right corner.
-
Click on Add Source to connect it to the destination. You can type the connector's name into the search bar to find it faster.
-
Click on Blend Sources.
-
Click on Select Source to choose the sources you wish to blend.
-
Choose a Join Key that is the same for both sources. Dataddo will blend the datasets based on this particular key.
You can select other columns for each source by dragging the fields from the list on the left or right. The columns do not need to be the same for each source.
-
Click on the button between the sources to configure the joining type. By default, Left Join is selected.
- Left Join - records from the right source will be joined to the left dataset.
- Inner Join - returns records which values are only matching in both column sources.
-
Click on Save Source.
-
Once the data is configured for blending, you can Add Destination.
- Set up the destination, choosing the write mode and other necessary settings. You can also name the flow at the top.
- NOTE: Remember to set the time for the flow with a little delay after the synchronization of your source to let the data load.
- By clicking on Data Preview, you can check the data that will be sent to the destination. The preview has a limit of approximately 100 rows. Click on Save Flow.
If changes are made in the flow that would affect the database schema (field name, add/delete column(s), and changing a field data type), go to the database and delete the previously created table. Then, save the changes and refresh.
Connecting and Configuring a Data Flow
Dataddo will automatically configure BI app destinations. But, for storages such as warehouses, a manual configuration is needed. For more information on how to connect and configure a data flow to a warehouse, check our detailed guide on creating data flows.
FAQ
1. I get “The column name ‘___’ is specified more than once” error message when I try to blend data. How can I fix this?
This problem can be fixed by renaming the affected column in one of the two sources.
Click on the three dots on one of the sources you want to blend and select Edit. Navigate to the Data Types tab, change the name of the affected column to e.g. propertyid_2 and click on Save.
You should now be able to blend your data without one column being defined more than once.
Feel free to contact us and we will help you with the setup. To speed the process of resolving your issue, make sure you provide us with sufficient information.