Published Sep 7, 2023

Data Ingestion: How to Sync Data Between Any Source Application and Snowflake

Celigo
Celigo

Integrating data with a data warehouse is crucial for businesses of all sizes. It ensures data quality, enables comprehensive analytics, and streamlines business processes. Recent customer surveys indicate that companies place a high priority on addressing data warehousing and analytics challenges. Our goal is to assist businesses in unlocking and analyzing data stored in data warehouses, preventing data silos. This enhances data quality, improves visibility for reporting, enables operational scalability, and ensures compliance.

Celigo’s template for data warehousing with Snowflake offers a straightforward method for connecting your platform and selecting resources for migration into your data warehouse, facilitating easier data management and analysis.

Integration Marketplace

It starts in the marketplace, where you will find the data warehouse template with Snowflake. You can access the template by quickly creating an account. (Then, you can browse all of the other Snowflake integration templates.)

Selecting data sources

After installing and configuring a connection to your Snowflake account, you can select between the following ecommerce source applications.

Selecting resources for Shopify

In this example, we will set up a connection to Shopify and select all the resources available to sync into our data warehouse in Snowflake.

View all flows

Once the template is installed and all connections are created, they are moved into an integration tile, showing two flows for every connection created. The first flow bulk inserts resources into a staging table in Snowflake, and the second flow merges and deletes the staging table.

Creating a flow group

Now, to make the integration look more organized, we will create a flow group for each connection, in this case, just one category for Shopify.

Overview of staging table flow

For the first flow in the template, all the selected resources correspond to an individual export step.

In this example, we have three of the five resources shown: Customers, Orders, and Abandoned checkouts.

All of these resources are then synced to a staging table in Snowflake.

Snowflake import settings

For the destination table, select the staging table you are using for the Shopify data from the dropdown below.

Snowflake mapping settings

Our mapping shows that the JSON serialized data will be mapped into a data variant column along with the resource name, which would be Customers, Orders, etc., and the primary key and timestamp of the data will be mapped into a column.

Overview of merge & delete flow

The second flow, or the merge & delete flow, will check for existing records with a SQL query, limiting to 1, and merge & clear that data from the staging table.

Changing table settings

To specify different staging and production tables, changes can made in the flow settings.

Snowflake data

After all those changes have been made and the flow is set up to use, we can run the flows and view all of the data in our data warehouse in Snowflake.

Data can be queried with different views as needed; however, all the JSON data is shown in the correct table to be viewed, analyzed, and reported on at any time.

Data is now in a warehouse

Congratulations! You have successfully synced your data warehouse and source application.

For more information on this template, read our Data Warehouse Automation for Snowflake documentationwhich will address any questions, from an overview to the setup and configuration of other applications.

Let’s get started

Integrate, automate, and optimize every process.

Already a user? Log in and install the template now.

Integration insights

Expand your knowledge on all things integration and automation. Explore our resources, learn from experts, and elevate your building expertise.