The Warehouse

BLOG & REsources

Oct
12
Data Modeling in the Cloud Era

We have all seen how more and more companies are moving to the cloud for their data management platforms. Snowflake, Azure Synapse, AWS Redshift, and Google Big Query are leading this charge towards low-admin, instantly scalable cloud database solutions. Accompanying this is a migration to cloud-hosted data integration and low-code ETL solutions like Matillion and Fivetran. It is tempting to assume that with all these low-overhead data management platforms the concept of data modeling may be a thing of the past, relegated to the pile of on-premise databases that this brave new world is supplanting.


In reality, data modeling is more important than ever. A key to understanding this importance i...


Sep
30
Easily Connect to Any API Source From Matillion ELT

If you are like many ETL developers you’ve struggled with an easy way to source cloud services data via REST API. Although standards are in place for REST API web services protocols, it seems that every vendor has their own variation of them, creating new challenges for each new source. Matillion’s cloud ELT product has long featured an API profile creator that sources from JSON files and creates RSD (Real Simple Discovery, an XML format) scripts for use with API query components. The effectiveness of this approach, however, is only as good as the quality of JSON files provided by the vendor.


Now, with version 1.47, Matillion introduces much more simplified functionality for extra...


Sep
27
Data Technology Needs a Data Culture


We have all heard the notion that data should be viewed as an asset, the “new oil”, a crucial business resource for the enterprise. When viewed in this context, a business’s investment in data must be assessed by the quality of its return. Measuring this return is more than just “how many cool-looking dashboards” you can now create; rather, a CTO/CDO must look beyond the technical capabilities and determine if data is having a true impact on the business culture. This means evolving a BI and Data Science community that engages employees at all levels of the organization. The path to this positive return is creating a modern data culture that permeates the enter...


Feb
25
Snowflake Data Cloud and the future of SAP BW

Enterprises running SAP Business Warehouse (SAP BW, BW/4HANA and BW on HANA) are keeping a close eye on challengers like Snowflake. Is cloud data warehousing the answer to all the challenges for organisations with a large SAP footprint? And how does a cloud data warehouse fit into the data architecture? Should it replace SAP BW or is there still value in SAP BW?


Why do most enterprises with a large SAP footprint run SAP BW?


The dominant position of SAP BW is easy to explain from a historic perspective, but it would not do justice to SAP BW to ignore its current strengths as well. Let us start with the latter. SAP BW is still the only data warehouse platform which delivers all data warehouse f...


Jan
28
Hey You! Get Onto My Cloud! Rise of the Data Cloud Pt 1

In 2015, I was fortunate enough to lead the sales development effort of a cloud-based supply chain visibility product. The product was geared toward the manufacturing sector to allow a more transparent and collaborative platform to enable information sharing. We leveraged the Salesforce cloud to build the application, and it was intended to support digital transformation efforts within the supply chain to optimize the flow of information, provide real-time data to the supplier, empower collaboration, and scale for adoption. Unfortunately, the ability to execute on the vision failed and it was tied to one key component – the part on delivering real-time data.



Let’s fast forward to...


Jun
22
Rethinking the Data Vault for Real-time Data

The data vault has long been viewed as a model best suited for historical and archival enterprise data. Its “insert only”, business-process approach to raw, unadulterated data is ideal for low-maintenance storage of all enterprise-generated information from all systems. Use cases for data vaults have traditionally revolved around historical tracking and auditing . . . however, the perception has largely been that it is ill-suited to analytics due to its many-to-many relationships and dispersed structure. In fact data vaults are often used as a “lightly modelled stage” for traditional star-schema data warehouses.


But the data vault may be best suited for a use case that...


Mar
30
Lakes, Swamps, and Puddles: The "Data Wetlands" Ecosystem

If you feel like you’re “drowning” in jargon and buzzwords surrounding the recent developments in data lakes and their ilk, you are not alone. A recent TDWI survey showed rapidly increasing adoption of data lakes as a source of big data analytics, though it also revealed barriers to success and confusion around implementation value. Much of this confusion stems from myths and misperceptions around the technical and business uses of a data lake. This article will examine the proper use of a data lake, and how proper governance can prevent it from becoming the dreaded data swamp.


To be clear, a data lake is not a data management platform, in that it is not an integrated, ce...


Mar
23
Where's the "T?" A look at ETL vs. ELT

In a previous blog post, we examined the differences between traditional ETL (extract, transform and load) and ELT, where the “heavy-lifting” of data transformation is handled by the robust and scalable (usually cloud-hosted) target platform. In today’s modern cloud data warehouse environment, ELT maximizes the speed at which data is staged and ingested, while leveraging massive computing power in the cloud to cleanse, aggregate and otherwise prepare that data for general consumption. But where is the best place to manage that transformation piece? Is it using cloud-friendly ETL tools, or is it within the management consoles of the cloud DWs themselves?


A common perception a...


May
29
Talend v. Matillion for Cloud Migration

The two leading ETL/ELT tools for cloud data migration are Talend and Matillion, and both are well-positioned for moving and transforming data into the modern data warehouse. So if you’re moving to any type of cloud-hosted DW, whether it is a cloud-dedicated warehouse such as Snowflake, or part of a larger cloud platform such as AWS Redshift, Azure SQL Data Warehouse or Google BigQuery, which tool should you use to move your existing on-prem data?


Both Talend and Matillion can source any kind of on-prem data and land it in a cloud-hosted data environment. They can also move data to and from AWS’s cloud data-storage S3 as well as Azure’s Blob storage (which can be used to s...


May
28
Approaching an ERP Migration with Analytics in mind

Growth companies today rely on Enterprise Resource Planning (ERP) systems to manage their daily operations and collect and retain vital business data. The ERP space continues to evolve . . . cloud-hosted ERP, AI-based automation, digital transformation, etc.; eventually an organization will find itself upgrading or migrating to a more modern enterprise platform. This move will likely involve a Systems Integrator (SI) specialist to drive the effort, and an SI’s focus may need to incorporate more than just the operational systems at hand . . . the organization’s approach to data integration and analytics should be accounted for as well.



Your SI and internal migration team will init...


Pandata GroupLess

Chicago

WeWork/ Fulton Market

220 N. Green Street

Second Floor

Chicago, IL 60607

Madison

316 W Washington Ave

Suite 525

Madison, Wisconsin 53703

Send Message