The Warehouse

Short reads. Big insights.
Industry trends. Thought Leadership. Opinions. Hot Tips. And so much more.
 

May
27
Three Levels of Data Maturity – Part Two

In the first installment of this series, we examined how mid-sized enterprises can quickly get started on their journey to data maturity by implementing an operational reporting platform in as little as 4 to 6 weeks. The target users for this type of data service are the mid-line operational managers looking for actionable, tactical insight into system operations. The next user group to reach on the data maturity journey are the decision-makers at the departmental level (finance, sales, marketing, supply chain, etc) who require strategic insight for planning and resource management. The architecture required for this next level of data analytics is the 2-tiered subject-oriented data warehou...


Mar
22
Three Levels of Data Maturity – Part One

Today’s businesses continue to strive for data maturity and to create a culture that is data-driven and data-literate. But the perception remains that starting such a journey is expensive and time-consuming. There is a belief that a combination of high upstart costs and lengthy implementation time prevents the business from seeing any near-future value in its data and analytics investment. The reality, however, is that today’s cloud-based data platform technologies, specifically the triumvirate of Snowflake (data storage and compute), Matillion (data loading and transformation), and ThoughtSpot (data analytics and insight), enable rapid analytic ability with minimum initial inv...


Mar
18
3 Killers of Data Cloud Modernization Planning

The time it takes from beginning a data cloud initiative to realizing any of tangible value can be years.


After all, there is a lot to consider. First you need to ensure that there is business alignment, between the business and I.T., on what the short, mid, and long-term objectives are. Second, there is the small task of assessing your entire data estate. It is important to figure out with each vendor which modern data stack configuration, workload option, availability zone, and pricing plan is the optimal configuration for you.


Third, you need to build a migration plan, defining a strategy for each possible candidate analytical application and data set, that will drive value to the business ...


Oct
12
Data Modeling in the Cloud Era

We have all seen how more and more companies are moving to the cloud for their data management platforms. Snowflake, Azure Synapse, AWS Redshift, and Google Big Query are leading this charge towards low-admin, instantly scalable cloud database solutions. Accompanying this is a migration to cloud-hosted data integration and low-code ETL solutions like Matillion and Fivetran. It is tempting to assume that with all these low-overhead data management platforms the concept of data modeling may be a thing of the past, relegated to the pile of on-premise databases that this brave new world is supplanting.


In reality, data modeling is more important than ever. A key to understanding this importance i...


Sep
30
Easily Connect to Any API Source From Matillion ELT

If you are like many ETL developers you’ve struggled with an easy way to source cloud services data via REST API. Although standards are in place for REST API web services protocols, it seems that every vendor has their own variation of them, creating new challenges for each new source. Matillion’s cloud ELT product has long featured an API profile creator that sources from JSON files and creates RSD (Real Simple Discovery, an XML format) scripts for use with API query components. The effectiveness of this approach, however, is only as good as the quality of JSON files provided by the vendor.


Now, with version 1.47, Matillion introduces much more simplified functionality for extra...


Sep
27
Data Technology Needs a Data Culture


We have all heard the notion that data should be viewed as an asset, the “new oil”, a crucial business resource for the enterprise. When viewed in this context, a business’s investment in data must be assessed by the quality of its return. Measuring this return is more than just “how many cool-looking dashboards” you can now create; rather, a CTO/CDO must look beyond the technical capabilities and determine if data is having a true impact on the business culture. This means evolving a BI and Data Science community that engages employees at all levels of the organization. The path to this positive return is creating a modern data culture that permeates the enter...


Feb
25
Snowflake Data Cloud and the future of SAP BW

Enterprises running SAP Business Warehouse (SAP BW, BW/4HANA and BW on HANA) are keeping a close eye on challengers like Snowflake. Is cloud data warehousing the answer to all the challenges for organisations with a large SAP footprint? And how does a cloud data warehouse fit into the data architecture? Should it replace SAP BW or is there still value in SAP BW?


Why do most enterprises with a large SAP footprint run SAP BW?


The dominant position of SAP BW is easy to explain from a historic perspective, but it would not do justice to SAP BW to ignore its current strengths as well. Let us start with the latter. SAP BW is still the only data warehouse platform which delivers all data warehouse f...


Jan
28
Hey You! Get Onto My Cloud! Rise of the Data Cloud Pt 1

In 2015, I was fortunate enough to lead the sales development effort of a cloud-based supply chain visibility product. The product was geared toward the manufacturing sector to allow a more transparent and collaborative platform to enable information sharing. We leveraged the Salesforce cloud to build the application, and it was intended to support digital transformation efforts within the supply chain to optimize the flow of information, provide real-time data to the supplier, empower collaboration, and scale for adoption. Unfortunately, the ability to execute on the vision failed and it was tied to one key component – the part on delivering real-time data.



Let’s fast forward to...


Jun
22
Rethinking the Data Vault for Real-time Data

The data vault has long been viewed as a model best suited for historical and archival enterprise data. Its “insert only”, business-process approach to raw, unadulterated data is ideal for low-maintenance storage of all enterprise-generated information from all systems. Use cases for data vaults have traditionally revolved around historical tracking and auditing . . . however, the perception has largely been that it is ill-suited to analytics due to its many-to-many relationships and dispersed structure. In fact data vaults are often used as a “lightly modelled stage” for traditional star-schema data warehouses.


But the data vault may be best suited for a use case that...


Mar
30
Lakes, Swamps, and Puddles: The "Data Wetlands" Ecosystem

If you feel like you’re “drowning” in jargon and buzzwords surrounding the recent developments in data lakes and their ilk, you are not alone. A recent TDWI survey showed rapidly increasing adoption of data lakes as a source of big data analytics, though it also revealed barriers to success and confusion around implementation value. Much of this confusion stems from myths and misperceptions around the technical and business uses of a data lake. This article will examine the proper use of a data lake, and how proper governance can prevent it from becoming the dreaded data swamp.


To be clear, a data lake is not a data management platform, in that it is not an integrated, ce...


Pandata Group

Chicago

420 W. Huron Street

Suite 201 

Chicago, IL 60650

Madison

701 E. Washington Ave

Suite 202

Madison, Wisconsin 53703

Cincinnati

151 W. 4th Street

Cincinnati, OH 45202