The Warehouse

BLOG & NOTES

Mar 30
Lakes, Swamps, and Puddles: The "Data Wetlands" Ecosystem

If you feel like you’re “drowning” in the jargon and buzzwords surrounding recent developments in data lakes and their ilk, you are not alone. A recent TDWI survey showed rapidly increasing adoption of data lakes as a source for big data analytics, though it also revealed barriers to success and confusion about their implementation value. Much of this confusion stems from myths and misperceptions about the technical and business uses of a data lake. This article examines the proper use of a data lake and how sound governance can keep it from becoming the dreaded data swamp.


To be clear, a data lake is not a data management platform, in that it is not an integrated, ce...


Mar 23
Where's the "T?" A look at ETL vs. ELT

In a previous blog post, we examined the differences between traditional ETL (extract, transform, and load) and ELT, where the “heavy lifting” of data transformation is handled by the robust and scalable (usually cloud-hosted) target platform. In a modern cloud data warehouse environment, ELT maximizes the speed at which data is staged and ingested, while leveraging the massive computing power of the cloud to cleanse, aggregate, and otherwise prepare that data for general consumption. But where is the best place to manage that transformation piece? Is it in cloud-friendly ETL tools, or within the management consoles of the cloud DWs themselves?


A common perception a...


Madison

Pandata Group, LLC

316 W. Washington Avenue

Suite 525

Madison, WI 53704


Pandata Group © 2020



Chicago

WeWork/ Fulton Market

220 N. Green Street

Second Floor

Chicago, IL 60607

