The Warehouse

BLOG & REsources

Jun
22
Rethinking the Data Vault for Real-time Data

The data vault has long been viewed as a model best suited for historical and archival enterprise data. Its “insert only”, business-process approach to raw, unadulterated data is ideal for low-maintenance storage of all enterprise-generated information from all systems. Use cases for data vaults have traditionally revolved around historical tracking and auditing . . . however, the perception has largely been that it is ill-suited to analytics due to its many-to-many relationships and dispersed structure. In fact data vaults are often used as a “lightly modelled stage” for traditional star-schema data warehouses.


But the data vault may be best suited for a use case that...


Mar
30
Lakes, Swamps, and Puddles: The "Data Wetlands" Ecosystem

If you feel like you’re “drowning” in jargon and buzzwords surrounding the recent developments in data lakes and their ilk, you are not alone. A recent TDWI survey showed rapidly increasing adoption of data lakes as a source of big data analytics, though it also revealed barriers to success and confusion around implementation value. Much of this confusion stems from myths and misperceptions around the technical and business uses of a data lake. This article will examine the proper use of a data lake, and how proper governance can prevent it from becoming the dreaded data swamp.


To be clear, a data lake is not a data management platform, in that it is not an integrated, ce...


Mar
23
Where's the "T?" A look at ETL vs. ELT

In a previous blog post, we examined the differences between traditional ETL (extract, transform and load) and ELT, where the “heavy-lifting” of data transformation is handled by the robust and scalable (usually cloud-hosted) target platform. In today’s modern cloud data warehouse environment, ELT maximizes the speed at which data is staged and ingested, while leveraging massive computing power in the cloud to cleanse, aggregate and otherwise prepare that data for general consumption. But where is the best place to manage that transformation piece? Is it using cloud-friendly ETL tools, or is it within the management consoles of the cloud DWs themselves?


A common perception a...


Aug
01
Pandata Group Partners with ThoughtSpot to Deliver Search and AI-driven Analytics

Pandata Group is pleased to announce a partnership with ThoughtSpot, to deliver search and AI-driven analytics to reshape the way mid-market and enterprise organizations are able to answer data questions, find insights and make decisions.



ThoughtSpot was recently positioned in the Leaders quadrant of the Gartner 2019 Magic Quadrant for Analytics and Business Intelligence Platforms. The software platform is a spark to what Gartner recognizes as the third wave of disruption to traditional BI in the form of augmented analytics. With ThoughtSpot, business people can type a simple Google-like search in natural language to instantly analyze billions of rows of data, and leverage artificial intelli...


Mar
08
Data Integration Roadmap - Part One: The Logical Data Model

Our recent blog series on the data integration portfolio introduced a variety of new architectures that help the enterprise manage their data resources, including replication, virtualization and cloud data warehousing. Organizations are now able to integrate multiple data management solutions to address a variety of business sources and requirements. But it is important to understand that the foundation of any enterprise data management portfolio remains the same . . . a roadmap to data management must be created that is independent of the underlying technology. This series of blogs will examine the three main elements of the data integration roadmap: the logical data model, master data ma...


Sep
06
The Data Integration Portfolio - Part Four: Putting It All Together (In The Cloud)

This blog series has examined the hybrid data portfolio as a mix of technologies and approaches to a data foundation for the modern enterprise. We’ve examined a variety of strategies and technologies in data integration, including virtualization, replication and streaming data. We’ve shown that there is no “one size fits all” approach to an integrated data foundation, but instead have seen how a variety of disciplines that suit specific business and technical challenges can make up a cohesive data policy.


This final chapter puts it all together under the umbrella of “time-to-value" and its importance to the agile enterprise data platform. No matter what the techn...


Jun
11
Pandata Group Snowflake Partner Announcement


temp-post-image



Pandata Group's partners help us empower our clients and deliver value from data and analytics using the best technology for their particular business needs, which is why we're excited to announce we've added Snowflake to that list. After getting to know the Snowflake team and the ins and outs of their product, we see the immediate value Snowflake can bring to clients in search of a modern data warehousing solution.



Why Snowflake?



  • A fully managed, born-in-the-cloud data warehouse that delivers power, flexibility, and simplicity.

  • Hosted on Amazon Web Services (AWS), the platform is elastic as you need it to be. Start small and scale as you grow. Conversley you can scale back, on-the-fly, as ...

Pandata GroupLessLess
Pandata GroupLess

Madison

Pandata Group, LLC

316 W. Washington Avenue

Suite 525

Madison, WI 53704

(877) 350-5192

Madison

Pandata Group, LLC

316 W. Washington Avenue

Suite 525

Madison, WI 53704

(877) 350-5192

Less

Pandata Group © 2020

Pandata Group © 2020

Less


Chicago

WeWork/ Fulton Market

220 N. Green Street

Second Floor

Chicago, IL 60607


Chicago

WeWork/ Fulton Market

220 N. Green Street

Second Floor

Chicago, IL 60607

Less