About
Influential Data Vault innovator and modeller specialising on all things Snowflake Data…
Services
Articles by Patrick
Activity
-
Day 1 of our east coast customer tour wrapped up with two great meetings: one with Tandigm Health 🏥 and the other with Anup from Thomson Reuters 💼.…
Day 1 of our east coast customer tour wrapped up with two great meetings: one with Tandigm Health 🏥 and the other with Anup from Thomson Reuters 💼.…
Liked by Patrick Cuba
-
How much do you trust the outcome of your prompt? #GenAI
How much do you trust the outcome of your prompt? #GenAI
Shared by Patrick Cuba
-
Helsinki data professionals, this is your final chance to join us for an incredible day of learning, networking, and hands-on Data Vault workshops!…
Helsinki data professionals, this is your final chance to join us for an incredible day of learning, networking, and hands-on Data Vault workshops!…
Liked by Patrick Cuba
Experience
Education
Licenses & Certifications
Publications
-
The Seven Deadly Sins of Fake Vault
Data Vault Alliance
There are Data Vault implementations parading as Data Vault 2.0 implementations but do not follow the DV2.0 standards. DV2.0 is more than just using surrogate hash keys and omitting end dates….
Here we explore at a high level the seven deadly sins of Fake Vault -
Data Vault Mysteries... Business Vault
Data Vault Alliance
How to build a Business Vault is a common question in the data vault community, the article dives into what it is and how to build a Business Vault
-
A Rose By Any Other Name – A Data Vault Perspective
Data Vault Alliance
The title and subject of this discussion is the contextual nature of business keys, keys are the immutable value that represents a thing, entity, interest, or business object uniquely and is used by a business to identify that thing forever. However, a key by itself is meaningless…
-
Learning Data Vault is Like Learning How to Make Beer!
Data Vault Alliance
A discussion on Cognitive load, Data Vault and Beer!
-
Apache Spark GraphX and the Seven Bridges of Königsberg
Macquarie Bank @Medium
Solving technical debt using Apache Spark and Data Vault
-
The Data Vault Guru: a pragmatic guide on building a data vault
Cuba BI Consulting
The data vault methodology presents a unique opportunity to model the enterprise data warehouse using the same automation principles applicable in today’s software delivery, continuous integration, continuous delivery and continuous deployment while still maintaining the standards expected for governing a corporation’s most valuable asset: data. This book provides at first the landscape of a modern architecture and then as a thorough guide on how to deliver a data model that flexes as the…
The data vault methodology presents a unique opportunity to model the enterprise data warehouse using the same automation principles applicable in today’s software delivery, continuous integration, continuous delivery and continuous deployment while still maintaining the standards expected for governing a corporation’s most valuable asset: data. This book provides at first the landscape of a modern architecture and then as a thorough guide on how to deliver a data model that flexes as the enterprise flexes, the data vault. Whether the data is structured, semi-structured or even unstructured one thing is clear, there is always a model either applied early (schema-on-write) or applied late (schema-on-read). Today’s focus on data governance requires that we know what we retain about our customers, the data vault provides that focus by delivering a methodology focused on all aspects about the customer and provides some of the best practices for modern day data compliance.
The book will delve into every data vault modelling artefact, its automation with sample code, raw vault, business vault, testing framework, a build framework, sample data vault models, how to build automation patterns on top of a data vault and even offer an extension of data vault that provides automated timeline correction, not to mention variation of data vault designed to provide audit trails, metadata control and integration with agile delivery tools. -
Bring out your Dead ... Data
Data Vault Alliance
What plans are in place after you meet your demise? This subject is not a comfortable discussion to have for most. What happens to your belongings, your estate, and all that you leave behind? Who will be notified? And, what of those who depend on you? Graveyards paint a picture of dread in horror films and usually the protagonist is not at the cemetery by choice. Whether it was a death by natural causes or the consequences of unfortunate events; those that are affected are left behind to pick…
What plans are in place after you meet your demise? This subject is not a comfortable discussion to have for most. What happens to your belongings, your estate, and all that you leave behind? Who will be notified? And, what of those who depend on you? Graveyards paint a picture of dread in horror films and usually the protagonist is not at the cemetery by choice. Whether it was a death by natural causes or the consequences of unfortunate events; those that are affected are left behind to pick up the pieces.
-
Solving the Time Crime of Back-dated Data in Your Data Vault
Data Vault Alliance
Analytics assumes that data movement will follow a linear path into a historical repository to represent the correct sequence of events. That is, if today is Thursday and we have loaded data for Wednesday, we then do not expect data from Tuesday to arrive today, because all the data up until today should have already been loaded. Whether it be a missing batch file or a missing data record – a Tuesday file may have arrived and loaded but could have been provided without a complete set of…
Analytics assumes that data movement will follow a linear path into a historical repository to represent the correct sequence of events. That is, if today is Thursday and we have loaded data for Wednesday, we then do not expect data from Tuesday to arrive today, because all the data up until today should have already been loaded. Whether it be a missing batch file or a missing data record – a Tuesday file may have arrived and loaded but could have been provided without a complete set of records. There are a myriad of reasons why these scenarios might occur (a locked record due to an update in the database may cause a push file produced from the source system to be incomplete). These are real scenarios in loading a data warehouse and it can skew what we know of a business entity (e.g. customer) and may derive erroneous analytics for that business entity or worse, the wrong facts are reported to a regulatory body or to the customers themselves.
-
Integrating SAS and Data Vault
SAS Global Forum
Data Vault (DV) modelling technique is fast gaining popularity around the world as an easy to learn, easy to
manage, easy to maintain and easy to adapt (to business change) data modelling technique. In this paper
we will briefly explore what DV is; DV artifacts and we will explore how SAS can be used to automate its
data loading patterns. -
My SAS Grid Scheduler
SAS Global Forum
No Batch Scheduler? No problem! This paper describes the use of a SAS DI Studio job that can be
started by a time dependent scheduler like Windows Scheduler (or crontab in UNIX) to mimic a batch
scheduler on SAS Grid.
Projects
-
The Data Must Flow
- Present
Artefacts from the Data Vault Guru
Honors & Awards
-
DataEngBytes Melbourne 2024
DataEngBytes
Travelled to Melbourne to deliver my presentation I did in Sydney
-
DataEngBytes Sydney 2024
DataEngBytes
Returned to present at DataEngBytes Sydney
-
Keynote Speaker at Data Vault UK 2024
Data Vault UK
Returned for a second year to deliver the keynote on Day 2
-
Keynote Speaker at Data Vault UK 2023
-
Delivered the opening Keynote to the inaugural Data Vault UK conference in London; titled “The Truth about Data Modelling”
-
DataEngBytes Sydney
-
Data engineering perspective for your Data Vault
-
Customer Satisfaction
Certus Solutions
Patrick went above & beyond with the effort spent creating and delivering a substantial information pack about the approaches they’re using at Macquarie Bank for their Data Harbour.
Patrick dove right in to produce a comprehensive presentation that was informative to all. -
Invited Speaker at SAS Global Forum 2018
SAS
I will be presenting at SAS Global forum in 2018 about Data Vault integration with SAS
-
Invited Speaker at SAS Global Forum 2017
SAS
-
Excellence in meeting deliverables
Westpac
-
Nominated for excellence in monthly performance award
Business & Decision
System architecture deliverable above and beyond requirements
-
Exemplary performance in tight deliverable
Bankwest
Delivered SAS OLAP cube POC
-
Excellence at meeting deliverables
PBT Group
Organizations
-
DAMA - Australia
Member
- Present -
Business Architecture Guild
Member
-Business Architecture practices
-
Data Vault Alliance
Member
-Leading contributor to discussions offering advice, ideas and stories for data vault 2.0 across multiple platforms. Asked to contribute on the platform by Dan Linstedt where the most technical of topics were being handled by me. Often introduced innovative ideas to the platform like: - why we prefer schema evolution over refactoring - business keys always loaded as varchar to hub tables - extended record tracking satellites - effectivity satellite and driver key - pit flow manifold - why hash…
Leading contributor to discussions offering advice, ideas and stories for data vault 2.0 across multiple platforms. Asked to contribute on the platform by Dan Linstedt where the most technical of topics were being handled by me. Often introduced innovative ideas to the platform like: - why we prefer schema evolution over refactoring - business keys always loaded as varchar to hub tables - extended record tracking satellites - effectivity satellite and driver key - pit flow manifold - why hash on Snowflake and many more
-
SAS User Group (SNUG)
Committee Member
-Leading and assisting in organizing the local chapter of the SAS user group.
-
SAS Professionals
Member
-Association of SAS professionals; my input covers things like administration, platform and data engineering using SAS. http://www.sasprofessionals.net/
Recommendations received
2 people have recommended Patrick
Join now to viewMore activity by Patrick
-
🧩 Struggling to align data from multiple operational systems? You’re not alone. Different systems define core business concepts—like customers…
🧩 Struggling to align data from multiple operational systems? You’re not alone. Different systems define core business concepts—like customers…
Liked by Patrick Cuba
-
In the last Zach Wilson's data engineering boot camp, Albert Campillo created a series of #dbt infographics to illustrate the dbt content I…
In the last Zach Wilson's data engineering boot camp, Albert Campillo created a series of #dbt infographics to illustrate the dbt content I…
Liked by Patrick Cuba
-
Excited to lead a panel discussion on 5/7 about how leaders can navigate M&A and establish a scalable data warehouse to drive business growth…
Excited to lead a panel discussion on 5/7 about how leaders can navigate M&A and establish a scalable data warehouse to drive business growth…
Liked by Patrick Cuba
-
Exciting news! Ben Kerford, President at 7Rivers, has been featured in Innovations of the World Milwaukee. In this spotlight, he shares how AI and…
Exciting news! Ben Kerford, President at 7Rivers, has been featured in Innovations of the World Milwaukee. In this spotlight, he shares how AI and…
Liked by Patrick Cuba
-
I’m happy to share that I’ve obtained a new certification: Get started with Databricks for Data Warehousing from Databricks! 🏎️ VaultSpeed…
I’m happy to share that I’ve obtained a new certification: Get started with Databricks for Data Warehousing from Databricks! 🏎️ VaultSpeed…
Liked by Patrick Cuba
-
This morning, I had the privilege of chatting with Tom Redman and Tom Davenport about Knowledge Graphs and Unstructured Data. Everyone’s talking…
This morning, I had the privilege of chatting with Tom Redman and Tom Davenport about Knowledge Graphs and Unstructured Data. Everyone’s talking…
Liked by Patrick Cuba
-
✨ One more inspiring Data Intelligence Day—this time in the vibrant city of Paris! It was truly energizing to see how the audience embraced key…
✨ One more inspiring Data Intelligence Day—this time in the vibrant city of Paris! It was truly energizing to see how the audience embraced key…
Liked by Patrick Cuba
-
You declare “Kimball modelling is dead”, you either were doing #kimballmodelling wrong or have never seen the benefits of doing so!
You declare “Kimball modelling is dead”, you either were doing #kimballmodelling wrong or have never seen the benefits of doing so!
Shared by Patrick Cuba
-
A mini-book for accelerating your #datavault program with tips and tricks and guidance. Largely based on the "Data Vault Guru: a pragmatic guide for…
A mini-book for accelerating your #datavault program with tips and tricks and guidance. Largely based on the "Data Vault Guru: a pragmatic guide for…
Shared by Patrick Cuba
Other similar profiles
Explore collaborative articles
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
Explore MoreOthers named Patrick Cuba
-
Patrick Cuba
IT Architect, Research Computing Group at Saint Louis University
-
Patrick Cuba
-
Patrick Cuba
SAS Consultant at PBT Group
-
PATRICK CUBA
--
17 others named Patrick Cuba are on LinkedIn
See others named Patrick Cuba