Patrick Cuba

Sydney, New South Wales, Australia
6K followers 500+ connections

About

Influential Data Vault innovator and modeller specialising in all things Snowflake Data…

Articles by Patrick

  • Data Vault & Knowledge Graphs, a Love Story…

    Pardon the expressive title; the intent of this article is to marry the two concepts (data vault & knowledge graphs)…

Experience

  • Snowflake

    Sydney, New South Wales, Australia

  • -

    Sydney Area, Australia

  • -

    Sydney, New South Wales, Australia

  • -

    Sydney, New South Wales, Australia

  • -

    Sydney

  • -

    Sydney, Australia

  • -

    Sydney, Australia

  • -

    Sydney Area, Australia

  • -

    Chester, United Kingdom

  • -

    Perth, Australia

  • -

    Johannesburg Area, South Africa

  • -

    Johannesburg Area, South Africa

Education

  • University of Johannesburg

    -

    Activities and Societies: Nothing

  • -

    Activities and Societies: Nothing

    Four core subjects covering Financial Markets and Economic Principles; Financial Risk Management; Financial Regulation, Ethics and Law; Financial Analysis Techniques

  • -

    Activities and Societies: Nothing

  • -

    Activities and Societies: Tennis and Football

Publications

  • The Seven Deadly Sins of Fake Vault

    Data Vault Alliance

    Some implementations parade as Data Vault 2.0 but do not follow the DV2.0 standards. DV2.0 is more than just using surrogate hash keys and omitting end dates…
    Here we explore, at a high level, the seven deadly sins of Fake Vault.

  • Data Vault Mysteries... Business Vault

    Data Vault Alliance

    How to build a Business Vault is a common question in the data vault community. This article dives into what a Business Vault is and how to build one.

  • A Rose By Any Other Name – A Data Vault Perspective

    Data Vault Alliance

    The title and subject of this discussion is the contextual nature of business keys. A key is the immutable value that uniquely represents a thing, entity, interest, or business object, and a business uses it to identify that thing forever. However, a key by itself is meaningless…

  • Learning Data Vault is Like Learning How to Make Beer!

    Data Vault Alliance

    A discussion on cognitive load, Data Vault and beer!

  • Apache Spark GraphX and the Seven Bridges of Königsberg

    Macquarie Bank @Medium

    Solving technical debt using Apache Spark and Data Vault

  • The Data Vault Guru: a pragmatic guide on building a data vault

    Cuba BI Consulting

    The data vault methodology presents a unique opportunity to model the enterprise data warehouse using the same automation principles applicable in today’s software delivery (continuous integration, continuous delivery and continuous deployment) while still maintaining the standards expected for governing a corporation’s most valuable asset: data. This book first lays out the landscape of a modern architecture and then serves as a thorough guide on how to deliver a data model that flexes as the enterprise flexes: the data vault. Whether the data is structured, semi-structured or even unstructured, one thing is clear: there is always a model, applied either early (schema-on-write) or late (schema-on-read). Today’s focus on data governance requires that we know what we retain about our customers; the data vault provides that focus by delivering a methodology that covers all aspects of the customer and offers some of the best practices for modern-day data compliance.
    The book delves into every data vault modelling artefact and its automation with sample code: raw vault, business vault, a testing framework, a build framework, sample data vault models and how to build automation patterns on top of a data vault. It also offers an extension of data vault that provides automated timeline correction, as well as variations of data vault designed to provide audit trails, metadata control and integration with agile delivery tools.

  • Bring out your Dead ... Data

    Data Vault Alliance

    What plans are in place after you meet your demise? This subject is not a comfortable discussion to have for most. What happens to your belongings, your estate, and all that you leave behind? Who will be notified? And, what of those who depend on you? Graveyards paint a picture of dread in horror films and usually the protagonist is not at the cemetery by choice. Whether it was a death by natural causes or the consequences of unfortunate events; those that are affected are left behind to pick up the pieces.

  • Solving the Time Crime of Back-dated Data in Your Data Vault

    Data Vault Alliance

    Analytics assumes that data movement will follow a linear path into a historical repository to represent the correct sequence of events. That is, if today is Thursday and we have loaded data for Wednesday, we do not expect data from Tuesday to arrive today, because all the data up until today should have already been loaded. Whether because of a missing batch file or a missing data record, a Tuesday file may have arrived and loaded without a complete set of records. There are a myriad of reasons why these scenarios might occur; for example, a locked record due to an update in the database may cause a push file produced from the source system to be incomplete. These are real scenarios in loading a data warehouse, and they can skew what we know of a business entity (e.g. customer) and produce erroneous analytics for that business entity or, worse, cause the wrong facts to be reported to a regulatory body or to the customers themselves.

  • Integrating SAS and Data Vault

    SAS Global Forum

    The Data Vault (DV) modelling technique is fast gaining popularity around the world as a data modelling technique that is easy to learn, manage, maintain and adapt to business change. In this paper we briefly explore what DV is and what its artifacts are, and how SAS can be used to automate its data loading patterns.

  • My SAS Grid Scheduler

    SAS Global Forum

    No batch scheduler? No problem! This paper describes the use of a SAS DI Studio job that can be started by a time-dependent scheduler like Windows Scheduler (or crontab in UNIX) to mimic a batch scheduler on SAS Grid.


Projects

  • The Data Must Flow

    - Present

    Artefacts from the Data Vault Guru

Honors & Awards

  • DataEngBytes Melbourne 2024

    DataEngBytes

    Travelled to Melbourne to deliver the presentation I gave in Sydney

  • DataEngBytes Sydney 2024

    DataEngBytes

    Returned to present at DataEngBytes Sydney

  • Keynote Speaker at Data Vault UK 2024

    Data Vault UK

    Returned for a second year to deliver the keynote on Day 2

  • Keynote Speaker at Data Vault UK 2023

    -

    Delivered the opening keynote, titled “The Truth about Data Modelling”, at the inaugural Data Vault UK conference in London

  • DataEngBytes Sydney

    -

    Data engineering perspective for your Data Vault

  • Customer Satisfaction

    Certus Solutions

    Patrick went above & beyond with the effort spent creating and delivering a substantial information pack about the approaches they’re using at Macquarie Bank for their Data Harbour.
    Patrick dove right in to produce a comprehensive presentation that was informative to all.

  • Invited Speaker at SAS Global Forum 2018

    SAS

    I will be presenting at SAS Global Forum in 2018 about Data Vault integration with SAS

  • Invited Speaker at SAS Global Forum 2017

    SAS

  • Excellence in meeting deliverables

    Westpac

  • Nominated for excellence in monthly performance award

    Business & Decision

    System architecture deliverable above and beyond requirements

  • Exemplary performance in tight deliverable

    Bankwest

    Delivered SAS OLAP cube POC

  • Excellence at meeting deliverables

    PBT Group

Organizations

  • DAMA - Australia

    Member

    - Present
  • Business Architecture Guild

    Member

    -

    Business Architecture practices

  • Data Vault Alliance

    Member

    -

    Leading contributor to discussions, offering advice, ideas and stories for Data Vault 2.0 across multiple platforms. Dan Linstedt asked me to contribute on the platform, where I handled the most technical topics. I often introduced innovative ideas to the platform, such as:
    - why we prefer schema evolution over refactoring
    - business keys always loaded as varchar to hub tables
    - extended record tracking satellites
    - effectivity satellite and driver key
    - pit flow manifold
    - why hash on Snowflake
    and many more.

  • SAS User Group (SNUG)

    Committee Member

    -

    Leading and assisting in organizing the local chapter of the SAS user group.

  • SAS Professionals

    Member

    -

    Association of SAS professionals; my input covers things like administration, platform and data engineering using SAS. http://www.sasprofessionals.net/
