In the contemporary age of Big Data, data warehouse systems and data science analytics infrastructures have become essential components for organizations to store, analyze, and make data-driven decisions. So why use IaC for cloud data infrastructures?
While the front-end report visuals are important and the most visible to end users, a lot goes on behind the scenes that contributes heavily to the end product, including data modeling. In this blog, we’ll describe data modeling and its significance in Power BI. What is Data Modeling?
However, to fully harness the potential of a data lake, effective data modeling methodologies and processes are crucial. Data modeling plays a pivotal role in defining the structure, relationships, and semantics of data within a data lake, and in maintaining consistency of data throughout the data lake.
Key features of cloud analytics solutions include: data models, processing applications, and analytics models. Data models help visualize and organize data, processing applications handle large datasets efficiently, and analytics models aid in understanding complex data sets, laying the foundation for business intelligence.
It includes processes that trace and document the origin of data, models, and associated metadata and pipelines for audits. How to scale AI and ML with built-in governance: A fit-for-purpose data store built on an open lakehouse architecture allows you to scale AI and ML while providing built-in governance tools.
Additionally, Feast promotes feature reuse, so the time spent on data preparation is greatly reduced. It promotes a disciplined approach to data modeling, making it easier to ensure data quality and consistency across the ML pipelines.
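As a rough illustration of that reuse, here is a minimal sketch of a Feast feature definition, assuming Feast's current Python API; the entity, fields, and file path are all hypothetical.

```python
# A minimal sketch of a reusable Feast feature definition.
# All names (driver, driver_stats.parquet, fields) are hypothetical.
from feast import Entity, FeatureView, Field, FileSource
from feast.types import Float32, Int64

# The entity the features are keyed on.
driver = Entity(name="driver", join_keys=["driver_id"])

# Offline source holding historical feature values.
stats_source = FileSource(
    path="data/driver_stats.parquet",
    timestamp_field="event_timestamp",
)

# Defined once, this view can be reused across training and serving pipelines.
driver_stats = FeatureView(
    name="driver_hourly_stats",
    entities=[driver],
    schema=[
        Field(name="trips_today", dtype=Int64),
        Field(name="avg_rating", dtype=Float32),
    ],
    source=stats_source,
)
```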
The ultimate need for vast storage spaces manifests in data warehouses: specialized systems that aggregate data coming from numerous sources for centralized management and consistency. In this article, you’ll discover what a Snowflake data warehouse is, its pros and cons, and how to employ it efficiently.
Hierarchies align data modelling with business processes, making it easier to analyse data in a context that reflects real-world operations. Designing Hierarchies: Designing effective hierarchies requires careful consideration of the business requirements and the data model.
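As a small illustration, here is a hedged pandas sketch of one of the most common hierarchies, a Year > Quarter > Month date hierarchy; the column names are hypothetical.

```python
# A minimal sketch of a Year > Quarter > Month hierarchy for a date
# dimension; column names are hypothetical.
import pandas as pd

dim_date = pd.DataFrame({"date": pd.date_range("2024-01-01", periods=365)})
dim_date["year"] = dim_date["date"].dt.year
dim_date["quarter"] = "Q" + dim_date["date"].dt.quarter.astype(str)
dim_date["month"] = dim_date["date"].dt.month_name()
# Each level rolls up into the one above it, mirroring how the business
# reports: monthly -> quarterly -> annually.
print(dim_date.head(3))
```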
With the birth of cloud data warehouses, data applications, and generative AI, processing large volumes of data faster and cheaper is more approachable and desired than ever. First up, let’s dive into the foundation of every Modern Data Stack: a cloud-based data warehouse.
Few actors in the modern data stack have inspired as much enthusiasm and fervent support as dbt. This data transformation tool enables data analysts and engineers to transform, test, and document data in the cloud data warehouse. This graph is an example of one analysis, documented in our internal catalog.
Data Vault - Data Lifecycle: Architecturally, let’s break the data lifecycle in the data vault into the following layers, which play a key role in choosing the right pattern and tools to implement. Data Acquisition: extracting data from source systems and making it accessible.
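To make the step from acquisition into the vault concrete, here is a minimal sketch of loading a Data Vault hub, assuming MD5 hash keys over the business key; the table and column names are hypothetical.

```python
# A minimal sketch of a Data Vault hub load: derive a stable hash key
# from the business key. Table and column names are hypothetical.
import hashlib
import pandas as pd

source = pd.DataFrame({"customer_number": ["C001", "C002", "C001"]})

def hash_key(business_key: str) -> str:
    # Normalize before hashing so the same key always yields the same hash.
    return hashlib.md5(business_key.strip().upper().encode()).hexdigest()

hub_customer = (
    source.drop_duplicates("customer_number")
          .assign(
              hub_customer_hk=lambda df: df["customer_number"].map(hash_key),
              load_dts=pd.Timestamp.now(tz="UTC"),
              record_source="crm",
          )
)
```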
One of the easiest ways for Snowflake to achieve this is to have analytics solutions query their data warehouse in real time (also known as DirectQuery). This ensures the maximum amount of Snowflake consumption possible. For more information on composite models, check out Microsoft’s official documentation.
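A minimal sketch of that live-query pattern using the snowflake-connector-python package; all connection parameters and the query below are placeholders.

```python
# A minimal sketch of querying Snowflake live; credentials and the
# query below are placeholders.
import snowflake.connector

conn = snowflake.connector.connect(
    account="my_account",      # placeholder
    user="analytics_svc",      # placeholder
    password="***",            # use a secrets manager in practice
    warehouse="REPORTING_WH",  # placeholder
)
try:
    cur = conn.cursor()
    cur.execute("SELECT region, SUM(amount) FROM sales GROUP BY region")
    for region, total in cur.fetchall():
        print(region, total)
finally:
    conn.close()
```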
The traditional data science workflow, as defined by Joe Blitzstein and Hanspeter Pfister of Harvard University, contains 5 key steps: Ask a question. Get the data. Explore the data. Model the data. Communicate and visualize the results. A data catalog can assist directly with every step but model development.
External Data Sources: These can be market research data, social media feeds, or third-party databases that provide additional insights. Data can be structured or unstructured (e.g., documents and images). The diversity of data sources allows organizations to create a comprehensive view of their operations and market conditions.
Consider factors such as data volume, query patterns, and hardware constraints. Use indexing and partitioning strategies to improve query performance. Document and Communicate: Maintain thorough documentation of fact table designs, including definitions, calculations, and relationships.
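As a rough sketch of the indexing advice, the snippet below (with SQLite standing in for the warehouse) indexes the column most query patterns filter on; the schema is hypothetical.

```python
# A minimal sketch: index the fact-table column most queries filter on.
# Schema is hypothetical; SQLite stands in for the warehouse.
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("""
    CREATE TABLE fact_sales (
        sale_id INTEGER PRIMARY KEY,
        date_key INTEGER,
        amount REAL
    )
""")
conn.execute("CREATE INDEX idx_fact_sales_date ON fact_sales (date_key)")
# The plan should show the index being used for the date filter.
plan = conn.execute(
    "EXPLAIN QUERY PLAN SELECT SUM(amount) FROM fact_sales WHERE date_key = 20240101"
).fetchall()
print(plan)
```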
A common problem solved by phData is the migration from an existing data platform to the Snowflake Data Cloud, in the best possible manner. This allows you to potentially build out a new data model, reduce any complexity from the existing architecture, and plan for the long-term future use cases.
In data modeling, dbt has gradually emerged as a powerful tool that largely simplifies the process of building and handling data pipelines. dbt is an open-source command-line tool that allows data engineers to transform, test, and document data in a single hub, following the best practices of software engineering.
It is the process of converting raw data into relevant and practical knowledge to help evaluate the performance of businesses, discover trends, and make well-informed choices. Data gathering, data integration, data modelling, analysis of information, and data visualization are all part of business intelligence.
dbt offers a SQL-first transformation workflow that lets teams build data transformation pipelines while following software engineering best practices like CI/CD, modularity, and documentation. But you still want to start building out the data model.
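For a taste of how a single model fits into that workflow, here is a hedged sketch of a dbt Python model (supported on warehouses such as Snowflake, where dbt.ref returns a Snowpark DataFrame); the upstream model name and columns are hypothetical.

```python
# A minimal sketch of a dbt Python model; "stg_orders" and its columns
# are hypothetical. On Snowflake, dbt.ref returns a Snowpark DataFrame.
def model(dbt, session):
    orders = dbt.ref("stg_orders").to_pandas()
    # A typical mart-layer transformation: one row per customer.
    summary = orders.groupby("customer_id", as_index=False)["amount"].sum()
    return summary  # dbt materializes the returned DataFrame as a table
```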
ETL is a process for moving and managing data from various sources to a central data warehouse. This process ensures that data is accurate, consistent, and usable for analysis and reporting, and helps organisations manage large volumes of data efficiently.
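A minimal sketch of that extract-transform-load flow in pandas, with SQLite standing in for the central warehouse; the file, table, and column names are hypothetical.

```python
# A minimal ETL sketch; file, table, and column names are hypothetical.
import sqlite3
import pandas as pd

raw = pd.read_csv("orders.csv")                      # Extract
raw["order_date"] = pd.to_datetime(raw["order_date"])
clean = raw.dropna(subset=["customer_id"])           # Transform: keep usable rows
with sqlite3.connect("warehouse.db") as conn:        # Load into the central store
    clean.to_sql("orders", conn, if_exists="replace", index=False)
```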
In addition, MLOps practices like data versioning, experiment tracking, artifact management, and others also need to be part of the GenAI productization process. For example, when indexing a new version of a document, it’s important to take care of versioning in the ML pipeline. This also makes indexing more efficient.
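One hedged sketch of what version-aware indexing could look like, using a plain in-memory dict in place of a real vector store; the document IDs, versions, and chunking scheme are all hypothetical.

```python
# A hedged sketch of version-aware indexing with a plain dict standing in
# for a real index; doc IDs, versions, and chunking are hypothetical.
index: dict[str, dict] = {}

def index_document(doc_id: str, version: int, chunks: list[str]) -> None:
    # Write the new version's chunks, overwriting same-position entries.
    for i, chunk in enumerate(chunks):
        index[f"{doc_id}:{i}"] = {"text": chunk, "version": version}
    # Remove leftover chunks from older versions (e.g., when the new
    # version has fewer chunks than the old one).
    stale = [key for key, entry in index.items()
             if key.startswith(f"{doc_id}:") and entry["version"] < version]
    for key in stale:
        del index[key]

index_document("faq", 1, ["old intro", "old details", "old appendix"])
index_document("faq", 2, ["new intro", "new details"])  # appendix is dropped
```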
MDM is a discipline that helps organize critical information to avoid duplication, inconsistency, and other data quality issues. Transactional systems and data warehouses can then use the golden records as the entity’s most current, trusted representation.
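A minimal pandas sketch of a survivorship rule that produces such golden records; the matching key and columns are hypothetical.

```python
# A minimal sketch of golden-record survivorship: duplicates are matched
# on a normalized key and the most recent record wins. Schema hypothetical.
import pandas as pd

customers = pd.DataFrame({
    "email": ["a@x.com", "A@X.COM", "b@y.com"],
    "name": ["Ann Lee", "Ann Lee", "Bo Yu"],
    "updated_at": pd.to_datetime(["2024-01-01", "2024-06-01", "2024-03-01"]),
})
customers["match_key"] = customers["email"].str.lower()
golden = (
    customers.sort_values("updated_at")
             .groupby("match_key", as_index=False)
             .last()  # survivorship rule: most recently updated wins
)
```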
Using SQL-centric transformations to model data to be deployed. dbt is also great for data lineage and documentation to empower business analysts to make informed decisions on their data. Data Ingestion with Fivetran: Fivetran is used to move your source(s) into a centralized space for storage.
A truly governed self-service analytics model puts data modeling responsibilities in the hands of IT and report generation and analysis in the hands of business users who will actually be doing the analysis. Business users build reports on an IT-owned and IT-created data model that is focused on reporting solutions.
With Snowflake, data stewards have a choice to leverage Snowflake’s governance policies to operationalize data governance at scale. Two problems arise. First, stewards are dependent on data warehouse admins to provide information and to create and edit enforcement policies in Snowflake.
Their tasks encompass:
Data Collection and Extraction: Identify relevant data sources and gather data from various internal and external systems; extract, transform, and load data into a centralized data warehouse or analytics platform.
Data Cleaning and Preparation: Cleanse and standardize data to ensure accuracy, consistency, and completeness.
When a new entrant to ETL development reads this article, they could easily master Matillion Designer’s methods or read through the Matillion Versioning Documentation to develop their own approach to ZDLC. One scenario could be multiple team members each working on ingesting and processing data from one of the source systems.
In this article, we’ll explore how AI can transform unstructured data into actionable intelligence, empowering you to make informed decisions, enhance customer experiences, and stay ahead of the competition. What is Unstructured Data? These AI models use multimedia data to understand and interpret more complicated information.
Structuring the dbt Project: The most important aspect of any dbt project is its structural design, which organizes project files and code in a way that supports scalability for large data warehouses. Downstream Models Dependent on Source: Downstream models (marts or intermediate) should not directly depend on source nodes.
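A hedged sketch of that layering rule in dbt's Python-model form: only staging models should read sources, so downstream models go through ref(); the model and source names are hypothetical.

```python
# A sketch of the layering rule; model and source names are hypothetical.
def model(dbt, session):
    # Correct: a downstream (mart) model depends on a staging model.
    orders = dbt.ref("stg_orders")
    # Avoid: depending directly on the raw source from a downstream model;
    # e.g. dbt.source("erp", "raw_orders") belongs in staging models only.
    return orders
```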
It is widely used for storing and managing structured data, making it an essential tool for data engineers. MongoDB MongoDB is a NoSQL database that stores data in flexible, JSON-like documents. Apache Spark Apache Spark is a powerful data processing framework that efficiently handles Big Data.
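As a small illustration of MongoDB's document model, here is a hedged pymongo sketch; the URI, database, and collection names are placeholders.

```python
# A minimal pymongo sketch; the URI, database, and collection are placeholders.
from pymongo import MongoClient

client = MongoClient("mongodb://localhost:27017")
db = client["analytics"]
# Documents are flexible JSON-like records; fields can vary per document.
db.events.insert_one({"user_id": 42, "action": "login", "tags": ["web"]})
print(db.events.find_one({"user_id": 42}))
```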