Azure, Data Engineering and Data Warehouse

Data Warehouse in Azure SQL

Analytics Vidhya

SEPTEMBER 28, 2022

This article was published as a part of the Data Science Blogathon. Introduction to Data Warehouse SQL Data Warehouse is also a cloud-based data warehouse that uses Massively Parallel Processing (MPP) to run complex queries across petabytes of data rapidly. Import big […].

Data Warehouse

Data Warehouse Azure SQL Big Data

How a Delta Lake is Process with Azure Synapse Analytics

Analytics Vidhya

JULY 29, 2022

Introduction We are all pretty much familiar with the common modern cloud data warehouse model, which essentially provides a platform comprising a data lake (based on a cloud storage account such as Azure Data Lake Storage Gen2) AND a data warehouse compute engine […].

Azure

Azure Data Warehouse Data Lakes Analytics

Why using Infrastructure as Code for developing Cloud-based Data Warehouse Systems?

Data Science Blog

SEPTEMBER 19, 2023

In the contemporary age of Big Data, Data Warehouse Systems and Data Science Analytics Infrastructures have become an essential component for organizations to store, analyze, and make data-driven decisions. So why using IaC for Cloud Data Infrastructures?

Data Warehouse

Data Warehouse Azure SQL Database

Webinars

How to Achieve High-Accuracy Results When Using LLMs

Maximizing Profit and Productivity: The New Era of AI-Powered Accounting

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Understanding the Basics of Data Warehouse and its Structure

Analytics Vidhya

FEBRUARY 21, 2023

This is where data warehousing is a critical component of any business, allowing companies to store and manage vast amounts of data. It provides the necessary foundation for businesses to […] The post Understanding the Basics of Data Warehouse and its Structure appeared first on Analytics Vidhya.

Data Warehouse

Data Warehouse Analytics Analytics Azure

Exploring Azure Cosmos DB and Its Capabilities for Data Migration

Analytics Vidhya

MAY 10, 2022

This article was published as a part of the Data Science Blogathon. Data Engineers, I am sure this simple article will help you guys better understand Cosmos DB from Azure with nice features. Recently many customers have been looking forward to implementing the Data Migration into Cosmos DB.

Azure

Azure Data Engineering Data Engineer Data Engineering

Most Frequently Asked Azure Data Factory Interview Questions

Analytics Vidhya

FEBRUARY 20, 2023

Introduction Azure data factory (ADF) is a cloud-based data ingestion and ETL (Extract, Transform, Load) tool. The data-driven workflow in ADF orchestrates and automates data movement and data transformation.

Azure

Azure ETL Analytics Analytics

CI/CD for Data Pipelines: A Game-Changer with AnalyticsCreator

Data Science Blog

MAY 20, 2024

Continuous Integration and Continuous Delivery (CI/CD) for Data Pipelines: It is a Game-Changer with AnalyticsCreator! The need for efficient and reliable data pipelines is paramount in data science and data engineering. It offers full BI-Stack Automation, from source to data warehouse through to frontend.

Data Pipeline

Data Pipeline Data Warehouse Azure Data Lakes

Exploring the Power of Microsoft Fabric: A Hands-On Guide with a Sales Use Case

Data Science Dojo

SEPTEMBER 11, 2024

With this full-fledged solution, you don’t have to spend all your time and effort combining different services or duplicating data. OneLake, being built on Azure Data Lake Storage (ADLS), supports various data formats, including Delta, Parquet, CSV, and JSON. On the home page, select Synapse Data Engineering.

Power BI

Power BI Data Pipeline Data Warehouse Data Engineering

Becoming a Data Engineer: 7 Tips to Take Your Career to the Next Level

Data Science Connect

JANUARY 27, 2023

Data engineering is a crucial field that plays a vital role in the data pipeline of any organization. It is the process of collecting, storing, managing, and analyzing large amounts of data, and data engineers are responsible for designing and implementing the systems and infrastructure that make this possible.

Data Engineering

Data Engineering Data Engineer Data Engineering Data Engineering

Delta Lake: A Comprehensive Introduction

Analytics Vidhya

JANUARY 2, 2023

Introduction Delta Lake is an open-source storage layer that brings data lakes to the world of Apache Spark. Delta Lakes provides an ACID transaction–compliant and cloud–native platform on top of cloud object stores such as Amazon S3, Microsoft Azure Storage, and Google Cloud Storage.

Data Lakes

Data Lakes Azure Analytics Analytics

Was ist ein Data Lakehouse?

Data Science Blog

MAY 15, 2023

tl;dr Ein Data Lakehouse ist eine moderne Datenarchitektur, die die Vorteile eines Data Lake und eines Data Warehouse kombiniert. Organisationen können je nach ihren spezifischen Bedürfnissen und Anforderungen zwischen einem Data Warehouse und einem Data Lakehouse wählen.

Data Warehouse

Data Warehouse Data Lakes Azure AWS

Azure Data Engineer Jobs

Pickl AI

APRIL 6, 2023

Accordingly, one of the most demanding roles is that of Azure Data Engineer Jobs that you might be interested in. The following blog will help you know about the Azure Data Engineering Job Description, salary, and certification course. How to Become an Azure Data Engineer?

Azure

Azure Data Engineering Data Engineer Data Engineering

CI/CD für Datenpipelines – Ein Game-Changer mit AnalyticsCreator

Data Science Blog

JULY 20, 2024

Die Bedeutung effizienter und zuverlässiger Datenpipelines in den Bereichen Data Science und Data Engineering ist enorm. Es bietet vollständige Automatisierung des BI-Stacks und unterstützt ein breites Spektrum an Data Warehouses, analytischen Datenbanken und Frontends.

Azure

Azure SQL Power BI Data Lakes

On-Prem vs. The Cloud: Key Considerations

phData

FEBRUARY 21, 2025

In this post, we will be particularly interested in the impact that cloud computing left on the modern data warehouse. We will explore the different options for data warehousing and how you can leverage this information to make the right decisions for your organization. Understanding the Basics What is a Data Warehouse?

Data Warehouse

Data Warehouse Cloud Data ETL Cloud Computing

Process Mining – Ist Celonis wirklich so gut? Ein Praxisbericht.

Data Science Blog

SEPTEMBER 3, 2024

auf den Analyse-Ressourcen der Microsoft Azure Cloud oder in auf der databricks-Plattform. Alternativ zu Databricks können auch andere Data Warehouse Datenbankplattformen zur Anwendung kommen, beispielsweise auch snowflake mit dbt. Umgesetzt werden diese Anwendungsfälle bisher vor allem auf dritten Plattformen, wie z.

Data Science

Data Science Power BI Azure Data Warehouse

Sneak peek at Microsoft Fabric price and its promising features

Dataconomy

JUNE 1, 2023

By automating the integration of all Fabric workloads into OneLake, Microsoft eliminates the need for developers, analysts, and business users to create their own data silos. This approach not only improves performance by eliminating the need for separate data warehouses but also results in substantial cost savings for customers.

Power BI

Power BI Data Lakes Azure Data Silos

Discover the Most Important Fundamentals of Data Engineering

Pickl AI

NOVEMBER 4, 2024

Summary: The fundamentals of Data Engineering encompass essential practices like data modelling, warehousing, pipelines, and integration. Understanding these concepts enables professionals to build robust systems that facilitate effective data management and insightful analysis. What is Data Engineering?

Data Engineering

Data Engineering Data Engineer Data Engineering Data Engineering

ETL Pipelines With Python Azure Functions

Mlearning.ai

JULY 8, 2023

One of them is Azure functions. In this article we’re going to check what is an Azure function and how we can employ it to create a basic extract, transform and load (ETL) pipeline with minimal code. A batch ETL works under a predefined schedule in which the data are processed at specific points in time.

ETL

ETL Azure Python Internet of Things

Your Complete Roadmap to Become an Azure Data Scientist

Pickl AI

SEPTEMBER 5, 2024

Summary: This blog provides a comprehensive roadmap for aspiring Azure Data Scientists, outlining the essential skills, certifications, and steps to build a successful career in Data Science using Microsoft Azure. What is Azure?

Azure

Azure Data Scientist Data Science Machine Learning

How to Shift from Data Science to Data Engineering

ODSC - Open Data Science

JANUARY 18, 2024

Data engineering is a rapidly growing field, and there is a high demand for skilled data engineers. If you are a data scientist, you may be wondering if you can transition into data engineering. In this blog post, we will discuss how you can become a data engineer if you are a data scientist.

Data Engineering

Data Engineering Data Engineer Data Engineering Data Engineering

Why Open Table Format Architecture is Essential for Modern Data Systems

phData

NOVEMBER 8, 2024

Data Versioning and Time Travel Open Table Formats empower users with time travel capabilities, allowing them to access previous dataset versions. Versioning also ensures a safer experimentation environment, where data scientists can test new models or hypotheses on historical data snapshots without impacting live data.

Data Lakes

Data Lakes Data Warehouse Database Azure

11 Open-Source Data Engineering Tools Every Pro Should Use

ODSC - Open Data Science

FEBRUARY 6, 2024

Data engineering has become an integral part of the modern tech landscape, driving advancements and efficiencies across industries. So let’s explore the world of open-source tools for data engineers, shedding light on how these resources are shaping the future of data handling, processing, and visualization.

Data Engineering

Data Engineering Data Engineer Data Engineering Data Engineering

The Data Dilemma: Exploring the Key Differences Between Data Science and Data Engineering

Pickl AI

JULY 25, 2023

Unfolding the difference between data engineer, data scientist, and data analyst. Data engineers are essential professionals responsible for designing, constructing, and maintaining an organization’s data infrastructure. Read more to know.

Data Engineering

Data Engineering Data Engineer Data Engineering Data Engineering

How to Optimize Power BI and Snowflake for Advanced Analytics

phData

MAY 25, 2023

In a perfect world, Microsoft would have clients push even more storage and compute to its Azure Synapse platform. One of the easiest ways for Snowflake to achieve this is to have analytics solutions query their data warehouse in real-time (also known as DirectQuery).

Power BI

Power BI Analytics Analytics Azure

What Are The Best Third-Party Data Ingestion Tools For Snowflake?

phData

FEBRUARY 14, 2023

Data integration is essentially the Extract and Load portion of the Extract, Load, and Transform (ELT) process. Data ingestion involves connecting your data sources, including databases, flat files, streaming data, etc, to your data warehouse. Snowflake provides native ways for data ingestion.

Data Warehouse

Data Warehouse Azure AWS Database

8 Data Lake Vendors to Make Your Data Life Easier in 2023

ODSC - Open Data Science

JUNE 7, 2023

Data has to be stored somewhere. Data warehouses are repositories for your cleaned, processed data, but what about all that unstructured data your organization is starting to notice? What is a data lake? So let’s take a look at a few of the leading industry examples of data lakes. Where does it go?

Data Lakes

Data Lakes Azure Data Warehouse Hadoop

Discover the Snowflake Architecture With All its Pros and Cons- NIX United

Mlearning.ai

FEBRUARY 16, 2023

The ultimate need for vast storage spaces manifests in data warehouses: specialized systems that aggregate data coming from numerous sources for centralized management and consistency. In this article, you’ll discover what a Snowflake data warehouse is, its pros and cons, and how to employ it efficiently.

Data Warehouse

Data Warehouse Business Intelligence Business Intelligence Database

Best Practices When Developing Matillion Jobs

phData

SEPTEMBER 2, 2024

Best practices are a pivotal part of any software development, and data engineering is no exception. This ensures the data pipelines we create are robust, durable, and secure, providing the desired data to the organization effectively and consistently. Below are the best practices.

ETL

ETL Data Warehouse SQL Database

Maximising Efficiency with ETL Data: Future Trends and Best Practices

Pickl AI

OCTOBER 17, 2024

Introduction ETL plays a crucial role in Data Management. This process enables organisations to gather data from various sources, transform it into a usable format, and load it into data warehouses or databases for analysis. Loading The transformed data is loaded into the target destination, such as a data warehouse.

ETL

ETL Data Warehouse Data Quality Data Governance

Setting up Data Lake on GCP using Cloud Storage and BigQuery

Analytics Vidhya

FEBRUARY 25, 2023

Introduction A data lake is a centralized and scalable repository storing structured and unstructured data. The need for a data lake arises from the growing volume, variety, and velocity of data companies need to manage and analyze.

Data Lakes

Data Lakes Analytics Analytics Data Warehouse

What Are the Best Data Modeling Methodologies & Processes for My Data Lake?

phData

SEPTEMBER 19, 2023

Thankfully, there are tools available to help with metadata management, such as AWS Glue, Azure Data Catalog, or Alation, that can automate much of the process. What are the Best Data Modeling Methodologies and Processes? Data lakes are meant to be flexible for new incoming data, whether structured or unstructured.

Data Lakes

Data Lakes Data Modeling Data Models Data Warehouse

Top ETL Tools: Unveiling the Best Solutions for Data Integration

Pickl AI

JUNE 7, 2024

Also Read: Top 10 Data Science tools for 2024. It is a process for moving and managing data from various sources to a central data warehouse. This process ensures that data is accurate, consistent, and usable for analysis and reporting. This process helps organisations manage large volumes of data efficiently.

ETL

ETL Data Quality Data Pipeline Data Warehouse

Booths and Demos Coming to the ODSC West 2024 Expo Hall

ODSC - Open Data Science

OCTOBER 7, 2024

There you’ll hear from Ivan Nardini, Developer Relations Engineer at Google Cloud and discover the latest advancements in AI and learn how to leverage Google Cloud’s powerful tools and infrastructure to drive innovation in your organization.

Data Scientist

Data Scientist Data Science Machine Learning Machine Learning

Getting Started With Matillion Data Productivity Cloud

phData

NOVEMBER 28, 2023

Matillion is also built for scalability and future data demands, with support for cloud data platforms such as Snowflake Data Cloud , Databricks, Amazon Redshift, Microsoft Azure Synapse, and Google BigQuery, making it future-ready, everyone-ready, and AI-ready. That process will not take longer than 3 minutes!

Data Warehouse

Data Warehouse Data Pipeline ETL Azure

Considerations and Approaches to Loading Reference Data into Snowflake

phData

AUGUST 9, 2024

Typically, this data is scattered across Excel files on business users’ desktops. Cloud Storage Upload Snowflake can easily upload files from cloud storage (AWS S3, Azure Storage, GCP Cloud Storage). Hosting the data in Snowflake’s hybrid tables may be a better choice for these applications.

ETL

ETL Data Warehouse Data Governance Tableau

How to Build ETL Data Pipeline in ML

The MLOps Blog

MAY 17, 2023

This article explores the importance of ETL pipelines in machine learning, a hands-on example of building ETL pipelines with a popular tool, and suggests the best ways for data engineers to enhance and sustain their pipelines. It comprises three main areas: Landing area, Staging area, and Data Warehouse area.

ETL

ETL Data Pipeline ML ML

When To Use Internal vs. External Stages in Snowflake

phData

AUGUST 4, 2023

The external stage area includes Microsoft Azure Blob storage, Amazon AWS S3, and Google Cloud Storage. Amazon S3 for AWS, Azure Blob Storage for Azure, or Google Cloud Storage for GCP) to store the actual data files in micro-partitions. They are flexible, secure, and provide exceptional performance.

Database

Database Azure SQL AWS

How to Build a Data Mesh in Snowflake

phData

SEPTEMBER 20, 2023

A data mesh is a conceptual architectural approach for managing data in large organizations. Traditional data management approaches often involve centralizing data in a data warehouse or data lake, leading to challenges like data silos, data ownership issues, and data access and processing bottlenecks.

Data Silos

Data Silos Database Data Quality Data Engineering

How to Version Control Data in ML for Various Data Sources

The MLOps Blog

JANUARY 23, 2023

However, there are some key differences that we need to consider: Size and complexity of the data In machine learning, we are often working with much larger data. Basically, every machine learning project needs data. Given the range of tools and data types, a separate data versioning logic will be necessary.

ML

ML ML Data Lakes Machine Learning

Data Mesh Architecture and the Data Catalog

Alation

FEBRUARY 8, 2022

While data fabric takes a product-and-tech-centric approach, data mesh takes a completely different perspective. Data mesh inverts the common model of having a centralized team (such as a data engineering team), who manage and transform data for wider consumption. But why is such an inversion needed?

Data Governance

Data Governance Data Engineering Data Engineering Data Engineer

The Evolution of Customer Data Modeling: From Static Profiles to Dynamic Customer 360

phData

SEPTEMBER 27, 2024

Here’s how a composable CDP might incorporate the modeling approaches we’ve discussed: Data Storage and Processing : This is your foundation. You might choose a cloud data warehouse like the Snowflake AI Data Cloud or BigQuery. Building a composable CDP requires some serious data engineering chops.

Data Modeling

Data Modeling Data Models Apache Kafka Data Lakes

Top 10 Python Scripts for use in Matillion for Snowflake

phData

OCTOBER 28, 2024

Modern low-code/no-code ETL tools allow data engineers and analysts to build pipelines seamlessly using a drag-and-drop and configure approach with minimal coding. Matillion ETL for Snowflake is an ELT/ETL tool that allows for the ingestion, transformation, and building of analytics for data in the Snowflake AI Data Cloud.

Python

Python ETL AWS Database

The Ultimate Modern Data Stack Migration Guide

phData

JULY 18, 2023

With the birth of cloud data warehouses, data applications, and generative AI , processing large volumes of data faster and cheaper is more approachable and desired than ever. First up, let’s dive into the foundation of every Modern Data Stack, a cloud-based data warehouse.

Data Warehouse

Data Warehouse Analytics Analytics SQL

Best Data Engineering Tools Every Engineer Should Know

Pickl AI

MARCH 19, 2025

Summary: Data engineering tools streamline data collection, storage, and processing. Tools like Python, SQL, Apache Spark, and Snowflake help engineers automate workflows and improve efficiency. Learning these tools is crucial for building scalable data pipelines. Thats where data engineering tools come in!

Data Engineering

Data Engineering Data Engineer Data Engineering Data Engineering

Data Warehouse in Azure SQL

How a Delta Lake is Process with Azure Synapse Analytics

Webinars

Trending Sources

Why using Infrastructure as Code for developing Cloud-based Data Warehouse Systems?

Webinars

Understanding the Basics of Data Warehouse and its Structure

Exploring Azure Cosmos DB and Its Capabilities for Data Migration

Most Frequently Asked Azure Data Factory Interview Questions

CI/CD for Data Pipelines: A Game-Changer with AnalyticsCreator

Exploring the Power of Microsoft Fabric: A Hands-On Guide with a Sales Use Case

Becoming a Data Engineer: 7 Tips to Take Your Career to the Next Level

Delta Lake: A Comprehensive Introduction

Was ist ein Data Lakehouse?

Azure Data Engineer Jobs

CI/CD für Datenpipelines – Ein Game-Changer mit AnalyticsCreator

On-Prem vs. The Cloud: Key Considerations

Process Mining – Ist Celonis wirklich so gut? Ein Praxisbericht.

Sneak peek at Microsoft Fabric price and its promising features

Discover the Most Important Fundamentals of Data Engineering

ETL Pipelines With Python Azure Functions

Your Complete Roadmap to Become an Azure Data Scientist

How to Shift from Data Science to Data Engineering

Why Open Table Format Architecture is Essential for Modern Data Systems

11 Open-Source Data Engineering Tools Every Pro Should Use

The Data Dilemma: Exploring the Key Differences Between Data Science and Data Engineering

How to Optimize Power BI and Snowflake for Advanced Analytics

What Are The Best Third-Party Data Ingestion Tools For Snowflake?

8 Data Lake Vendors to Make Your Data Life Easier in 2023

Discover the Snowflake Architecture With All its Pros and Cons- NIX United

Best Practices When Developing Matillion Jobs

Maximising Efficiency with ETL Data: Future Trends and Best Practices

Setting up Data Lake on GCP using Cloud Storage and BigQuery

What Are the Best Data Modeling Methodologies & Processes for My Data Lake?

Top ETL Tools: Unveiling the Best Solutions for Data Integration

Booths and Demos Coming to the ODSC West 2024 Expo Hall

Getting Started With Matillion Data Productivity Cloud

Considerations and Approaches to Loading Reference Data into Snowflake

How to Build ETL Data Pipeline in ML

When To Use Internal vs. External Stages in Snowflake

How to Build a Data Mesh in Snowflake

How to Version Control Data in ML for Various Data Sources

Data Mesh Architecture and the Data Catalog

The Evolution of Customer Data Modeling: From Static Profiles to Dynamic Customer 360

Top 10 Python Scripts for use in Matillion for Snowflake

The Ultimate Modern Data Stack Migration Guide

Best Data Engineering Tools Every Engineer Should Know

Stay Connected