Blog, Data Models and ETL - Data Science Current

CI/CD for Data Pipelines: A Game-Changer with AnalyticsCreator

Data Science Blog

MAY 20, 2024

It offers full BI-Stack Automation, from source to data warehouse through to frontend. It supports a holistic data model, allowing for rapid prototyping of various models. It also supports a wide range of data warehouses, analytical databases, data lakes, frontends, and pipelines/ETL.

Data Pipeline

Data Pipeline Data Warehouse Azure Data Lakes

Why using Infrastructure as Code for developing Cloud-based Data Warehouse Systems?

Data Science Blog

SEPTEMBER 19, 2023

So why use IaC for Cloud Data Infrastructures? This ensures that the data models and queries developed by data professionals are consistent with the underlying infrastructure. Enhanced Security and Compliance: Data warehouses often store sensitive information, making security a paramount concern.

Data Warehouse

Data Warehouse Azure SQL Database

Rethinking Extract Transform Load (ETL) Designs

Dataversity

MARCH 29, 2021

Have you ever been in a situation when you had to represent the ETL team by being up late for L3 support only to find out that one of your […]. The post Rethinking Extract Transform Load (ETL) Designs appeared first on DATAVERSITY.

ETL

ETL Database Data Models Data Modeling

Top ETL Tools: Unveiling the Best Solutions for Data Integration

Pickl AI

JUNE 7, 2024

Summary: Choosing the right ETL tool is crucial for seamless data integration. Top contenders like Apache Airflow and AWS Glue offer unique features, empowering businesses with efficient workflows, high data quality, and informed decision-making capabilities. Choosing the right ETL tool is crucial for smooth data management.

ETL

ETL Data Quality Data Pipeline Data Warehouse

Optimizing Snowflake’s Performance for Data Vault Modeling

phData

OCTOBER 9, 2023

However, to harness the full potential of Snowflake’s performance capabilities, it is essential to adopt strategies tailored explicitly for data vault modeling. Of all key types, hash keys provide the best data load performance, consistency, and auditability.

ETL

ETL Clustering Data Warehouse SQL

What Are the Best Data Modeling Methodologies & Processes for My Data Lake?

phData

SEPTEMBER 19, 2023

However, to fully harness the potential of a data lake, effective data modeling methodologies and processes are crucial. Data modeling plays a pivotal role in defining the structure, relationships, and semantics of data within a data lake. Consistency of data throughout the data lake.

Data Lakes

Data Lakes Data Models Data Modeling Data Warehouse

How Rocket Companies modernized their data science solution on AWS

AWS Machine Learning Blog

FEBRUARY 21, 2025

Apache Hive was used to provide a tabular interface to data stored in HDFS, and to integrate with Apache Spark SQL. Apache HBase was employed to offer real-time key-based access to data. Data is stored in HDFS and is accessed via Hive, which provides a tabular interface to the data and integrates with Spark SQL.

Data Science

Data Science AWS Hadoop Data Scientist

What is Data Integration in Data Mining with Example?

Pickl AI

JUNE 28, 2023

But this data is often stored in disparate systems and formats. Here comes the role of Data Mining. Read this blog to know more about Data Integration in Data Mining. The process encompasses various techniques that help filter useful data from the resource, thereby improving data quality and consistency.

Data Mining

Data Mining ETL

Data warehouse architecture

Dataconomy

OCTOBER 17, 2023

The sheer volume of data that companies are now gathering is incredible, and understanding how best to store and use this information to extract top performance can be incredibly overwhelming. But it’s always better to call data warehouse experts before making a big decision.

Data Warehouse

Data Warehouse Big Data ETL

Who is a BI Developer: Role, Responsibilities & Skills

Pickl AI

JULY 3, 2023

It is the process of converting raw data into relevant and practical knowledge to help evaluate the performance of businesses, discover trends, and make well-informed choices. Data gathering, data integration, data modelling, analysis of information, and data visualization are all part of business intelligence.

Business Intelligence

Business Intelligence SQL Data Visualization

Understanding Business Intelligence Architecture: Key Components

Pickl AI

JANUARY 28, 2025

Introduction: Business Intelligence (BI) architecture is a crucial framework that organizations use to collect, integrate, analyze, and present business data. This architecture serves as a blueprint for BI initiatives, ensuring that data-driven decision-making is efficient and effective. time, product) and facts (e.g.,

Business Intelligence

Business Intelligence ETL Data Lakes

How to Create a Power BI Dataflow with Snowflake Data

phData

DECEMBER 2, 2024

In this blog, we will explain dataflows and their use cases and show an example of how to bring data from Snowflake AI Data Cloud into a dataflow. Most Power BI developers are familiar with Power Query, which is the data transformation layer of Power BI. What are Dataflows, and Why are They So Great?

Power BI

Power BI Data Modeling Data Models Data Visualization

Understanding Zero-Code Development Life Cycle in Matillion

phData

MAY 11, 2023

With the “Data Productivity Cloud” launch, Matillion has achieved a balance of simplifying source control, collaboration, and dataops by elevating Git integration to a “first-class citizen” within the framework. In Matillion ETL, the Git integration enables an organization to connect to any Git offering (e.g.,

ETL

ETL Analytics Data Modeling

How to Use Fivetran to Ingest Data for a Composable CDP (Customer Data Platform)

phData

JUNE 6, 2024

Marketing and business professionals must effectively manage and leverage their customer data to stay competitive. In this blog, we will explore how marketing professionals have approached the challenge of effectively using their vast amount of customer data using Composable CDPs.

Data Warehouse

Data Warehouse Cloud Data ETL Data Modeling

Hierarchies in Dimensional Modelling

Pickl AI

AUGUST 9, 2024

Summary: This blog delves into hierarchies in dimensional modelling, highlighting their significance in data organisation and analysis. Real-world examples illustrate their application, while tools and technologies facilitate effective hierarchical data management in various industries.

Data Warehouse

Data Warehouse Data Quality ETL Business Intelligence

How to Use Fivetran to Ingest Salesforce Data into Snowflake

phData

SEPTEMBER 25, 2024

With the importance of data in various applications, there’s a need for effective solutions to organize, manage, and transfer data between systems with minimal complexity. While numerous ETL tools are available on the market, selecting the right one can be challenging.

ETL

ETL Database Data Warehouse Analytics

The Data Dilemma: Exploring the Key Differences Between Data Science and Data Engineering

Pickl AI

JULY 25, 2023

Data engineers are essential professionals responsible for designing, constructing, and maintaining an organization’s data infrastructure. They create data pipelines, ETL processes, and databases to facilitate smooth data flow and storage. Data Warehousing: Amazon Redshift, Google BigQuery, etc.

Data Engineering

Data Engineering Data Engineer

Getting Started with AI in High-Risk Industries, How to Become a Data Engineer, and Query-Driven…

ODSC - Open Data Science

JANUARY 11, 2024

Getting Started with AI in High-Risk Industries, How to Become a Data Engineer, and Query-Driven Data Modeling. How To Get Started With Building AI in High-Risk Industries: This guide will get you started building AI in your organization with ease, axing unnecessary jargon and fluff, so you can start today.

Data Engineering

Data Engineering Data Engineer

A Comprehensive Guide to Business Intelligence Analysts

Pickl AI

MARCH 3, 2025

Business Intelligence Analysts are the skilled artisans who transform this raw data into valuable insights, empowering organizations to make strategic decisions and stay ahead of the curve. Key Takeaways BI Analysts convert data into actionable insights for strategic business decisions.

Business Intelligence

Business Intelligence Data Analyst Data Visualization

What Are Business Intelligence Tools

Pickl AI

JANUARY 15, 2025

Furthermore, a study indicated that 71% of organisations consider Data Analytics a critical factor for enhancing their business performance. This blog will explore what Business Intelligence tools are, their functionalities, real-world applications, and address common questions surrounding them.

Business Intelligence

Business Intelligence Power BI Data Visualization

Where Does Fivetran Fit into The Modern Data Stack?

phData

JULY 17, 2023

In order to fully leverage this vast quantity of collected data, companies need a robust and scalable data infrastructure to manage it. This is where Fivetran and the Modern Data Stack come in. The modern data stack is important because its suite of tools is designed to solve all of the core data challenges companies face.

Data Warehouse

Data Warehouse Data Pipeline Cloud Data ETL

Modernizing data science lifecycle management with AWS and Wipro

AWS Machine Learning Blog

JANUARY 5, 2024

Experiment notebooks. Purpose: The customer’s data science team wanted to experiment with various datasets and multiple models to come up with the optimal features, using those as further inputs to the automated pipeline. He holds the AWS AI/ML Specialty certification and authors technical blogs on AI/ML services and solutions.

AWS

AWS Data Science ML

dbt and Sigma Integration

phData

JUNE 27, 2023

All of which have a specific role used to collect, store, process, and analyze data. This blog will hone in on the new collaboration, how to implement it into your workbooks, and why Sigma users should be excited about the feature. dbt’s addition of data freshness, quality, and cataloging is just another example of Sigma’s vision.

SQL

SQL Database Data Quality Data Warehouse

How to Build a Power BI Datamart Using Snowflake Data

phData

JULY 11, 2023

Power BI Datamarts provides a low/no code experience directly within Power BI Service that allows developers to ingest data from disparate sources, perform ETL tasks with Power Query, and load data into a fully managed Azure SQL database. Note: At the time of writing this blog, Power BI Datamarts is in preview.

Power BI

Power BI SQL Azure ETL

Maximize the Power of dbt and Snowflake to Achieve Efficient and Scalable Data Vault Solutions

phData

AUGUST 10, 2023

In this blog, our focus will be on exploring the data lifecycle along with several Design Patterns, delving into their benefits and constraints. Data architects can leverage these patterns as starting points or reference models when designing and implementing data vault architectures.

SQL

SQL Data Observability Data Quality Data Pipeline

Data architecture strategy for data quality

IBM Journey to AI blog

JANUARY 5, 2023

The right data architecture can help your organization improve data quality because it provides the framework that determines how data is collected, transported, stored, secured, used and shared for business intelligence and data science use cases. Perform data quality monitoring based on pre-configured rules.

Data Quality

Data Quality Data Lakes Data Warehouse Big Data

How Alation’s Data Team Uses the Modern Data Stack to Power Insights

Alation

OCTOBER 27, 2022

We document these custom models in Alation Data Catalog and publish common queries that other teams can use for operational use cases or reporting needs. Contact title mappings, which are built into some of the data models, are documented within our data catalog. Jason: How do you use these models?

Data Analyst

Data Analyst Data Scientist Analytics

Comparing Tools For Data Processing Pipelines

The MLOps Blog

MARCH 15, 2023

If you ask data professionals what the most challenging part of their day-to-day work is, you will likely discover their concerns around managing different aspects of data before they get to graduate to the data modeling stage. Pricing: It is free to use and is licensed under Apache License Version 2.0.

Data Pipeline

Data Pipeline ETL SQL Data Quality

What Free Tools Pair Well With The Snowflake AI Data Cloud?

phData

OCTOBER 17, 2024

Getting your data into Snowflake, creating analytics applications from the data, and even ensuring your Snowflake account runs smoothly all require some sort of tool. In this blog, we’ll review some of the best free tools for use with Snowflake Data Cloud, what they can do for you, and how to use them without breaking the bank.

AI

AI SQL Data Quality

Azure Data Engineer Jobs

Pickl AI

APRIL 6, 2023

Accordingly, one of the most in-demand roles that you might be interested in is that of Azure Data Engineer. The following blog will help you know about the Azure Data Engineering Job Description, salary, and certification course. For Azure Data Engineer, there are various skills required.

Azure

Azure Data Engineering Data Engineer

Best Practices for Fact Tables in Dimensional Models

Pickl AI

AUGUST 11, 2024

Summary: This blog discusses best practices for designing effective fact tables in dimensional models. Additionally, it addresses common challenges and offers practical solutions to ensure that fact tables are structured for optimal data quality and analytical performance.

Data Quality

Data Quality Data Warehouse Data Governance Analytics

How and When to Use Dataflows in Power BI

phData

SEPTEMBER 28, 2023

Dataflows allow users to establish source connections and retrieve data, and subsequent data transformations can be conducted using the online Power Query Editor. In this blog, we will provide insights into the process of creating Dataflows and offer guidance on when to choose them to address real-world use cases effectively.

Power BI

Power BI Data Preparation Machine Learning

Apply fine-grained data access controls with AWS Lake Formation in Amazon SageMaker Data Wrangler

AWS Machine Learning Blog

AUGUST 21, 2023

The capabilities of Lake Formation simplify securing and managing distributed data lakes across multiple accounts through a centralized approach, providing fine-grained access control. Solution overview: We demonstrate this solution with an end-to-end use case using a sample dataset, the TPC data model.

AWS

AWS Data Lakes Clustering Data Preparation

How to Use Custom SQL and CSVs in Sigma Computing

phData

JULY 10, 2024

These tools allow users to handle more advanced data tasks and analyses. In this blog, we’ll explain why custom SQL and CSVs are important, demonstrate how to use these features in Sigma Computing, and provide some best practices to help you get started. Click on the Create New button located in the upper left-hand corner.

SQL

SQL Data Warehouse Analytics

Becoming a Prized Data Warehouse and Data Integration Tester

Dataversity

MARCH 1, 2021

Data warehouse (DW) testers with data integration QA skills are in demand. Data warehouse disciplines and architectures are well established and often discussed in the press, books, and conferences. Each business often uses one or more data […]. Click to learn more about author Wayne Yaddow.

Data Warehouse

Data Warehouse ETL Data Governance Data Quality

Your Essential Guide to MongoDB Interview Questions and Answers

Pickl AI

JULY 18, 2024

Read Blogs: Crucial Statistics Interview Questions for Data Science Success. MongoDB is a NoSQL database that handles large-scale data and modern application requirements. MongoDB is a NoSQL database that uses a document-oriented data model. Python Interview Questions And Answers. What is MongoDB?

Database

Database SQL Data Analyst Database Administration

Building an efficient MLOps platform with OSS tools on Amazon ECS with AWS Fargate

AWS Machine Learning Blog

SEPTEMBER 18, 2024

Hosted on Amazon ECS with tasks run on Fargate, this platform streamlines the end-to-end ML workflow, from data ingestion to model deployment. This blog post delves into the details of this MLOps platform, exploring how the integration of these tools facilitates a more efficient and scalable approach to managing ML projects.

AWS

AWS Machine Learning ML

Exploring the Power of Data Warehouse Functionality

Pickl AI

JUNE 11, 2024

But raw data alone isn’t enough to gain valuable insights. This is where data warehouses come in – powerful tools designed to transform raw data into actionable intelligence. This blog delves into the world of data warehouses, exploring their functionality, key features, and the latest innovations.

Data Warehouse

Data Warehouse ETL Data Mining

Why Snowflake is the Ideal Platform for Data Vault Modeling

phData

APRIL 20, 2023

In today’s world, data-driven applications demand more flexibility, scalability, and auditability, which traditional data warehouses and modeling approaches lack. This is where the Snowflake Data Cloud and data vault modeling come in handy. Again, the dbt Data Vault package automates a major portion of it.

Data Warehouse

Data Warehouse Data Governance Clustering Database

The Ultimate Modern Data Stack Migration Guide

phData

JULY 18, 2023

Slow Response to New Information: Legacy data systems often lack the computation power necessary to run efficiently and can be cost-inefficient to scale. This typically results in long-running ETL pipelines that cause decisions to be made on stale or old data.

Data Warehouse

Data Warehouse Analytics SQL

Best Data Engineering Tools Every Engineer Should Know

Pickl AI

MARCH 19, 2025

Data engineering is all about collecting, organising, and moving data so businesses can make better decisions. Handling massive amounts of data would be a nightmare without the right tools. In this blog, we’ll explore the best data engineering tools that make data work easier, faster, and more reliable.

Data Engineering

Data Engineering Data Engineer
