Data is one of the most significant components of a machine learning (ML) workflow, which makes data management one of the most important factors in sustaining ML pipelines.
It offers full BI-stack automation, from source to data warehouse through to frontend. It supports a holistic data model, allowing for rapid prototyping of various models, and it works with a wide range of data warehouses, analytical databases, data lakes, frontends, and pipelines/ETL, taking a mixed approach to Data Vault 2.0.
Welcome to the world of databases, where the choice between SQL (Structured Query Language) and NoSQL (Not Only SQL) databases can be a significant decision. In this blog, we’ll explore the defining traits, benefits, use cases, and key factors to consider when choosing between SQL and NoSQL databases.
In today’s digital world, businesses must make data-driven decisions to manage huge sets of information, so databases are important for strategic data handling and enhanced operational efficiency. This blog delves into a detailed comparison between the two data management techniques.
New big data architectures and, above all, data sharing concepts such as Data Mesh are ideal for creating a common database for many data products and applications. The Event Log Data Model for Process Mining: process mining as an analytical system can very well be imagined as an iceberg.
So why use IaC for cloud data infrastructures? It ensures that the data models and queries developed by data professionals are consistent with the underlying infrastructure. Enhanced Security and Compliance: data warehouses often store sensitive information, making security a paramount concern.
In this blog post, we will be discussing 7 tips that will help you become a successful data engineer and take your career to the next level. Learn SQL: As a data engineer, you will be working with large amounts of data, and SQL is the most commonly used language for interacting with databases.
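To make that tip concrete, here is a minimal sketch of the kind of aggregation query data engineers write every day (the orders table and its columns are hypothetical, and date arithmetic varies by SQL dialect; the form shown is PostgreSQL-style):

-- Count orders and total revenue per customer over the last 30 days
SELECT customer_id,
       COUNT(*) AS order_count,
       SUM(order_total) AS revenue
FROM orders
WHERE order_date >= CURRENT_DATE - INTERVAL '30 days'
GROUP BY customer_id
ORDER BY revenue DESC;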
Reading Larry Burns’ “Data Model Storytelling” (TechnicsPub.com, 2021) was a really good experience for a guy like me (i.e., someone who thinks that data models are narratives). The post Tales of Data Modelers appeared first on DATAVERSITY.
Common wisdom has it that we humans can only focus on three things at a time. So, I had to cut down my January 2021 list of things of importance in data modeling in this new, fine year (I hope)! The post 2021: Three Game-Changing Data Modeling Perspectives appeared first on DATAVERSITY.
Data is an essential component of any business, and it is the role of a data analyst to make sense of it all. Power BI is a powerful data visualization tool that helps them turn raw data into meaningful insights and actionable decisions. Check out this course and learn Power BI today!
Tabular data is data in a typical table: some well-structured columns and rows, as in Excel or SQL data. It is the most common data format in many use cases. With the power of LLMs, we will learn how to explore the data and perform data modeling.
Data is driving most business decisions, and data modeling tools play a crucial role in developing and maintaining information systems. Data modeling involves creating a conceptual representation of data and its relationships, and the right tools play a significant role in this.
Sigma Computing , a cloud-based analytics platform, helps data analysts and business professionals maximize their data with collaborative and scalable analytics. One of Sigma’s key features is its support for custom SQL queries and CSV file uploads. These tools allow users to handle more advanced data tasks and analyses.
However, to fully harness the potential of a data lake, effective data modeling methodologies and processes are crucial. Data modeling plays a pivotal role in defining the structure, relationships, and semantics of data within a data lake, and in keeping data consistent throughout the lake.
I’m not going to go into huge detail on this, since you likely follow AI/LLM news if you are reading this, but in a nutshell, RAG is the process whereby you feed external data into an LLM alongside prompts to ensure it has all of the information it needs to make decisions. What is GraphRAG? Why use graphs, and what are they?
Data exploration and model development were conducted using well-known machine learning (ML) tools such as Jupyter or Apache Zeppelin notebooks. Apache Hive was used to provide a tabular interface to data stored in HDFS and to integrate with Apache Spark SQL. HBase was employed to offer real-time key-based access to data.
What if you could automatically shard your PostgreSQL database across any number of servers and get industry-leading performance at scale without any special data modelling steps? In this blog post, you’ll get a high-level overview of schema-based sharding and other new Citus 12 features. What is schema-based sharding?
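As a rough sketch of what schema-based sharding looks like in Citus 12 (the tenant schemas here are invented for illustration; consult the Citus documentation for authoritative usage):

-- Make newly created schemas distributed (Citus 12+)
SET citus.enable_schema_based_sharding TO on;

-- Each tenant lives in its own schema; its tables are co-located as a group
CREATE SCHEMA tenant_a;
CREATE TABLE tenant_a.orders (id bigint PRIMARY KEY, total numeric);

-- An existing schema can be distributed explicitly
SELECT citus_schema_distribute('tenant_b');

Because sharding happens at the schema level, queries within a tenant keep their ordinary PostgreSQL form, which is what makes the approach attractive for multi-tenant applications.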
Summary: Business Intelligence Analysts transform raw data into actionable insights. They use tools and techniques to analyse data, create reports, and support strategic decisions. Key skills include SQL, data visualization, and business acumen. Introduction We are living in an era defined by data.
However, to harness the full potential of Snowflake’s performance capabilities, it is essential to adopt strategies tailored explicitly for data vault modeling. Because of data vault’s layered modeling structure, transformation queries for moving data between layers can become exceedingly complex.
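As a simplified illustration of such a transformation query, here is a minimal hub-load pattern on Snowflake (the table and column names are invented, and production Data Vault loaders typically carry more metadata and a stronger hash than MD5):

-- Insert only business keys not already present in the hub
INSERT INTO hub_customer (customer_hk, customer_bk, load_date, record_source)
SELECT DISTINCT
       MD5(stg.customer_id) AS customer_hk,
       stg.customer_id      AS customer_bk,
       CURRENT_TIMESTAMP    AS load_date,
       'CRM_EXPORT'         AS record_source
FROM stg_customers stg
LEFT JOIN hub_customer hub
       ON hub.customer_hk = MD5(stg.customer_id)
WHERE hub.customer_hk IS NULL;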
Sigma Computing is a powerful data modeling and analysis platform designed to leverage the power of modern cloud technology. Once connected to Snowflake, Sigma uses machine-generated SQL to produce the most optimal results. Check out this blog to master the fundamentals.
The new ISO 39075 Graph Query Language standard is to hit the data streets in late 2023 (?). If graph databases are standardized pretty soon, what will happen to SQL? They will very likely stay around for a long time, not simply because legacy SQL has a tremendous inertia, but because relational database paradigms […].
It is the process of converting raw data into relevant and practical knowledge to help evaluate the performance of businesses, discover trends, and make well-informed choices. Data gathering, data integration, data modelling, data analysis, and data visualization are all part of business intelligence.
The June 2021 release of Power BI Desktop introduced Custom SQL queries to Snowflake in DirectQuery mode, further enhancing the connection capabilities between the two platforms.
In this blog, our focus will be on exploring the data lifecycle along with several Design Patterns, delving into their benefits and constraints. Data architects can leverage these patterns as starting points or reference models when designing and implementing data vault architectures.
Best 8 data version control tools for 2023 (source: DagsHub). With business needs changing constantly and the growing size and structure of datasets, it becomes challenging to efficiently keep track of the changes made to the data, which leads to unfortunate scenarios such as inconsistencies and errors in data.
This blog explores PostgreSQL vs MySQL, two popular RDBMS solutions, highlighting their differences to help you choose the right one for your needs. Both are open-source and use Structured Query Language (SQL) to manage and manipulate data. PostgreSQL’s architecture is highly flexible, supporting many data models and workloads.
And you should have experience working with big data platforms such as Hadoop or Apache Spark. Additionally, data science requires experience in SQL database coding and an ability to work with unstructured data of various types, such as video, audio, pictures and text.
Power BI Datamarts provides a low/no code experience directly within Power BI Service that allows developers to ingest data from disparate sources, perform ETL tasks with Power Query, and load data into a fully managed Azure SQL database. Note: At the time of writing this blog, Power BI Datamarts is in preview.
Accordingly, one of the most in-demand roles is that of the Azure Data Engineer. The following blog will help you learn about the Azure Data Engineer job description, salary, and certification course. Experience with at least one end-to-end Azure data lake project is expected.
Data engineers are essential professionals responsible for designing, constructing, and maintaining an organization’s data infrastructure. They create data pipelines, ETL processes, and databases to facilitate smooth data flow and storage. Data Visualization: Matplotlib, Seaborn, Tableau, etc.
Each of these has a specific role in collecting, storing, processing, and analyzing data. This blog will home in on the new collaboration, how to implement it in your workbooks, and why Sigma users should be excited about the feature: using SQL-centric transformations to model data for deployment.
In this blog, we will cover what tables and pivot tables are, the advantages and limitations of each, and the factors to consider when choosing which element to use. At the end of this blog, you will have a firm understanding of both elements and how to utilize each in your day-to-day data exploration.
To address these complexities, a powerful data warehousing solution like the Snowflake Data Cloud, coupled with an effective data modeling approach such as the Data Vault architecture, can be a winning combination. What is a Data Vault Architecture? Contact phData!
The answer probably depends more on the complexity of your queries than the connectedness of your data. Relational databases (with recursive SQL queries), document stores, key-value stores, and the like can all work with connected data to some degree. Multi-model databases combine graphs with two other NoSQL data models: document and key-value stores.
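To illustrate the recursive-SQL point: a relational database can walk connected data with a recursive common table expression, although deep or unbounded traversals are where dedicated graph databases tend to pull ahead. A sketch over a hypothetical employee-manager hierarchy (PostgreSQL-style syntax):

WITH RECURSIVE reports AS (
    -- Anchor: start from one manager
    SELECT employee_id, manager_id, 1 AS depth
    FROM employees
    WHERE employee_id = 42
    UNION ALL
    -- Recursive step: follow the management edge one level per iteration
    SELECT e.employee_id, e.manager_id, r.depth + 1
    FROM employees e
    JOIN reports r ON e.manager_id = r.employee_id
)
SELECT employee_id, depth FROM reports;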
Summary: This blog delves into hierarchies in dimensional modelling, highlighting their significance in data organisation and analysis. Real-world examples illustrate their application, while tools and technologies facilitate effective hierarchical data management in various industries.
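A common way to represent such a hierarchy is to flatten its levels into columns of a single dimension table, as in this hypothetical geography example (a snowflaked design would instead split each level into its own table):

CREATE TABLE dim_store (
    store_key  INTEGER PRIMARY KEY,  -- surrogate key
    store_name VARCHAR(100),
    city       VARCHAR(100),         -- hierarchy levels: city -> region -> country
    region     VARCHAR(100),
    country    VARCHAR(100)
);

Rolling a measure up the hierarchy is then just a GROUP BY on a higher-level column.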
This is the last of a four-part blog series. In the previous blog, we discussed how Alation provides a platform for data scientists and analysts to complete projects and analysis at speed. In this blog, we will discuss how Alation helps minimize risk with active data governance.
Both databases are designed to handle large volumes of data, but they cater to different use cases and exhibit distinct architectural designs. Here’s a detailed comparison of their key differences. Key features of Apache Cassandra include scalability: Cassandra can scale horizontally by adding more servers to accommodate growing data needs.
It uses advanced tools to look at raw data, gather a data set, process it, and develop insights to create meaning. Areas making up the data science field include data mining, statistics, data analytics, data modeling, machine learning modeling, and programming. The post appeared first on IBM Blog.
With organisations prioritising efficiency, sustainability, and data-backed decision-making, Operations Analysts now play a pivotal role in streamlining processes and optimising performance. They bridge the gap between data insights and actionable strategies, ensuring businesses stay competitive.
In data modeling, dbt has gradually emerged as a powerful tool that largely simplifies the process of building and handling data pipelines. dbt is an open-source command-line tool that allows data engineers to transform, test, and document data in one single hub, following the best practices of software engineering.
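For readers new to dbt, a model is simply a SELECT statement in a .sql file; dbt compiles the Jinja references and materializes the result as a table or view. A minimal, purely illustrative staging model (the source and column names are made up, and the source() call assumes a configured sources file):

-- models/stg_orders.sql
SELECT
    order_id,
    customer_id,
    order_total,
    order_date
FROM {{ source('shop', 'raw_orders') }}
WHERE order_total IS NOT NULL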
This blog will walk through how to build your own feature platform with batch feature engineering pipelines using Airflow and dbt on the Snowflake Data Cloud. If that leaves you looking for more background context, you can jump to our blog called What is a Feature Store?
Getting your data into Snowflake, creating analytics applications from the data, and even ensuring your Snowflake account runs smoothly all require some sort of tool. In this blog, we’ll review some of the best free tools for use with Snowflake Data Cloud , what they can do for you, and how to use them without breaking the bank.
In this blog, we will discuss dbt packages, when you should use a package, and how to use them in your project. With packages, you can generate SQL code to union two relations, create surrogate keys, or pivot columns. You can also transform Facebook Ads or AdWords spend data into a consistent format and keep the data segregated.
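For instance, with the widely used dbt_utils package, unioning two relations and building a surrogate key become one-liners (the model and column names below are hypothetical):

-- A model that unions two relations with matching columns
{{ dbt_utils.union_relations(relations=[ref('orders_us'), ref('orders_eu')]) }}

-- A surrogate key built from two columns inside a SELECT
SELECT
    {{ dbt_utils.generate_surrogate_key(['customer_id', 'order_date']) }} AS order_key,
    *
FROM {{ ref('orders_us') }}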
The Snowflake Data Cloud has introduced a groundbreaking feature that promises to simplify and supercharge this process: Snowflake Dynamic Tables. These dynamic tables are not just another table type; they represent a game-changing approach to data pipeline development and management. What are Snowflake Dynamic Tables?
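For a flavor of the feature, a dynamic table is declared with a freshness target and a defining query, and Snowflake keeps it refreshed automatically (the table, warehouse, and column names below are invented for illustration):

CREATE OR REPLACE DYNAMIC TABLE daily_revenue
  TARGET_LAG = '5 minutes'
  WAREHOUSE = transform_wh
AS
SELECT order_date, SUM(order_total) AS revenue
FROM raw_orders
GROUP BY order_date;

Instead of hand-building streams and tasks, you state how fresh the data must be (TARGET_LAG) and Snowflake schedules the incremental refreshes.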