SQL (Structured Query Language) is an important tool for data scientists. It is a programming language used to manipulate data stored in relational databases. Mastering SQL concepts allows a data scientist to quickly analyze large amounts of data and make decisions based on their findings.
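To make that concrete, here is a minimal sketch of the kind of aggregation query a data scientist runs daily, using Python's built-in sqlite3 module; the "sales" table and its columns are hypothetical stand-ins for a real warehouse table.

```python
import sqlite3

# Build a tiny in-memory table to query; schema and rows are hypothetical.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE sales (region TEXT, amount REAL, sold_on TEXT);
    INSERT INTO sales VALUES
        ('EMEA', 1200.0, '2024-01-05'),
        ('EMEA',  800.0, '2024-02-11'),
        ('APAC', 2300.0, '2024-01-20');
""")

# Aggregate revenue per region: the bread-and-butter SQL pattern.
rows = conn.execute(
    "SELECT region, SUM(amount) AS total FROM sales "
    "GROUP BY region ORDER BY total DESC"
).fetchall()
for region, total in rows:
    print(region, total)
```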
Artificial intelligence (AI) is all the rage, and rightly so. The goal of this post is to understand how data integrity best practices have been embraced time and time again, no matter the underlying technology. There was no easy way to consolidate and analyze this data to more effectively manage our business.
Data is reported from one central repository, enabling management to draw more meaningful business insights and make faster, better decisions. By running reports on historical data, a data warehouse can clarify what systems and processes are working and what methods need improvement.
In this article, we will delve into the concept of data lakes, explore how they differ from data warehouses and relational databases, and discuss the significance of data version control in the context of large-scale data management. Before we address these questions: 'What is data version control?'
Businesses face significant hurdles when preparing data for artificial intelligence (AI) applications. The existence of data silos and duplication, alongside apprehensions regarding data quality, presents a multifaceted environment for organizations to manage.
Our guest on the GeekWire Podcast is business and tech leader Bob Muglia, a startup investor and advisor who played a pivotal role in Microsoft’s database and server products, and was CEO of data warehouse company Snowflake Computing.
Generative artificial intelligence is the talk of the town in the technology world today. Space and Time’s creator SxT Labs has created three technologies that underpin its verifiable compute layer: a blockchain indexer, a distributed data warehouse, and a zero-knowledge coprocessor.
Data fabrics are gaining momentum as the data management design for today’s challenging data ecosystems. At their most basic level, data fabrics leverage artificial intelligence and machine learning to unify and securely manage disparate data sources without migrating them to a centralized location.
To do so, Presto and Spark need to readily work with existing and modern data warehouse infrastructures. Now, let’s chat about why data warehouse optimization is a key value of a data lakehouse strategy. To effectively use raw data, it often needs to be curated within a data warehouse.
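As a rough illustration of that curation step, here is a hedged PySpark sketch that reads raw lake files and writes a warehouse-friendly table; the S3 paths, event schema, and column names are all hypothetical.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

# Curate raw, schema-on-read JSON into a partitioned, analytics-ready table.
spark = SparkSession.builder.appName("curate-raw-events").getOrCreate()

raw = spark.read.json("s3://my-lake/raw/events/")           # hypothetical raw zone
curated = (
    raw.filter(F.col("event_type").isNotNull())             # drop malformed rows
       .withColumn("event_date", F.to_date("event_ts"))     # normalize timestamps
       .select("event_date", "event_type", "user_id")
)
curated.write.mode("overwrite").partitionBy("event_date").parquet(
    "s3://my-lake/curated/events/"                          # warehouse-ready layout
)
```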
Five Best Practices for Data Analytics: Select a Storage Platform. Extracted data must be saved somewhere, and there are several choices to consider, each with its own set of advantages and disadvantages. Data warehouses are used to store data that has been processed for a specific function from one or more sources.
This open format allows for seamless storage and retrieval of data across different databases. By automating the integration of all Fabric workloads into OneLake, Microsoft eliminates the need for developers, analysts, and business users to create their own data silos.
During the embeddings experiment, the dataset was converted into embeddings, stored in a vector database, and then matched with the embeddings of the question to extract context. The generated query is then run against the database to fetch the relevant context. Based on the initial tests, this method showed great results.
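The matching step described above can be sketched in a few lines of Python. The embed() function below is a hypothetical stand-in for whatever embedding model the experiment used, and the in-memory array stands in for the vector database; only the cosine-similarity lookup pattern is the point.

```python
import numpy as np

# Hypothetical stand-in for a real embedding model: deterministic unit vectors.
def embed(text: str) -> np.ndarray:
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    v = rng.standard_normal(384)
    return v / np.linalg.norm(v)

documents = ["refund policy ...", "shipping times ...", "warranty terms ..."]
doc_vectors = np.stack([embed(d) for d in documents])  # the "vector database"

question = "How long does shipping take?"
q = embed(question)

# Cosine similarity between the question and every stored embedding
# (all vectors are unit-length, so a dot product suffices), then take
# the best match as context.
scores = doc_vectors @ q
context = documents[int(np.argmax(scores))]
print(context)
```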
The Microsoft Certified Solutions Associate and Microsoft Certified Solutions Expert certifications cover a wide range of topics related to Microsoft’s technology suite, including Windows operating systems, Azure cloud computing, Office productivity software, Visual Studio programming tools, and SQL Server databases.
However, even digital information has to be stored somewhere. While databases were the traditional way to store large amounts of data, a new storage method has developed that can store even larger and more varied amounts of data: data lakes. What are data lakes?
Online analytical processing (OLAP) database systems and artificial intelligence (AI) complement each other and can help enhance data analysis and decision-making when used in tandem. Defining OLAP today: OLAP database systems have significantly evolved since their inception in the early 1990s.
Modin empowers practitioners to use pandas on data at scale, without requiring them to change a single line of code. Modin leverages our cutting-edge academic research on dataframes, the abstraction underlying pandas, to bring the best of databases and distributed systems to dataframes. Run operations in pandas, all in Snowflake!
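Modin's documented entry point really is just a swapped import; everything after that is the familiar pandas API. A minimal sketch, assuming Modin is installed and the CSV path is hypothetical:

```python
# The only change Modin asks for is the import line.
import modin.pandas as pd  # instead of: import pandas as pd

df = pd.read_csv("large_dataset.csv")             # hypothetical file; reads in parallel
summary = df.groupby("category")["value"].mean()  # same pandas syntax, distributed
print(summary.head())
```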
Thus was born a single database and the relational model for transactions and business intelligence. Its early success, coupled with IBM WebSphere in the 1990s, put it in the spotlight as the database system for several Olympic Games, including 1992 Barcelona, 1996 Atlanta, and the 1998 Winter Olympics in Nagano.
Solution overview With SageMaker Studio JupyterLab notebook’s SQL integration, you can now connect to popular data sources like Snowflake, Athena, Amazon Redshift, and Amazon DataZone. For example, you can visually explore data sources like databases, tables, and schemas directly from your JupyterLab ecosystem.
The combination of large language models (LLMs), including the ease of integration that Amazon Bedrock offers, and a scalable, domain-oriented data infrastructure positions this as an intelligent method of tapping into the abundant information held in various analytics databases and data lakes.
They all agree that a data mart is a subject-oriented subset of a data warehouse focusing on a particular business unit, department, subject area, or business functionality. The data mart’s data is usually stored in databases containing a moving frame of the data required for analysis, not the full history of the data.
There are many well-known libraries and platforms for data analysis, such as Pandas and Tableau, in addition to analytical databases like ClickHouse, MariaDB, Apache Druid, Apache Pinot, Google BigQuery, Amazon Redshift, etc. With Great Expectations, data teams can express what they “expect” from their data using simple assertions.
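Those assertions look roughly like the sketch below, which uses the older pandas-backed Great Expectations API (ge.from_pandas); recent releases reorganize this around a context/validator workflow, and the example dataframe is hypothetical.

```python
import great_expectations as ge
import pandas as pd

# A tiny, hypothetical dataset to validate.
df = pd.DataFrame({"order_id": [1, 2, 3], "amount": [10.0, 25.5, 7.2]})
ge_df = ge.from_pandas(df)

# Simple assertions about what the team "expects" from the data.
not_null = ge_df.expect_column_values_to_not_be_null("order_id")
in_range = ge_df.expect_column_values_to_be_between("amount", min_value=0)

print(not_null.success)  # True if no order_id is missing
print(in_range.success)  # True if every amount is non-negative
```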
Watsonx.data will allow users to access their data through a single point of entry and run multiple fit-for-purpose query engines across IT environments. Through workload optimization, an organization can reduce data warehouse costs by up to 50 percent by augmenting with this solution. [1]
Data is the differentiator as business leaders look to utilize their competitive edge as they implement generative AI (gen AI). Leaders feel the pressure to infuse their processes with artificial intelligence (AI) and are looking for ways to harness the insights in their data platforms to fuel this movement.
Common databases appear unable to cope with the immense increase in data volumes. This is where the BigQuery data warehouse comes into play. BigQuery operating principles: business intelligence projects presume collecting information from different sources into one database.
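Querying BigQuery from Python looks roughly like the sketch below, using the official google-cloud-bigquery client; it assumes Google Cloud credentials are configured, and the project, dataset, and table names are hypothetical.

```python
from google.cloud import bigquery

# Client picks up credentials from the environment; project is illustrative.
client = bigquery.Client(project="my-project")

query = """
    SELECT channel, COUNT(*) AS sessions
    FROM `my-project.analytics.events`
    GROUP BY channel
    ORDER BY sessions DESC
"""

# Run the query and iterate over the result rows.
for row in client.query(query).result():
    print(row["channel"], row["sessions"])
```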
The modern data stack is a combination of various software tools used to collect, process, and store data on a well-integrated cloud-based data platform. It is known to have benefits in handling data due to its robustness, speed, and scalability. A typical modern data stack consists of the following: A data warehouse.
Artificial intelligence (AI) adoption is still in its early stages. The Stanford Institute for Human-Centered Artificial Intelligence’s Center for Research on Foundation Models (CRFM) recently outlined the many risks of foundation models, as well as opportunities. Trustworthiness is critical.
Overview: Data science vs data analytics. Think of data science as the overarching umbrella that covers a wide range of tasks performed to find patterns in large datasets, structure data for use, train machine learning models and develop artificial intelligence (AI) applications.
Overall, this partnership enables the retailer to make data-driven decisions, improve supply chain efficiency and ultimately boost customer satisfaction, all in a secure and scalable cloud environment. The platform provides an intelligent, self-service data ecosystem that enhances data governance, quality and usability.
Introduction: ETL plays a crucial role in Data Management. This process enables organisations to gather data from various sources, transform it into a usable format, and load it into data warehouses or databases for analysis. It also makes predictions for the future of ETL processes.
Data warehouses are a critical component of any organization’s technology ecosystem. They provide the backbone for a range of use cases such as business intelligence (BI) reporting, dashboarding, and machine-learning (ML)-based predictive analytics that enable faster decision making and insights.
They defined it as: “A data lakehouse is a new, open data management architecture that combines the flexibility, cost-efficiency, and scale of data lakes with the data management and ACID transactions of data warehouses, enabling business intelligence (BI) and machine learning (ML) on all data.”
Modernizing your data infrastructure to hybrid cloud for applications, analytics and gen AI: adopting multicloud and hybrid strategies is becoming mandatory, requiring databases that support flexible deployments across the hybrid cloud. This ensures you have a data foundation that grows with your data needs, wherever your data resides.
Data ingestion is the critical process of capturing, transforming, and loading data into a centralised repository where it can be processed, analysed, and leveraged. At its core, data ingestion refers to the act of absorbing data from multiple sources and transporting it to a destination, such as a database, data warehouse, or data lake.
By using open formats, these solutions provide unified data access, allowing seamless sharing of data across an organization without the need for extensive migration or restructuring. The integration with established data warehouse engines ensures compatibility with existing systems and workflows.
Challenges with channel video segmentation: Our technology is based on artificial intelligence (AI) and specifically computer vision (CV), which allows us to track brand exposure and identify its location accurately. We have built one of the largest databases of brand impressions in the world with over 6 billion data points.
Watsonx Assistant can serve as a user-friendly, natural-language Q&A interface for your supplier database. In the background, watsonx.ai generates database queries and content like Requests for Proposals (RFPs) or Requests for Information (RFIs), while Watson Discovery analyzes supplier financial reports.
Through workload optimization across multiple query engines and storage tiers, organizations can reduce data warehouse costs by up to 50 percent. [1] Watsonx.data offers built-in governance and automation to get to trusted insights within minutes, and integrations with existing databases and tools to simplify setup and user experience.
Amazon Bedrock, a fully managed service designed to facilitate the integration of LLMs into enterprise applications, offers a choice of high-performing LLMs from leading artificial intelligence (AI) companies like Anthropic, Mistral AI, Meta, and Amazon through a single API. The LLM generates output based on the user prompt.
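Calling a Bedrock-hosted model through that single API looks roughly like the sketch below, using boto3's Converse API (available in recent boto3 versions); the region, model ID, and prompt are illustrative, and AWS credentials must already be configured.

```python
import boto3

# Bedrock runtime client; region and model ID are illustrative choices.
client = boto3.client("bedrock-runtime", region_name="us-east-1")

response = client.converse(
    modelId="anthropic.claude-3-haiku-20240307-v1:0",
    messages=[
        {"role": "user", "content": [{"text": "Summarize our Q3 sales data sources."}]}
    ],
)

# The generated text lives inside the response message content.
print(response["output"]["message"]["content"][0]["text"])
```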
Well, according to Brij Kishore Pandey, it stands for Extract, Transform, Load and is a fundamental process in data engineering, ensuring data moves efficiently from raw sources to structured storage for analysis. The steps include: Extraction: data is collected from multiple sources (databases, APIs, flat files).
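A minimal end-to-end sketch of those three steps in Python, using a flat file as the source and SQLite as a stand-in for the warehouse; the file name and columns are hypothetical.

```python
import csv
import sqlite3

# Extract: read rows from a flat file (hypothetical path and schema).
with open("orders.csv", newline="") as f:
    rows = list(csv.DictReader(f))

# Transform: normalize types and drop incomplete records.
cleaned = [
    (r["order_id"], r["customer"].strip().lower(), float(r["amount"]))
    for r in rows
    if r.get("amount")
]

# Load: write the structured result into a warehouse-style table.
conn = sqlite3.connect("warehouse.db")
conn.execute(
    "CREATE TABLE IF NOT EXISTS orders (order_id TEXT, customer TEXT, amount REAL)"
)
conn.executemany("INSERT INTO orders VALUES (?, ?, ?)", cleaned)
conn.commit()
```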
Velocity: it indicates the speed at which data is generated and processed, necessitating real-time analytics capabilities. Businesses need to analyse data as it streams in to make timely decisions. This diversity requires flexible data processing and storage solutions.
They focused on improving customer service using data with artificial intelligence (AI) and ML and saw positive results, with their Group AI Maturity increasing from 50% to 80%, according to the TM Forum’s AI Maturity Index.
Summary: Oracle’s Exalytics, Exalogic, and Exadata transform enterprise IT with optimised analytics, middleware, and database systems. These cutting-edge solutions optimise analytics, middleware, and database performance, enabling businesses to achieve unparalleled efficiency and scalability.
It is a data integration process that involves extracting data from various sources, transforming it into a suitable format, and loading it into a target system, typically a data warehouse. ETL is the backbone of effective data management, ensuring organisations can leverage their data for informed decision-making.