Artificial intelligence (AI) is all the rage, and rightly so. Data marts involved the creation of built-for-purpose analytic repositories meant to directly support specific business users and reporting needs. It's time to maximize the potential of your artificial intelligence (AI) initiatives.
In the ever-evolving world of big data, managing vast amounts of information efficiently has become a critical challenge for businesses across the globe. Understanding Data Lakes: A data lake is a centralized repository that stores structured, semi-structured, and unstructured data in its raw format.
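To make the "raw format" point concrete, here is a minimal sketch of landing mixed-format data in an S3-backed data lake. It assumes AWS credentials are configured via boto3; the bucket name and key prefixes are hypothetical examples, not from the excerpt.

```python
# Minimal sketch: landing raw data of mixed formats in an S3-backed data lake.
# The bucket name "example-data-lake" and key prefixes are hypothetical.
import json
import boto3

s3 = boto3.client("s3")
BUCKET = "example-data-lake"  # hypothetical bucket name

# Structured data (CSV) lands as-is, in its raw format.
s3.upload_file("orders.csv", BUCKET, "raw/orders/orders.csv")

# Semi-structured data (JSON events) is written without an upfront schema.
event = {"user_id": 42, "action": "click", "ts": "2024-04-23T10:00:00Z"}
s3.put_object(
    Bucket=BUCKET,
    Key="raw/events/event-0001.json",
    Body=json.dumps(event).encode("utf-8"),
)

# Unstructured data (an image) is stored alongside, untouched.
s3.upload_file("product.jpg", BUCKET, "raw/images/product.jpg")
```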
Data Swamp vs. Data Lake: When you imagine a lake, it's likely an idyllic image of a tree-ringed body of reflective water amid singing birds and dabbling ducks. I'll take the lake, thank you very much. But when a data lake is dirty, stagnant, or hard to use, your business will suffer. Benefits of a Data Lake.
In today's digital world, data is king. Organizations that can capture, store, format, and analyze data and apply the business intelligence gained through that analysis to their products or services can enjoy significant competitive advantages. But the amount of data companies must manage is growing at a staggering rate.
Real-Time ML with Spark and SBERT, AI Coding Assistants, Data Lake Vendors, and ODSC East Highlights: Getting Up to Speed on Real-Time Machine Learning with Spark and SBERT. Learn more about real-time machine learning with this approach that uses Apache Spark and SBERT. These libraries will give you a solid start.
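The excerpt doesn't include the article's code, but a minimal sketch of the Spark-plus-SBERT combination might look like the following: embedding a text column with a sentence-transformers model inside a Spark pandas UDF. The model name, column names, and sample rows are illustrative assumptions; it requires pyspark and sentence-transformers installed.

```python
# Sketch: batch-embedding a text column in Spark with an SBERT model via a
# pandas UDF. Model choice and column names are illustrative.
import pandas as pd
from pyspark.sql import SparkSession
from pyspark.sql.functions import pandas_udf
from pyspark.sql.types import ArrayType, FloatType
from sentence_transformers import SentenceTransformer

spark = SparkSession.builder.appName("sbert-embeddings").getOrCreate()

@pandas_udf(ArrayType(FloatType()))
def embed(texts: pd.Series) -> pd.Series:
    # Loaded per batch here for simplicity; cache at module level in production.
    model = SentenceTransformer("all-MiniLM-L6-v2")
    vectors = model.encode(texts.tolist())
    return pd.Series([v.tolist() for v in vectors])

df = spark.createDataFrame(
    [("great product",), ("slow shipping",)], ["review_text"]
)
df.withColumn("embedding", embed("review_text")).show(truncate=40)
```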
As one of the largest AWS customers, Twilio engages with data, artificial intelligence (AI), and machine learning (ML) services to run their daily workloads. Data is the foundational layer for all generative AI and ML applications. The following diagram illustrates the solution architecture.
Data quality, on the other hand, became an important factor in every company valuation, which boosted topics such as reporting, data governance, and ultimately data engineering even more than data science. Google Trends: Big Data (blue), Data Science (red), Business Intelligence (yellow), and Process Mining (green).
Incidentally, this is no longer as pronounced for data scientists; even though really good people remain rare there too, companies have the greatest need for data engineers. These are the colleagues who build and maintain the data warehouses and data lakes.
Metabase (GitHub | Website): Metabase is an easy-to-use data exploration tool that allows even non-technical users to ask questions and gain insights. This business intelligence tool lets you build interactive dashboards, create models for cleaning tables, and set up alerts to notify users when your data changes.
Data platform architecture has an interesting history. Toward the turn of the millennium, enterprises started to realize that reporting and business intelligence workloads required a different solution than their transactional applications. A read-optimized platform that could integrate data from multiple applications emerged.
Overview: Data science vs. data analytics. Think of data science as the overarching umbrella that covers a wide range of tasks performed to find patterns in large datasets, structure data for use, train machine learning models, and develop artificial intelligence (AI) applications.
Artificial intelligence (AI) adoption is still in its early stages. As more businesses use AI systems and the technology continues to mature and change, improper use could expose a company to significant financial, operational, regulatory, and reputational risks. Trustworthiness is critical.
As organisations grapple with this vast amount of information, understanding the main components of Big Data becomes essential for leveraging its potential effectively. Key Takeaways: Big Data originates from diverse sources, including IoT and social media. Data lakes and cloud storage provide scalable solutions for large datasets.
Online analytical processing (OLAP) database systems and artificial intelligence (AI) complement each other and can help enhance data analysis and decision-making when used in tandem. IBM watsonx.data is the next-generation OLAP system that can help you make the most of your data.
To optimize data analytics and AI workloads, organizations need a data store built on an open data lakehouse architecture. This type of architecture combines the performance and usability of a data warehouse with the flexibility and scalability of a data lake.
This includes integration with your data warehouse engines, which now must balance real-time data processing and decision-making with cost-effective object storage, open-source technologies, and a shared metadata layer to share data seamlessly with your data lakehouse.
In a prior blog, we pointed out that warehouses, known for high-performance data processing for business intelligence, can quickly become expensive for new data and evolving workloads.
Think of it as building plumbing for data to flow smoothly throughout the organization. EVENT: ODSC East 2024, In-Person and Virtual Conference, April 23rd to 25th, 2024. Join us for a deep dive into the latest data science and AI trends, tools, and techniques, from LLMs to data analytics and from machine learning to responsible AI.
Amazon Bedrock, a fully managed service designed to facilitate the integration of LLMs into enterprise applications, offers a choice of high-performing LLMs from leading artificial intelligence (AI) companies like Anthropic, Mistral AI, Meta, and Amazon through a single API. The Step Functions workflow starts.
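A minimal sketch of that single API follows, using Bedrock's Converse operation via boto3. The region, model ID, prompt, and inference settings are example assumptions; any Bedrock-supported model ID could be substituted.

```python
# Sketch: calling an LLM through Amazon Bedrock's Converse API.
# Region and model ID below are examples, not prescriptions.
import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

response = bedrock.converse(
    modelId="anthropic.claude-3-haiku-20240307-v1:0",  # example model ID
    messages=[
        {"role": "user", "content": [{"text": "Summarize our Q1 sales notes."}]}
    ],
    inferenceConfig={"maxTokens": 256, "temperature": 0.2},
)

print(response["output"]["message"]["content"][0]["text"])
```

Because the message and inference-configuration shapes are shared across providers, swapping vendors is a one-line change to modelId rather than a new integration.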
Businesses face significant hurdles when preparing data for artificial intelligence (AI) applications. Data silos and duplication, along with concerns about data quality, create a complex environment for organizations to manage.
After a few minutes, a transcript is produced with Amazon Transcribe Call Analytics and saved to another S3 bucket for processing by other business intelligence (BI) tools. PCA's security features ensure that any PII is redacted from the transcript, as well as from the audio file itself.
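For orientation, starting such a job with PII redaction might look like the sketch below. This is not PCA's actual code; the bucket names, job name, role ARN, and channel layout are hypothetical placeholders.

```python
# Sketch: starting an Amazon Transcribe Call Analytics job with PII
# redaction, writing the transcript to a second S3 bucket for BI tools.
# All names and ARNs below are hypothetical.
import boto3

transcribe = boto3.client("transcribe")

transcribe.start_call_analytics_job(
    CallAnalyticsJobName="example-call-0001",
    Media={"MediaFileUri": "s3://example-audio-bucket/calls/call-0001.wav"},
    OutputLocation="s3://example-transcript-bucket/analytics/",
    DataAccessRoleArn="arn:aws:iam::123456789012:role/ExampleTranscribeRole",
    ChannelDefinitions=[
        {"ChannelId": 0, "ParticipantRole": "AGENT"},
        {"ChannelId": 1, "ParticipantRole": "CUSTOMER"},
    ],
    Settings={
        "ContentRedaction": {
            "RedactionType": "PII",
            "RedactionOutput": "redacted",
        }
    },
)
```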
Inconsistent or unstructured data can lead to faulty insights, so transformation helps standardise data, ensuring it aligns with the requirements of Analytics, Machine Learning, or Business Intelligence tools. This makes drawing actionable insights, spotting patterns, and making data-driven decisions easier.
Watsonx.data is built on three core integrated components: multiple query engines, a catalog that keeps track of metadata, and the storage and relational data sources that the query engines access directly. AMC Networks is excited by the opportunity to capitalize on the value of all of their data to improve viewer experiences.
Then we have some other ETL processes that constantly land the past five years of data into the Datamarts. As we learned in the previous section, a Dataflow is a self-service ETL and data preparation layer that connects to various data sources, transforms the data, and stores the results in CSV format in Azure Data Lake Storage Gen2 (ADLS Gen2).
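Dataflows themselves are configured in the Power BI interface rather than in code, but for intuition, landing a prepared CSV in ADLS Gen2 programmatically might look like this sketch. The storage account, filesystem, paths, and credential are hypothetical; it assumes the azure-storage-file-datalake package.

```python
# Sketch: landing a transformed CSV in Azure Data Lake Storage Gen2, the same
# destination described above. Account, filesystem, and paths are hypothetical.
from azure.storage.filedatalake import DataLakeServiceClient

service = DataLakeServiceClient(
    account_url="https://exampleaccount.dfs.core.windows.net",
    credential="<account-key>",  # placeholder; supply real credentials
)

filesystem = service.get_file_system_client("dataflows")
file_client = filesystem.get_file_client("sales/2024/orders.csv")

# Upload the prepared CSV, replacing any previous version.
with open("orders_transformed.csv", "rb") as data:
    file_client.upload_data(data, overwrite=True)
```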
Dimensional Data Modeling in the Modern Era by Dustin Dorsey (Slides): Dustin Dorsey's slides explored the evolution of dimensional data modeling, a staple in data warehousing and business intelligence.
Social media conversations, comments, customer reviews, and image data are unstructured in nature and hold valuable insights, many of which are still being uncovered through advanced techniques like Natural Language Processing (NLP) and machine learning. Many find themselves swamped by the volume and complexity of unstructured data.
Having been in business for over 50 years, ARC had accumulated a massive amount of data stored in siloed, on-premises servers across its seven business domains. Using Alation, ARC automated the data curation and cataloging process.
ETL (Extract, Transform, Load): This is a core data engineering process for moving data from one or more sources to a destination, typically a data warehouse or data lake. It is an important skill because ETL is a critical process for data warehousing and business intelligence.
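A minimal ETL sketch makes the three stages concrete. The file names, column names, and SQLite database (standing in for a warehouse) are illustrative assumptions.

```python
# Minimal ETL sketch: extract from a CSV source, transform with pandas, load
# into a local SQLite database standing in for a warehouse.
import sqlite3
import pandas as pd

# Extract: read raw records from the source system.
raw = pd.read_csv("raw_orders.csv")

# Transform: standardize column names, drop incomplete rows, derive a field.
raw.columns = [c.strip().lower() for c in raw.columns]
clean = raw.dropna(subset=["order_id", "amount"])
clean = clean.assign(amount_usd=clean["amount"].round(2))

# Load: write the cleaned table into the "warehouse".
with sqlite3.connect("warehouse.db") as conn:
    clean.to_sql("orders", conn, if_exists="replace", index=False)
```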
Amazon AppFlow was used to facilitate the smooth and secure transfer of data from various sources into ODAP. Additionally, Amazon Simple Storage Service (Amazon S3) served as the central datalake, providing a scalable and cost-effective storage solution for the diverse data types collected from different systems.
Summary: Power BI is a business intelligence tool that transforms raw data into actionable insights. Introduction: Managing a business and its key verticals can be challenging. However, with the surge of data tools like Power BI, you can not only manage the data but also draw actionable insights from it.
Other users you may encounter include: data engineers, if the data platform is not entirely separate from the ML platform; and analytics engineers and data analysts, if you need to integrate third-party business intelligence tools and the data platform is not separate.
It's distributed both in the cloud and on-premises, allowing extensive use and movement across clouds, apps, and networks, as well as stores of data at rest. An architecture designed for data democratization aims to be flexible, integrated, agile, and secure to enable the use of data and artificial intelligence (AI) at scale.
Many companies are making a business out of helping enterprises get data out of old systems, and tools like Apache Airflow are helping streamline these processes. But even if data is no longer stuck in mainframes, it's still fragmented across systems like cloud SaaS services or data lakes.
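As a small illustration of how Airflow streamlines such migrations, here is a minimal DAG sketch (Airflow 2.x) that extracts from a legacy system and loads into a data lake on a daily schedule. The DAG ID, task names, and the two placeholder functions are hypothetical stand-ins for real connectors.

```python
# Sketch: a minimal Apache Airflow 2.x DAG that moves data out of a legacy
# system on a schedule. The extract/load functions are placeholders.
from datetime import datetime
from airflow import DAG
from airflow.operators.python import PythonOperator

def extract_from_legacy():
    print("pulling records from the legacy system")

def load_to_lake():
    print("writing records to the data lake")

with DAG(
    dag_id="legacy_to_lake",
    start_date=datetime(2024, 1, 1),
    schedule="@daily",
    catchup=False,
) as dag:
    extract = PythonOperator(task_id="extract", python_callable=extract_from_legacy)
    load = PythonOperator(task_id="load", python_callable=load_to_lake)
    extract >> load  # run extraction before loading
```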
Create business intelligence (BI) dashboards for visual representation and analysis of event data. [Figure: AI chatbot workflow] Archiving and reporting layer: The archiving and reporting layer handles streaming, storing, and extract, transform, load (ETL) operations on event data.
Tags can be added at the Amazon DataZone domain level and used for organizing data assets, users, and projects. Usage of data is tracked through data consumers such as Amazon Athena, Amazon Redshift, or Amazon SageMaker. You can define metadata tags and assign them to resources like databases and tables.
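For context on the Athena consumer path, a query against governed data might be issued as in the sketch below. The database, table, and results bucket are hypothetical.

```python
# Sketch: querying governed data with Amazon Athena, one of the consumers
# through which usage is tracked. Names and bucket are hypothetical.
import boto3

athena = boto3.client("athena")

response = athena.start_query_execution(
    QueryString="SELECT * FROM sales.orders LIMIT 10",
    QueryExecutionContext={"Database": "sales"},
    ResultConfiguration={"OutputLocation": "s3://example-athena-results/"},
)
print(response["QueryExecutionId"])  # poll this ID for results
```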