Data Models, Data Pipeline and ML - Data Science Current

ML Collaboration: Best Practices From 4 ML Teams

The MLOps Blog

DECEMBER 28, 2022

The onset of the pandemic has triggered a rapid increase in the demand and adoption of ML technology. Building ML team Following the surge in ML use cases that have the potential to transform business, the leaders are making a significant investment in ML collaboration, building teams that can deliver the promise of machine learning.

ML

ML ML Data Scientist Machine Learning

Streamlining Process Configuration in Machine Learning with Hydra

Pickl AI

NOVEMBER 29, 2024

It enhances scalability, experimentation, and reproducibility, allowing ML teams to focus on innovation. This blog highlights the importance of organised, flexible configurations in ML workflows and introduces Hydra. Machine Learning projects evolve rapidly, frequently introducing new data , models, and hyperparameters.

Machine Learning

Machine Learning Machine Learning ML ML

MLOps Landscape in 2023: Top Tools and Platforms

The MLOps Blog

JUNE 27, 2023

Alignment to other tools in the organization’s tech stack Consider how well the MLOps tool integrates with your existing tools and workflows, such as data sources, data engineering platforms, code repositories, CI/CD pipelines, monitoring systems, etc. and Pandas or Apache Spark DataFrames.

Machine Learning

Machine Learning Machine Learning ML ML

Webinars

Automation, Evolved: Your New Playbook For Smarter Knowledge Work

MORE WEBINARS

Building an efficient MLOps platform with OSS tools on Amazon ECS with AWS Fargate

AWS Machine Learning Blog

SEPTEMBER 18, 2024

The ZMP analyzes billions of structured and unstructured data points to predict consumer intent by using sophisticated artificial intelligence (AI) to personalize experiences at scale. Hosted on Amazon ECS with tasks run on Fargate, this platform streamlines the end-to-end ML workflow, from data ingestion to model deployment.

AWS

AWS Machine Learning Machine Learning ML

What Lays Ahead in 2024? AI/ML Predictions for the New Year

Iguazio

DECEMBER 18, 2023

For data science practitioners, productization is key, just like any other AI or ML technology. However, it's important to contextualize generative AI within the broader landscape of AI and ML technologies. By doing so, you can ensure quality and production-ready models. Here’s to a successful 2024!

ML

ML ML AI AI

Building and Scaling Gen AI Applications with Simplicity, Performance and Risk Mitigation in Mind Using Iguazio (acquired by McKinsey) and MongoDB

Iguazio

JULY 22, 2024

MongoDB for end-to-end AI data management MongoDB Atlas , an integrated suite of data services centered around a multi-cloud NoSQL database, enables developers to unify operational, analytical, and AI data services to streamline building AI-enriched applications. Atlas Vector Search lets you search unstructured data.

AI

AI AI ML ML

Mastering Version Control for ML Models: Best Practices You Need to Know

DagsHub

AUGUST 29, 2024

Source: Author Introduction Machine learning (ML) models, like other software, are constantly changing and evolving. Version control systems (VCS) play a key role in this area by offering a structured method to track changes made to models and handle versions of data and code used in these ML projects.

ML

ML ML Python Machine Learning

What Lays Ahead in 2024? AI/ML Predictions for the New Year

Iguazio

DECEMBER 18, 2023

For data science practitioners, productization is key, just like any other AI or ML technology. However, it's important to contextualize generative AI within the broader landscape of AI and ML technologies. By doing so, you can ensure quality and production-ready models. Here’s to a successful 2024!

ML

ML ML AI AI

Future-Proofing Your App: Strategies for Building Long-Lasting Apps

Iguazio

MAY 29, 2024

The 4 Gen AI Architecture Pipelines The four pipelines are: 1. The Data Pipeline The data pipeline is the foundation of any AI system. It's responsible for collecting and ingesting the data from various external sources, processing it and managing the data.

Data Pipeline

Data Pipeline AI AI ML

The Evolution of Customer Data Modeling: From Static Profiles to Dynamic Customer 360

phData

SEPTEMBER 27, 2024

Introduction: The Customer Data Modeling Dilemma You know, that thing we’ve been doing for years, trying to capture the essence of our customers in neat little profile boxes? For years, we’ve been obsessed with creating these grand, top-down customer data models. Yeah, that one.

Data Models

Data Models Data Modeling Apache Kafka Data Lakes

Comparing Tools For Data Processing Pipelines

The MLOps Blog

MARCH 15, 2023

If you will ask data professionals about what is the most challenging part of their day to day work, you will likely discover their concerns around managing different aspects of data before they get to graduate to the data modeling stage. This ensures that the data is accurate, consistent, and reliable.

Data Pipeline

Data Pipeline ETL SQL Data Quality

Self-Service Analytics for Google Cloud, now with Looker and Tableau

Tableau

OCTOBER 8, 2021

Leveraging Looker’s semantic layer will provide Tableau customers with trusted, governed data at every stage of their analytics journey. With its LookML modeling language, Looker provides a unique, modern approach to define governed and reusable data models to build a trusted foundation for analytics.

Tableau

Tableau Analytics Analytics Machine Learning

Implementing GenAI in Practice

Iguazio

JANUARY 22, 2024

In addition, MLOps practices like building data, experting tracking, versioning, artifacts and others, also need to be part of the GenAI productization process. For example, when indexing a new version of a document, it’s important to take care of versioning in the ML pipeline. This helps cleanse the data.

Data Pipeline

Data Pipeline ML ML Data Warehouse

How to use foundation models and trusted governance to manage AI workflow risk

IBM Journey to AI blog

OCTOBER 16, 2023

It includes processes that trace and document the origin of data, models and associated metadata and pipelines for audits. An AI governance framework ensures the ethical, responsible and transparent use of AI and machine learning (ML). ” Are foundation models trustworthy?

AI

AI AI Data Warehouse ML

Building Scalable AI Pipelines with MLOps: A Guide for Software Engineers

ODSC - Open Data Science

OCTOBER 7, 2024

In today’s landscape, AI is becoming a major focus in developing and deploying machine learning models. It isn’t just about writing code or creating algorithms — it requires robust pipelines that handle data, model training, deployment, and maintenance. Model Training: Running computations to learn from the data.

Machine Learning

Machine Learning Machine Learning AI AI

What Do Data Scientists Do? A Guide to AI Maturity, Challenges, and Solutions

DataRobot Blog

SEPTEMBER 13, 2022

Data scientists drive business outcomes. Many implement machine learning and artificial intelligence to tackle challenges in the age of Big Data. They develop and continuously optimize AI/ML models , collaborating with stakeholders across the enterprise to inform decisions that drive strategic business value.

Data Scientist

Data Scientist ML ML AI

LLMOps vs. MLOps: Understanding the Differences

Iguazio

FEBRUARY 8, 2024

They are characterized by their enormous size, complexity, and the vast amount of data they process. These elements need to be taken into consideration when managing, streamlining and deploying LLMs in ML pipelines, hence the specialized discipline of LLMOps. Data Pipeline - Manages and processes various data sources.

ML

ML ML Data Scientist AI

How to Choose MLOps Tools: In-Depth Guide for 2024

DagsHub

APRIL 21, 2024

Source: [link] Similarly, while building any machine learning-based product or service, training and evaluating the model on a few real-world samples does not necessarily mean the end of your responsibilities. You need to make that model available to the end users, monitor it, and retrain it for better performance if needed.

Machine Learning

Machine Learning Machine Learning ML ML

Self-Service Analytics for Google Cloud, now with Looker and Tableau

Tableau

OCTOBER 8, 2021

Leveraging Looker’s semantic layer will provide Tableau customers with trusted, governed data at every stage of their analytics journey. With its LookML modeling language, Looker provides a unique, modern approach to define governed and reusable data models to build a trusted foundation for analytics.

Tableau

Tableau Analytics Analytics Machine Learning

The Official Machine Learning and AI Platform of Hacktoberfest 2023

DagsHub

SEPTEMBER 10, 2023

DagsHub is a centralized platform to host and manage machine learning projects, including code, data, models, experiments, annotations, model registry, and more! Intermediate Data Pipeline : Build data pipelines using DVC for automation and versioning of Open Source Machine Learning projects.

Machine Learning

Machine Learning Machine Learning Data Pipeline AI

Implementing Gen AI for Financial Services

Iguazio

FEBRUARY 20, 2024

Building MLOpsPedia This demo on Github shows how to fine tune an LLM domain expert and build an ML application Read More Building Gen AI for Production The ability to successfully scale and drive adoption of a generative AI application requires a comprehensive enterprise approach. Let’s dive into the data management pipeline.

AI

AI AI Data Pipeline Analytics

How to Manage Unstructured Data in AI and Machine Learning Projects

DagsHub

OCTOBER 23, 2024

Managing unstructured data is essential for the success of machine learning (ML) projects. Without structure, data is difficult to analyze and extracting meaningful insights and patterns is challenging. This article will discuss managing unstructured data for AI and ML projects. What is Unstructured Data?

Machine Learning

Machine Learning Machine Learning Data Lakes AI

Generative AI in Software Development

Mlearning.ai

JUNE 16, 2023

Generative AI can be used to automate the data modeling process by generating entity-relationship diagrams or other types of data models and assist in UI design process by generating wireframes or high-fidelity mockups. GPT-4 Data Pipelines: Transform JSON to SQL Schema Instantly Blockstream’s public Bitcoin API.

AI

AI AI Data Analysis Data Analysis

Best 8 Experiment Tracking Tools for Machine Learning 2024

DagsHub

DECEMBER 5, 2023

This is where ML experiment tracking comes into play! What is ML Experiment Tracking? ML experiment tracking is the process of recording, organizing, and analyzing the results of ML experiments. It helps data scientists keep track of their experiments, reproduce their results, and collaborate with others effectively.

Machine Learning

Machine Learning Machine Learning ML ML

Best 8 Data Version Control Tools for Machine Learning 2024

DagsHub

DECEMBER 11, 2023

DagsHub DagsHub is a centralized Github-based platform that allows Machine Learning and Data Science teams to build, manage and collaborate on their projects. In addition to versioning code, teams can also version data, models, experiments and more. It does not support the ‘dvc repro’ command to reproduce its data pipeline.

Machine Learning

Machine Learning Machine Learning Data Lakes Big Data

Discover the Snowflake Architecture With All its Pros and Cons- NIX United

Mlearning.ai

FEBRUARY 16, 2023

Thus, the solution allows for scaling data workloads independently from one another and seamlessly handling data warehousing, data lakes , data sharing, and engineering. Machine Learning Integration Opportunities Organizations harness machine learning (ML) algorithms to make forecasts on the data.

Data Warehouse

Data Warehouse Business Intelligence Business Intelligence Database

Capital One’s data-centric solutions to banking business challenges

Snorkel AI

MAY 12, 2023

Piyush Puri: Please join me in welcoming to the stage our next speakers who are here to talk about data-centric AI at Capital One, the amazing team who may or may not have coined the term, “what’s in your wallet.” What can get less attention is the foundational element of what makes AI and ML shine. That’s data.

Machine Learning

Machine Learning Machine Learning ML ML

Capital One’s data-centric solutions to banking business challenges

Snorkel AI

MAY 12, 2023

Piyush Puri: Please join me in welcoming to the stage our next speakers who are here to talk about data-centric AI at Capital One, the amazing team who may or may not have coined the term, “what’s in your wallet.” What can get less attention is the foundational element of what makes AI and ML shine. That’s data.

Machine Learning

Machine Learning Machine Learning ML ML

Demystifying Time Series Database: A Comprehensive Guide

Pickl AI

JULY 8, 2024

Security is Paramount Implement robust security measures to protect sensitive time series data. Integration with Data Pipelines and Analytics TSDBs often work in tandem with other data tools to create a comprehensive data ecosystem for analysis and insights generation.

Database

Database Data Pipeline Machine Learning Machine Learning

The Ultimate Modern Data Stack Migration Guide

phData

JULY 18, 2023

Why Migrate to a Modern Data Stack? Data teams can focus on delivering higher-value data tasks with better organizational visibility. Move Beyond One-off Analytics: The Modern Data Stack empowers you to elevate your data for advanced analytics and integration of AI/ML, enabling faster generation of actionable business insights.

Data Warehouse

Data Warehouse Analytics Analytics Cloud Data

How to Build an End-To-End ML Pipeline

The MLOps Blog

MAY 9, 2023

One of the most prevalent complaints we hear from ML engineers in the community is how costly and error-prone it is to manually go through the ML workflow of building and deploying models. Building end-to-end machine learning pipelines lets ML engineers build once, rerun, and reuse many times.

ML

ML ML Machine Learning Machine Learning

Data Scientists in the Age of AI Agents and AutoML

Towards AI

JANUARY 22, 2025

Simply put, focusing solely on data analysis, coding or modeling will no longer cuts it for most corporate jobs. You have to understand data, how to extract value from them and how to monitor model performances. These two languages cover most data science workflows.

Data Scientist

Data Scientist EDA AI Exploratory Data Analysis

Gen AI Trends and Scaling Strategies for 2025

Iguazio

MARCH 20, 2025

AI engineering - AI is being democratized for developers and engineers, expanding beyond the limited pool of data scientists. Companies are building AI tools and frameworks that empower engineers to integrate AI into applications without needing deep expertise in ML. AI Agents and multi-agent systems.

AI

AI AI Data Pipeline Data Scientist

Data Science Current

ML Collaboration: Best Practices From 4 ML Teams

Streamlining Process Configuration in Machine Learning with Hydra

Webinars

Trending Sources

MLOps Landscape in 2023: Top Tools and Platforms

Webinars

Building an efficient MLOps platform with OSS tools on Amazon ECS with AWS Fargate

What Lays Ahead in 2024? AI/ML Predictions for the New Year

Building and Scaling Gen AI Applications with Simplicity, Performance and Risk Mitigation in Mind Using Iguazio (acquired by McKinsey) and MongoDB

Mastering Version Control for ML Models: Best Practices You Need to Know

What Lays Ahead in 2024? AI/ML Predictions for the New Year

Future-Proofing Your App: Strategies for Building Long-Lasting Apps

The Evolution of Customer Data Modeling: From Static Profiles to Dynamic Customer 360

Comparing Tools For Data Processing Pipelines

Self-Service Analytics for Google Cloud, now with Looker and Tableau

Implementing GenAI in Practice

How to use foundation models and trusted governance to manage AI workflow risk

Building Scalable AI Pipelines with MLOps: A Guide for Software Engineers

What Do Data Scientists Do? A Guide to AI Maturity, Challenges, and Solutions

LLMOps vs. MLOps: Understanding the Differences

How to Choose MLOps Tools: In-Depth Guide for 2024

Self-Service Analytics for Google Cloud, now with Looker and Tableau

The Official Machine Learning and AI Platform of Hacktoberfest 2023

Implementing Gen AI for Financial Services

How to Manage Unstructured Data in AI and Machine Learning Projects

Generative AI in Software Development

Best 8 Experiment Tracking Tools for Machine Learning 2024

Best 8 Data Version Control Tools for Machine Learning 2024

Discover the Snowflake Architecture With All its Pros and Cons- NIX United

Capital One’s data-centric solutions to banking business challenges

Capital One’s data-centric solutions to banking business challenges

Demystifying Time Series Database: A Comprehensive Guide

The Ultimate Modern Data Stack Migration Guide

How to Build an End-To-End ML Pipeline

Data Scientists in the Age of AI Agents and AutoML

Gen AI Trends and Scaling Strategies for 2025

Stay Connected