Data Preparation, Data Science and Events

Exploring the Power of Microsoft Fabric: A Hands-On Guide with a Sales Use Case

Data Science Dojo

SEPTEMBER 11, 2024

These experiences support professionals at every stage, from ingesting data from different sources into a unified environment and pipelining its ingestion, transformation, and processing to developing predictive models and visualizing the data in interactive BI reports.

Power BI

Power BI Data Pipeline Data Warehouse Data Engineering

Life of modern-day alchemists: What does a data scientist do?

Dataconomy

AUGUST 16, 2023

Today’s question is, “What does a data scientist do?” Step into the realm of data science, where numbers dance like fireflies and patterns emerge from the chaos of information. In this blog post, we’re embarking on a thrilling expedition to demystify the enigmatic role of data scientists.

Data Scientist

Data Scientist Data Science Machine Learning

Deploy large language models for a healthtech use case on Amazon SageMaker

AWS Machine Learning Blog

FEBRUARY 6, 2024

Pharmaceutical companies sell a variety of different, often novel, drugs on the market, where sometimes unintended but serious adverse events can occur. These events can be reported from anywhere, whether a hospital or a home, and must be responsibly and efficiently monitored. The training job is built using the SageMaker PyTorch estimator.

AWS

AWS ML Data Preparation

GraphReduce: Using Graphs for Feature Engineering Abstractions

ODSC - Open Data Science

SEPTEMBER 25, 2023

Unfortunately, our data engineering and machine learning ops teams haven’t built a feature vector for us, so all of the relevant data lives in a relational schema in separate tables. Understanding Relationships: GraphReduce doesn’t help with this part, so you’ll need to profile the data, talk to a data guru, or use emerging technology.

Data Preparation

Data Preparation Machine Learning ML

How to Implement Augmented Analytics for Data-Driven Decision-Making

ODSC - Open Data Science

FEBRUARY 12, 2024

You can even use generative AI to supplement your data sets with synthetic data for privacy or accuracy. Most businesses already recognize the need to automate the actual analysis of data, but you can go further. Automating the data preparation and interpretation phases will take much time and effort out of the equation, too.

Augmented Analytics

Augmented Analytics Analytics Data Science

Accelerate client success management through email classification with Hugging Face on Amazon SageMaker

AWS Machine Learning Blog

SEPTEMBER 12, 2023

Scalable Capital’s data science and client service teams identified that one of the largest bottlenecks in servicing our clients was responding to email inquiries. The following diagram shows the workflow for our email classifier project, but can also be generalized to other data science projects.

Data Science

Data Science Data Scientist AWS ML

Decoding Demand: The Data Science Approach to Forecasting Trends

Pickl AI

JULY 1, 2024

Demand forecasting, powered by data science, helps predict customer needs. Optimize inventory, streamline operations, and make data-driven decisions for success. Data Science empowers businesses to leverage the power of data for accurate and insightful demand forecasts.

Data Science

Data Science Decision Trees Machine Learning

Predictive Analytics: 4 Primary Aspects of Predictive Analytics

Smart Data Collective

SEPTEMBER 16, 2020

These statistical models are growing as a result of the wide swaths of available current data as well as the advent of capable artificial intelligence and machine learning. Data Sourcing. The applications of predictive analytics are extensive and often require four key components to maintain effectiveness.

Predictive Analytics

Predictive Analytics Analytics Decision Trees

How Light & Wonder built a predictive maintenance solution for gaming machines on AWS

AWS Machine Learning Blog

JUNE 22, 2023

Working with AWS, Light & Wonder recently developed an industry-first secure solution, Light & Wonder Connect (LnW Connect), to stream telemetry and machine health data from roughly half a million electronic gaming machines distributed across its casino customer base globally when LnW Connect reaches its full potential.

AWS

AWS ML Machine Learning

How Marubeni is optimizing market decisions using AWS machine learning and analytics

AWS Machine Learning Blog

MARCH 8, 2023

Manager Data Science at Marubeni Power International. Therefore, the ingestion components need to be able to manage authentication, data sourcing in pull mode, data preprocessing, and data storage. Because the data is being fetched hourly, a mechanism is also required to orchestrate and schedule ingestion jobs.

AWS

AWS Machine Learning Analytics

AIOps vs. MLOps: Harnessing big data for “smarter” ITOPs

IBM Journey to AI blog

AUGUST 12, 2024

Here, we’ll discuss the key differences between AIOps and MLOps and how they each help teams and businesses address different IT and data science challenges. Data characteristics and preprocessing: AIOps tools handle a range of data sources and types, including system logs, performance metrics, network data and application events.

Big Data

Big Data ML

2024 Mexican Grand Prix: Formula 1 Prediction Challenge Results

Ocean Protocol

NOVEMBER 28, 2024

Introduction: The Formula 1 Prediction Challenge: 2024 Mexican Grand Prix brought together data scientists to tackle one of the most dynamic aspects of racing: pit stop strategies. With every second on the track critical, the challenge showcased how data can shape decisions that define race outcomes.

Cross Validation

Cross Validation Decision Trees Data Scientist Data Science

Your guide to generative AI and ML at AWS re:Invent 2024

AWS Machine Learning Blog

NOVEMBER 19, 2024

The excitement is building for the fourteenth edition of AWS re:Invent, and as always, Las Vegas is set to host this spectacular event. This session covers the technical process, from data preparation to model customization techniques, training strategies, deployment considerations, and post-customization evaluation.

AWS

AWS ML AI

How Northpower used computer vision with AWS to automate safety inspection risk assessments

AWS Machine Learning Blog

SEPTEMBER 27, 2024

Recent events including Tropical Cyclone Gabrielle have highlighted the susceptibility of the grid to extreme weather and emphasized the need for climate adaptation with resilient infrastructure. Data preparation: SageMaker Ground Truth employs a human workforce made up of Northpower volunteers to annotate a set of 10,000 images.

AWS

AWS Data Lakes ML

Unlocking Tabular Data’s Hidden Potential

ODSC - Open Data Science

MAY 10, 2023

Unfortunately, even the data science industry — which should recognize tabular data’s true value — often underestimates its relevance in AI. Many mistakenly equate tabular data with business intelligence rather than AI, leading to a dismissive attitude toward its sophistication.

Data Scientist

Data Scientist Data Science Deep Learning

WiBD Spring Hackathon 2024: A Journey of Learning and Collaboration

Women in Big Data

JULY 19, 2024

The Women in Big Data (WiBD) Spring Hackathon 2024, organized by WiDS, led by WiBD’s Global Hackathon Director Rupa Gangatirkar, and sponsored by Gilead Sciences, offered an exciting opportunity to sharpen data science skills while addressing critical social impact challenges.

Data Science

Data Science Big Data Machine Learning

How LLMs are Transforming Bot Building, Botnet Detection at Scale, and Declarative ML for Engineers

ODSC - Open Data Science

APRIL 13, 2023

Hands-on Data-Centric AI: Data Preparation Tuning — Why and How? Going into developing machine learning models with a hands-on, data-centric AI approach has its benefits and requires a few extra steps to achieve. Here’s how to get there. Here are a few common mashups that may be right up your alley.

ML

ML Data Science Machine Learning

Philips accelerates development of AI-enabled healthcare solutions with an MLOps platform built on Amazon SageMaker

AWS Machine Learning Blog

NOVEMBER 16, 2023

The data science team expected an AI-based automated image annotation workflow to speed up a time-consuming labeling process. Enable a data science team to manage a family of classic ML models for benchmarking statistics across multiple medical units.

ML

ML AWS AI

Bringing More AI to Snowflake, the Data Cloud

DataRobot Blog

FEBRUARY 28, 2023

A seamless user experience when deploying and monitoring DataRobot models to Snowflake. Monitoring service health, drift, and accuracy of DataRobot models in Snowflake. “Organizations are looking for mature data science platforms that can scale to the size of their entire business.” launch event on March 16th.

Exploratory Data Analysis

Exploratory Data Analysis ML AI

What is Data Mining?

Pickl AI

FEBRUARY 21, 2023

Businesses require Data Scientists to perform Data Mining processes and derive valuable data insights using different software and tools. What is Data Mining and how is it related to Data Science? What is Data Mining? Why is Data Mining Important? are the various data mining tools.

Data Mining

Data Mining Data Scientist

LLM distillation techniques to explode in importance in 2024

Snorkel AI

NOVEMBER 9, 2023

LLM distillation will become a much more common and important practice for data science teams in 2024, according to a poll of attendees at Snorkel AI’s 2023 Enterprise LLM Virtual Summit. As data science teams reorient around the enduring value of small, deployable models, they’re also learning how LLMs can accelerate data labeling.

Data Science

Data Science Data Scientist Data Preparation AI

5 Free Data Visualization Tools to Showcase Your Data in 2024

ODSC - Open Data Science

FEBRUARY 19, 2024

Most of these features also come with AI assistance to help users find the best way to visualize their data. One thing that sets it apart is Power BI’s ability to simplify the often complex and time-consuming task of data preparation. Interested in attending an ODSC event? Learn more about our upcoming events here.

Data Visualization

Data Visualization Power BI Tableau Data Science

“Fall in love with your data”—Snorkel AI’s Enterprise LLM Summit

Snorkel AI

JANUARY 26, 2024

Snorkel AI’s Jan. 25 Enterprise LLM Summit: Building GenAI with Your Data drew over a thousand engaged attendees across three and a half hours and nine sessions. The eight speakers at the event, the second in our Enterprise LLM series, united around one theme: AI data development drives enterprise AI success.

Data Science

Data Science AI Machine Learning

13 Companies Leading the Way in AI Development

ODSC - Open Data Science

OCTOBER 9, 2023

HPCC is a high-performance computing platform that helps organizations process and analyze large amounts of data. Qwak is a data science platform that simplifies and accelerates the machine learning lifecycle. It provides a unified platform for data preparation, model training, deployment, and monitoring.

Machine Learning

Machine Learning Data Science AI

Principles of MLOps

Heartbeat

FEBRUARY 1, 2023

First, we have data scientists who are in charge of creating and training machine learning models. They might also help with data preparation and cleaning. The machine learning engineers are in charge of taking the models developed by data scientists and deploying them into production.

Machine Learning

Machine Learning Data Scientist ML

AI Development Lifecycle Learnings of What Changed with LLMs

ODSC - Open Data Science

FEBRUARY 5, 2025

Common Pitfalls in LLM Development. Neglecting Data Preparation: Poorly prepared data leads to subpar evaluation and iterations, reducing generalizability and stakeholder confidence. Real-world applications often expose gaps that proper data preparation could have preempted. Evaluation: Tools like Notion.

Data Preparation

Data Preparation AI Data Scientist

How Vericast optimized feature engineering using Amazon SageMaker Processing

AWS Machine Learning Blog

MAY 3, 2023

This includes gathering, exploring, and understanding the business and technical aspects of the data, along with evaluation of any manipulations that may be needed for the model building process. One aspect of this data preparation is feature engineering.

AWS

AWS Machine Learning ML

Is your model good? A deep dive into Amazon SageMaker Canvas advanced metrics

AWS Machine Learning Blog

JULY 31, 2023

Data preparation, feature engineering, and feature impact analysis are techniques that are essential to model building. These activities play a crucial role in extracting meaningful insights from raw data and improving model performance, leading to more robust and insightful results.

ML

ML Data Preparation Machine Learning

Introducing our New Book: Implementing MLOps in the Enterprise

Iguazio

DECEMBER 14, 2023

Who This Book Is For: This book is for practitioners in charge of building, managing, maintaining, and operationalizing the ML process end to end: Data science / AI / ML leaders: Heads of Data Science, VPs of Advanced Analytics, AI Lead, etc. The book contains a full chapter dedicated to generative AI. Key Takeaways 1.

ML

ML Data Science Data Preparation

Your Complete Roadmap to Become an Azure Data Scientist

Pickl AI

SEPTEMBER 5, 2024

Summary: This blog provides a comprehensive roadmap for aspiring Azure Data Scientists, outlining the essential skills, certifications, and steps to build a successful career in Data Science using Microsoft Azure. Integration: Seamlessly integrates with popular Data Science tools and frameworks, such as TensorFlow and PyTorch.

Azure

Azure Data Scientist Data Science Machine Learning

How Kakao Games automates lifetime value prediction from game data using Amazon SageMaker and AWS Glue

AWS Machine Learning Blog

MARCH 1, 2023

The result of these events can be evaluated afterwards so that they make better decisions in the future. With this proactive approach, Kakao Games can launch the right events at the right time. Kakao Games can then create a promotional event to encourage players not to leave the game. However, this approach is reactive.

AWS

AWS ML ETL

On the implementation of digital tools

Dataconomy

OCTOBER 15, 2024

It was a crucial lesson in the power of using tangible, recent events to illustrate potential value. Tool choice may influence design, as each tool has preferred data structures, though corporate strategy and cost considerations may ultimately drive the decision. Future trends: Emerging trends are reshaping the data analytics landscape.

Data Modeling

Data Modeling Data Models Analytics

Everything New Coming to ODSC East 2025

ODSC - Open Data Science

DECEMBER 16, 2024

And, for the tenth anniversary of ODSC East, we are pulling out all of the stops with new tracks, new events, and even a new location. You’ll gain immediate, practical skills in Python, data preparation, machine learning modeling, and retrieval-augmented generation (RAG), all leading up to AI Agents. Find out below!

Machine Learning

Machine Learning Data Preparation Artificial Intelligence

MLOps Landscape in 2023: Top Tools and Platforms

The MLOps Blog

JUNE 27, 2023

See also Thoughtworks’s guide to Evaluating MLOps Platforms. End-to-end MLOps platforms provide a unified ecosystem that streamlines the entire ML workflow, from data preparation and model development to deployment and monitoring. Check out the Metaflow Docs. neptune.ai

Machine Learning

Machine Learning ML

Spur Telecom Growth with Location Intelligence

Precisely

DECEMBER 4, 2023

For example, location-based insights can be delivered through web GIS (geographic information systems) applications or data science models. Enriched consumer data can shed light on the demographic makeup of a community, income levels, psychographics, lifestyle attributes, and more.

Data Science

Data Science Analytics Data Preparation

Time Complexity for Data Scientists

Pickl AI

JULY 2, 2024

Time Complexity in Data Structures and Algorithms: Data structures and algorithms are the building blocks of Data Science workflows. Sorting Algorithms: Sorting algorithms play a crucial role in data preparation. Searching Algorithms: Efficient searching is essential for various Data Science tasks.

Data Scientist

Data Scientist Algorithm Data Science Machine Learning

Optimize pet profiles for Purina’s Petfinder application using Amazon Rekognition Custom Labels and AWS Step Functions

AWS Machine Learning Blog

OCTOBER 18, 2023

The solution focuses on the fundamental principles of developing an AI/ML application workflow of data preparation, model training, model evaluation, and model monitoring. Matthew Chasse is a Data Science consultant at Amazon Web Services, where he helps customers build scalable machine learning solutions.

AWS

AWS ML Machine Learning

Must-Have Prompt Engineering Skills for 2024

ODSC - Open Data Science

JANUARY 29, 2024

They design intricate sequences of prompts, leveraging their knowledge of AI, machine learning, and data science to guide powerful LLMs (Large Language Models) towards complex tasks. Data science methodologies and skills can be leveraged to design these experiments, analyze results, and iteratively improve prompt strategies.

Data Science

Data Science Machine Learning Natural Language Processing

Build an end-to-end MLOps pipeline using Amazon SageMaker Pipelines, GitHub, and GitHub Actions

AWS Machine Learning Blog

DECEMBER 13, 2023

We create an automated model build pipeline that includes steps for data preparation, model training, model evaluation, and registration of the trained model in the SageMaker Model Registry. You can create event-driven workflows triggered by specific events, like when code is pushed to a repository or a pull request is created.

AWS

AWS ML Data Preparation

Introducing watsonx: The future of AI for business

IBM Journey to AI blog

MAY 9, 2023

It offers its users advanced machine learning, data management, and generative AI capabilities to train, validate, tune and deploy AI systems across the business with speed, trusted data, and governance. It helps facilitate the entire data and AI lifecycle, from data preparation to model development, deployment and monitoring.

AI

AI Data Warehouse Machine Learning

Introducing the DataRobot AI Cloud: A Closer Look

DataRobot

SEPTEMBER 14, 2021

DataRobot now delivers both visual and code-centric data preparation and data pipelines, along with automated machine learning that is composable, and can be driven by hosted notebooks or a graphical user experience. Virtual Event. Finally, I’m excited to announce nearly 100 new features in DataRobot 7.2 September 23.

AI

AI Data Pipeline Data Preparation

How to: Focus on three areas for a holistic data governance approach for self-service analytics

Tableau

SEPTEMBER 23, 2021

This enables employees to see data details like definitions and formulas, lineage and ownership information, as well as important data quality notifications, from certification status to events, like if a data source refresh failed and the information isn’t up to date. Data modeling. Data migration.

Data Governance

Data Governance Analytics Tableau
