March, 2025

article thumbnail

@HPCpodcast: Dr. Ian Cutress on the State of Advanced Chips, the GPU Landscape and AI Compute, Global Chip Manufacturing and GTC Expectations

insideBIGDATA

[link] Just before GTC (and for the 100th episode of the @HPCpodcast and this one sponsored by liquid cooling company CoolIT), we welcome special guest and high-powered chip industry analyst Dr.

AI 329
article thumbnail

Towards AI-Driven Sign Language Generation with Non-Manual Markers

Machine Learning Research at Apple

Sign languages are essential for the Deaf and Hard-of-Hearing (DHH) community. Sign language generation systems have the potential to support communication by translating from written languages, such as English, into signed videos. However, current systems often fail to meet user needs due to poor translation of grammatical structures, the absence of facial cues and body language, and insufficient visual and motion fidelity.

AI 259
professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Evaluating LLMs Series Part 1: Evaluating Language Models with BLEU Metric

Analytics Vidhya

In artificial intelligence, evaluating the performance of language models presents a unique challenge. Unlike image recognition or numerical predictions, language quality assessment doesn’t yield to simple binary measurements. Enter BLEU (Bilingual Evaluation Understudy), a metric that has become the cornerstone of machine translation evaluation since its introduction by IBM researchers in 2002.

article thumbnail

GPT 4.5: The New Addition to Open AI’s GPT Family

Data Science Dojo

The world of AI never stands still, and 2025 is proving to be a groundbreaking year. The first big moment came with the launch of DeepSeek -V3, a highly advanced large language model (LLM) that made waves with its cutting-edge advancements in training optimization, achieving remarkable performance at a fraction of the cost of its competitors. Now, the next major milestone of the AI world is here – Open AI’s GPT 4.5.

AI 209
article thumbnail

How to Achieve High-Accuracy Results When Using LLMs

Speaker: Ben Epstein, Stealth Founder & CTO | Tony Karrer, Founder & CTO, Aggregage

When tasked with building a fundamentally new product line with deeper insights than previously achievable for a high-value client, Ben Epstein and his team faced a significant challenge: how to harness LLMs to produce consistent, high-accuracy outputs at scale. In this new session, Ben will share how he and his team engineered a system (based on proven software engineering approaches) that employs reproducible test variations (via temperature 0 and fixed seeds), and enables non-LLM evaluation m

article thumbnail

Headroom for AI development

Machine Learning (Theory)

( Dylan Foster and Alex Lamb both helped in creating this.) In thinking about what are good research problems, its sometimes helpful to switch from what is understood to what is clearly possible. This encourages us to think beyond simply improving the existing system. For example, we have seen instances throughout the history of machine learning where researchers have argued for fixing an architecture and using it for short-term success, ignoring potential for long-term disruption.

AI 157
article thumbnail

Chaos in Cloudflare’s Lisbon office: securing the Internet with wave motion

Hacker News

Over the years, Cloudflare has gained fame for many things, including our technical blog, but also as a tech company securing the Internet using lava lamps , a story that began as a research/science project almost 10 years ago. In March 2025, we added another layer to its legacy: a "wall of entropy" made of 50 wave machines in constant motion at our Lisbon office, the company's European HQ.

AI 175

More Trending

article thumbnail

Exploring Prediction Targets in Masked Pre-Training for Speech Foundation Models

Machine Learning Research at Apple

Speech foundation models, such as HuBERT and its variants, are pre-trained on large amounts of unlabeled speech data and then used for a range of downstream tasks. These models use a masked prediction objective, where the model learns to predict information about masked input segments from the unmasked context. The choice of prediction targets in this framework impacts their performance on downstream tasks.

179
179
article thumbnail

Why RAG Systems Fail and How to Fix Them

Analytics Vidhya

Retrieval-Augmented Generation (RAG) enhances large language models (LLMs) by integrating external knowledge, making responses more informative and context-aware. However, RAG fails in many scenarios, affecting its ability to generate accurate and relevant outputs. These issues in RAG systems impact applications in various domains, from customer support to research and content generation.

Analytics 196
article thumbnail

Cool Site Shows Exactly Which Books Zuckerberg's Minions Illegally Downloaded to Train Meta's AI

Flipboard

For all the revolutionary change artificial intelligence promises, it also makes lofty demands. For starters, AI is extraordinarily power hungry. Generating all the electricity that AI datacenters consume takes forest-loads of energy, not to mention hardware and cooling infrastructure. That stuff all costs a lot, making AI a huge money pit. That's had a big effect on our economy, as the tiniest bit of AI hype can send huge shockwaves through Wall Street and beyond.

AI 162
article thumbnail

The Hundred-Page Language Models Book: A Great Technical Intro to LLMs

KDnuggets

The Hundred-Page Language Models Book is the LLM book you shouldn't miss.

333
333
article thumbnail

The 2nd Generation of Innovation Management: A Survival Guide

Speaker: Chris Townsend, VP of Product Marketing, Wellspring

Over the past decade, companies have embraced innovation with enthusiasm—Chief Innovation Officers have been hired, and in-house incubators, accelerators, and co-creation labs have been launched. CEOs have spoken with passion about “making everyone an innovator” and the need “to disrupt our own business.” But after years of experimentation, senior leaders are asking: Is this still just an experiment, or are we in it for the long haul?

article thumbnail

Firmware update bricks HP printers, makes them unable to use HP cartridges

Hacker News

HP, along with other printer brands , is infamous for issuing firmware updates that brick already-purchased printers that have tried to use third-party ink. In a new form of frustration, HP is now being accused of issuing a firmware update that broke customers laser printerseven though the devices are loaded with HP-brand toner. The firmware update in question is version 20250209, which HP issued on March 4 for its LaserJet MFP M232-M237models.

181
181
article thumbnail

Nexla Expands AI-Powered Integration Platform for Enterprise-Grade GenAI

insideBIGDATA

SAN MATEO, Calif., March 04, 2025 — AI-powered integration company Nexla announced a major update to the Nexla Integration Platform, expanding its no-code integration, RAG pipeline engineering, and data governance capabilities with the intent to make enterprise-grade GenAI more accessible.

article thumbnail

When Does a Predictor Know Its Own Loss?

Machine Learning Research at Apple

Given a predictor and a loss function, how well can we predict the loss that the predictor will incur on an input? This is the problem of loss prediction, a key computational task associated with uncertainty estimation for a predictor. In a classification setting, a predictor will typically predict a distribution over labels and hence have its own estimate of the loss that it will incur, given by the entropy of the predicted distribution.

162
162
article thumbnail

Top 5 Generative AI Breakthroughs of February 2025: GPT-4.5, Grok-3, and More!

Analytics Vidhya

February 2025 has been yet another game-changing month for generative AI, bringing us some of the most anticipated model upgrades and groundbreaking new features. From xAIs Grok 3 and Anthropics Claude 3.7 Sonnet, to OpenAIs GPT-4.5 and the promise of GPT-5, this month saw fierce competition in the AI race. Meanwhile, both OpenAI and Perplexity […] The post Top 5 Generative AI Breakthroughs of February 2025: GPT-4.5, Grok-3, and More!

AI 206
article thumbnail

Apache Airflow® Best Practices: DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Why extracting data from PDFs is still a nightmare for data experts

Flipboard

For years, businesses, governments, and researchers have struggled with a persistent problem: How to extract usable data from Portable Document Format (PDF) files. These digital documents serve as containers for everything from scientific research to government records, but their rigid formats often trap the data inside , making it difficult for machines to read and analyze.

article thumbnail

Python Tooling Beyond Pandas: Libraries to Broaden Your Data Science Toolkit

KDnuggets

Pandas alternative libraries that you might not know before.

article thumbnail

Deep Learning Is Not So Mysterious or Different

Hacker News

Deep neural networks are often seen as different from other model classes by defying conventional notions of generalization. Popular examples of anomalous generalization behaviour include benign overfitting, double descent, and the success of overparametrization. We argue that these phenomena are not distinct to neural networks, or particularly mysterious.

article thumbnail

Palantir and Databricks Announce AI Product Partnership

insideBIGDATA

SAN FRANCISCO,March 13, 2025 —DatabricksandPalantir Technologies Inc.(NASDAQ:PLTR), provider of enterprise operating systems, today announced a strategic product partnership that combines Palantir’s AI operating system and Databricks’ platform for AI, data warehousing and data engineering.

article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

DR-MPC: Deep Residual Model Predictive Control for Real-World Social Navigation

Machine Learning Research at Apple

How can a robot safely navigate around people exhibiting complex motion patterns? Reinforcement Learning (RL) or Deep RL (DRL) in simulation holds some promise, although much prior work relies on simulators that fail to precisely capture the nuances of real human motion. To address this gap, we propose Deep Residual Model Predictive Control (DR-MPC), a method to enable robots to quickly and safely perform DRL from real-world crowd navigation data.

147
147
article thumbnail

Enhancing Code Quality with LangGraph Reflection

Analytics Vidhya

The LangGraph Reflection Framework is a type of agentic framework which offers a powerful way to improve language model outputs through an iterative critique process using Generative AI. This article breaks down how to implement a reflection agent that validates Python code using Pyright and improves its quality using GPT-4o mini. AI agents play a crucial role […] The post Enhancing Code Quality with LangGraph Reflection appeared first on Analytics Vidhya.

Python 182
article thumbnail

Nvidia announces “Rubin Ultra” and “Feynman” AI chips for 2027 and 2028

Flipboard

On Tuesday at Nvidia's GTC 2025 conference in San Jose, California, CEO Jensen Huang revealed several new AI-accelerating GPUs the company plans to release over the coming months and years. He also revealed more specifications about previously announced chips. The centerpiece announcement was Vera Rubin, first teased at Computex 2024 and now scheduled for release in the second half of 2026.

AI 172
article thumbnail

Statistical Methods for Evaluating LLM Performance

Machine Learning Mastery

In this article, we explore statistical methods for evaluating LLM performance, an essential step to guarantee stability and effectiveness.

292
292
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

SmolDocling: An ultra-compact VLM for end-to-end multi-modal document conversion

Hacker News

We introduce SmolDocling, an ultra-compact vision-language model targeting end-to-end document conversion. Our model comprehensively processes entire pages by generating DocTags, a new universal markup format that captures all page elements in their full context with location. Unlike existing approaches that rely on large foundational models, or ensemble solutions that rely on handcrafted pipelines of multiple specialized models, SmolDocling offers an end-to-end conversion for accurately capturi

122
122
article thumbnail

@HPCpodcast: Live from GTC 2025, Among the Crowds for the New AI Compute Landscape

insideBIGDATA

[link] In what is becoming an annual tradition for the @HPCpodcast, we present “Live from Nvidia GTC 2025,” covering highlights from the Nvidia extravaganza with an AI-everywhere theme.

AI 195
article thumbnail

Fundamental Challenges in Evaluating Text2SQL Solutions and Detecting Their Limitations

Machine Learning Research at Apple

In this work, we dive into the fundamental challenges of evaluating Text2SQL solutions and highlight potential failure causes and the potential risks of relying on aggregate metrics in existing benchmarks. We identify two largely unaddressed limitations in current open benchmarks: (1) data quality issues in the evaluation data mainly attributed to the lack of capturing the probabilistic nature of translating a natural language description into a structured query (e.g., NL ambiguity), and (2) the

SQL 130
article thumbnail

Generative AI Salary Trends 2025 Edition

Analytics Vidhya

Wondering what all the buzz is about? Or maybe youre asking yourself: Is this the right career move for me? Generative AI is taking the world by storm, and with it comes a gold rush for talent. From generating images to powering chatbots that sound eerily human, professionals in this field are in high demand […] The post Generative AI Salary Trends 2025 Edition appeared first on Analytics Vidhya.

AI 180
article thumbnail

Marketing Operations in 2025: A New Framework for Success

Speaker: Mike Rizzo, Founder & CEO, MarketingOps.com and Darrell Alfonso, Director of Marketing Strategy and Operations, Indeed.com

Though rarely in the spotlight, marketing operations are the backbone of the efficiency, scalability, and alignment that define top-performing marketing teams. In this exclusive webinar led by industry visionaries Mike Rizzo and Darrell Alfonso, we’re giving marketing operations the recognition they deserve! We will dive into the 7 P Model —a powerful framework designed to assess and optimize your marketing operations function.

article thumbnail

World's first "Synthetic Biological Intelligence" runs on living human cells

Flipboard

The world's first "biological computer" that fuses human brain cells with silicon hardware to form fluid neural networks has been commercially launched, ushering in a new age of AI technology. The CL1, from Australian company Cortical Labs, offers a whole new kind of computing intelligence one that's more dynamic, sustainable and energy efficient than any AI that currently exists and we will start to see its potential when it's in users' hands in the coming months

AI 182
article thumbnail

Unlocking the Potential of AI Agents: From Pilots to Production Success

databricks

While 85% of global enterprises already use Generative AI (GenAI), organizations face significant challenges scaling these projects beyond the pilot phase. Even the most advanced.

AI 296
article thumbnail

Pen and Paper Exercises in Machine Learning

Hacker News

This is a collection of (mostly) pen-and-paper exercises in machine learning. The exercises are on the following topics: linear algebra, optimisation, directed graphical models, undirected graphical models, expressive power of graphical models, factor graphs and message passing, inference for hidden Markov models, model-based learning (including ICA and unnormalised models), sampling and Monte-Carlo integration, and variational inference.

article thumbnail

Lenovo Unveils AI Inferencing Server

insideBIGDATA

March 3, 2025 Today, Lenovo announced an entry-level edge AI inferencing server designed to make edge AI accessible and affordable for SMBs and enterprises alike1.

AI 195
article thumbnail

Prepare Now: 2025s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, Terrence Sheflin, and Mahyar Ghasemali

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.