Thu.May 02, 2024

article thumbnail

Finetuning Llama 3 with Odds Ratio Preference Optimization

Analytics Vidhya

Introduction Large Language Models are often trained rather than built, requiring multiple steps to perform well. These steps, including Supervised Fine Tuning (SFT) and Preference Alignment, are crucial for learning new things and aligning with human responses. However, each step takes a significant amount of time and computing resources. One solution is the Odd Ratio […] The post Finetuning Llama 3 with Odds Ratio Preference Optimization appeared first on Analytics Vidhya.

Analytics 311
article thumbnail

Revolutionizing Data in Sports: The Game-Changing Impact of Databricks Marketplace and Delta Sharing

databricks

Unlock the power of advanced sports analytics with Databricks Marketplace and Delta Sharing. Discover how these platforms are transforming the sports industry by enabling seamless data access, collaboration, and real-time insights. Leverage a diverse array of data assets to optimize performance, enhance fan engagement, and gain a competitive edge. Explore the future of sports analytics, powered by Databricks.

Analytics 226
professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Trending Sources

article thumbnail

World’s First Autonomous Car Race Held at Abu Dhabi’s Yas Marina

Analytics Vidhya

Can you imagine a car race without any drivers? Well, it’s no longer imagination, but a reality now! That’s right, the world witnessed the first-ever professional autonomous car race over the weekend. The Abu Dhabi Autonomous Racing League (A2RL) held at Yas Marina marked a significant leap in the motor racing world. With autonomous cars […] The post World’s First Autonomous Car Race Held at Abu Dhabi’s Yas Marina appeared first on Analytics Vidhya.

Analytics 292
article thumbnail

Conformal Prediction via Regression-as-Classification

Machine Learning Research at Apple

Conformal prediction (CP) for regression can be challenging, especially when the output distribution is heteroscedastic, multimodal, or skewed. Some of the issues can be addressed by estimating a distribution over the output, but in reality, such approaches can be sensitive to estimation error and yield unstable intervals. Here, we circumvent the challenges by converting regression to a classification problem and then use CP for classification to obtain CP sets for regression.

219
219
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

How to Improve Dataset Selection with ChatGPT?

Analytics Vidhya

Introduction Right choice of appropriate datasets is essential in today’s data-driven environment to facilitate well-informed decision-making and uncover insightful information. It might be intimidating to navigate the enormous amount of data that is available, though. This article examines how the dataset selection process can be streamlined by using ChatGPT.

Analytics 292
article thumbnail

Containerize Python Apps with Docker in 5 Easy Steps

KDnuggets

Get up and running with Docker with this tutorial on containerizing Python applications.

Python 311

More Trending

article thumbnail

Extremist Militias Are Coordinating in More Than 100 Facebook Groups

Hacker News

After lying low for years in the aftermath of January 6, militia extremist groups and profiles have been quietly reorganizing and ramping up recruitment and rhetoric on Facebook.

182
182
article thumbnail

Meet India’s ChatGPT Rival – Hanooman GPT is Here!

Analytics Vidhya

Introduction It’s not Tuesday, but it’s still a Hanooman’s day. Finally, the SML-powered Hanooman GPT is here! India now has its own indigenous alternative to OpenAI’s viral ChatGPT model. Hanooman GPT is a series of open-source Indic large language models developed by the Indian Institute of Technology (IIT) Bombay in partnership with healthcare AI firm […] The post Meet India’s ChatGPT Rival – Hanooman GPT is Here!

Analytics 283
article thumbnail

Microsoft bans U.S. police departments from using enterprise AI tool

Hacker News

Microsoft has changed its policy to ban U.S. police departments from using generative AI through the Azure OpenAI Service, the company’s fully managed, enterprise-focused wrapper around OpenAI technologies.

Azure 182
article thumbnail

Gecko by Google: Pioneering the Next Generation of Text Embedding Models

Analytics Vidhya

Introduction Welcome to the world of text embeddings where text is converted into numbers! This world has recently been turned around by the distillation of large language models (LLMs) into efficient and compact forms. Google’s latest innovation, Gecko, is the lastest advancement in this technology, revolutionizing the way we handle textual data.

Analytics 275
article thumbnail

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

article thumbnail

Nurses Say Hospital Adoption of Half-Cooked 'AI' Is Reckless

Hacker News

We've noted repeatedly that while "AI" (language learning models) hold a lot of potential, the rushed implementation of half-assed early variants are causing no shortage of headaches across journalism, media, health care, and other sectors.

AI 181
article thumbnail

AI on the Go: Anthropic Launches Claude Mobile App

Analytics Vidhya

San Francisco-based AI company Anthropic is making waves with the launch of its first smartphone app, bringing Claude to mobiles. This move positions Anthropic as a serious contender in the AI game, going head-to-head with industry giants like OpenAI and Google. The new iPhone app caters to both free and paid Claude users. It seamlessly […] The post AI on the Go: Anthropic Launches Claude Mobile App appeared first on Analytics Vidhya.

AI 273
article thumbnail

The life and times of an Abstract Syntax Tree

Hacker News

By Francesco Bertolaccini You've reached computer programming nirvana. Your journey has led you down many paths, including believing that God wrote the universe in LISP, but now the truth is clear in your mind: every problem can be solved by writing one more compiler. It's true.

article thumbnail

Gemini Upgrade 2024: Focus on Boosting Power and Accessibility

Analytics Vidhya

This is exciting news for language enthusiasts and AI users worldwide! Gemini Upgrade is here! On April 30, 2024, the Gemini mobile app received a major update that expanded its reach and accessibility worldwide. This update breaks down language barriers, making Gemini an international AI experience. The Gemini app is now available in various languages, […] The post Gemini Upgrade 2024: Focus on Boosting Power and Accessibility appeared first on Analytics Vidhya.

Analytics 270
article thumbnail

Apache Airflow®: The Ultimate Guide to DAG Writing

Speaker: Tamara Fingerlin, Developer Advocate

In this new webinar, Tamara Fingerlin, Developer Advocate, will walk you through many Airflow best practices and advanced features that can help you make your pipelines more manageable, adaptive, and robust. She'll focus on how to write best-in-class Airflow DAGs using the latest Airflow features like dynamic task mapping and data-driven scheduling!

article thumbnail

Dissecting LockBit v3 Ransomware

Hacker News

We analyzed a variant of LockBit v3 ransomware, and rediscovered a bug that allows us to decrypt some data without paying the ransom. We also found a design flaw that may cause permanent data loss.

179
179
article thumbnail

Fine-tune Llama 3 using Direct Preference Optimization

Analytics Vidhya

Introduction Large Language Models have revolutionized productivity by enabling tasks like Q&A, dynamic code generation, and agentic systems. However, pre-trained vanilla models are often biased and can produce harmful content. To improve performance, algorithms like Reinforcement Learning with Human Feedback and Direct Preference Optimization (DPO) can be used.

Algorithm 270
article thumbnail

Pleasure or Pain? He Maps the Neural Circuits That Decide

Hacker News

The work of the neuroscientist Ishmail Abdus-Saboor has opened up a world of insights into precisely how much pleasure and pain animals experience during different forms of touch.

179
179
article thumbnail

Meet Victoria Shi, the World’s First AI-Generated Foreign Minister

Analytics Vidhya

Ukraine has embarked on a pioneering venture by introducing the world’s first ‘AI diplomat’. The country has developed an artificial intelligence (AI) generated spokesperson to be the face and voice of their Ministry of Foreign Affairs. The AI, named Victoria Shi, has been launched to announce updates on various fronts, on behalf of the Ukrainian […] The post Meet Victoria Shi, the World’s First AI-Generated Foreign Minister appeared first on Analytics Vidhya.

article thumbnail

The Cloud Development Environment Adoption Report

Cloud Development Environments (CDEs) are changing how software teams work by moving development to the cloud. Our Cloud Development Environment Adoption Report gathers insights from 223 developers and business leaders, uncovering key trends in CDE adoption. With 66% of large organizations already using CDEs, these platforms are quickly becoming essential to modern development practices.

article thumbnail

Silk Helped the Armies of Genghis Khan Conquer Asia

Hacker News

At least until the spring of 1221, Merv, now in Turkmenistan, from which the tiny eggs of Bombyx mori had moved to Persia and the lands to its west, was still a most splendid city. Around March 6 of that year, it ceased to be so.

178
178
article thumbnail

Announcement: Meta’s Llama 3 Hackathon Offers $10k+ in Prizes

Analytics Vidhya

Hey there, AI fans! Meta, along with Cerebral Valley and SHACK15, is throwing a YOU-DON’T-WANNA-MISS event for you – the Meta Llama 3 Hackathon on May 11th and 12th! It’s your chance to build amazing apps using Meta’s newest and coolest large language model, Llama 3. In this event you will dive into the latest […] The post Announcement: Meta’s Llama 3 Hackathon Offers $10k+ in Prizes appeared first on Analytics Vidhya.

Analytics 241
article thumbnail

The Snapdragon 855's iGPU

Hacker News

Qualcomm's Adreno 6xx architecture has been superseded Adreno 7xx, but it's still used in countless devices, including the current-gen Snapdragon 8cx Gen 3. Here, I'll be looking at the Adreno 640 GPU in the Snapdragon 855. Zarif98 on Reddit kindly provided a OnePlus 7 Pro, and I'll be using that to check out Adreno 640.

178
178
article thumbnail

Getting Started with PyTest: Effortlessly Write and Run Tests in Python

KDnuggets

Exploring the Test-Driven Development Paradigm in Python

Python 238
article thumbnail

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

article thumbnail

A lawsuit argues Meta is required by law to let you control your own feed

Hacker News

A new lawsuit argues US law requires Meta to give users more control over their Facebook feeds. The outcome could reshape how the platform’s algorithm affects our lives.

Algorithm 182
article thumbnail

How Much We Work

FlowingData

In our younger years, we have school and more important things to do, but then we get older and there are bills to pay. These charts show the shift and the sweet release of retirement.

131
131
article thumbnail

'I will never go back': Ontario doctor says new AI notetaking saved her job

Hacker News

Ontario is piloting artificial intelligence software to help doctors take notes and reduce the paperwork they have to do. One doctor says it saved her career.

article thumbnail

AWS Inferentia and AWS Trainium deliver lowest cost to deploy Llama 3 models in Amazon SageMaker JumpStart

AWS Machine Learning Blog

Today, we’re excited to announce the availability of Meta Llama 3 inference on AWS Trainium and AWS Inferentia based instances in Amazon SageMaker JumpStart. The Meta Llama 3 models are a collection of pre-trained and fine-tuned generative text models. Amazon Elastic Compute Cloud (Amazon EC2) Trn1 and Inf2 instances, powered by AWS Trainium and AWS Inferentia2, provide the most cost-effective way to deploy Llama 3 models on AWS.

AWS 123
article thumbnail

Introducing CDEs to Your Enterprise

Explore how enterprises can enhance developer productivity and onboarding by adopting self-hosted Cloud Development Environments (CDEs). This whitepaper highlights the simplicity and flexibility of cloud-based development over traditional setups, demonstrating how large teams can leverage economies of scale to boost efficiency and developer satisfaction.

article thumbnail

They thought they were joining an accelerator – instead they lost their startups

Hacker News

Lacey Hunter thought all was well as she put her startup through the three-month Newchip accelerator. Then the organization filed for bankruptcy in May 2023.

181
181
article thumbnail

Get started with Amazon Titan Text Embeddings V2: A new state-of-the-art embeddings model on Amazon Bedrock

AWS Machine Learning Blog

Embeddings are integral to various natural language processing (NLP) applications, and their quality is crucial for optimal performance. They are commonly used in knowledge bases to represent textual data as dense vectors, enabling efficient similarity search and retrieval. In Retrieval Augmented Generation (RAG), embeddings are used to retrieve relevant passages from a corpus to provide context for language models to generate informed, knowledge-grounded responses.

AWS 118
article thumbnail

Spotify moves lyrics behind a paywall

Hacker News

Spotify didn't offer any more detail about why it's now paywalling lyrics, but clearly, it's a bid to push more people to its paid tier.

181
181
article thumbnail

Revealing the Secrets of Startup Success: A Venture Capital Investments Challenge

Ocean Protocol

Podium : Venture Capital Investments Data Challenge Introduction The Venture Capital Investments Challenge engaged data scientists and analysts to decode the complexities of startup funding and success. This challenge drew on an extensive dataset covering various aspects of the venture capital ecosystem. Key datasets included acquisitions, degrees, funding rounds, funds, investments, IPOs, milestones, objects, offices, people, relationships, and several specialized sets designed for in-depth ana

article thumbnail

Marketing Operations in 2025: A New Framework for Success

Speaker: Mike Rizzo, Founder & CEO, MarketingOps.com and Darrell Alfonso, Director of Marketing Strategy and Operations, Indeed.com

Though rarely in the spotlight, marketing operations are the backbone of the efficiency, scalability, and alignment that define top-performing marketing teams. In this exclusive webinar led by industry visionaries Mike Rizzo and Darrell Alfonso, we’re giving marketing operations the recognition they deserve! We will dive into the 7 P Model —a powerful framework designed to assess and optimize your marketing operations function.