Sat. Sep 07, 2019 - Fri. Sep 13, 2019

A Data Scientist’s Guide to 8 Types of Sampling Techniques

Analytics Vidhya

Overview: Sampling is a popular statistical concept; learn how it works in this article. We will also talk about eight different types of…

10 Great Python Resources for Aspiring Data Scientists

KDnuggets

This is a collection of 10 interesting resources in the form of articles and tutorials for the aspiring data scientist new to Python, meant to provide both insight and practical instruction when starting on your journey.

Trending Sources

Everything you want to know about GDPR’s Right to be Forgotten in Blockchain

Dataconomy

What is the big problem with the right to be forgotten (right to erasure, Article 17) under the GDPR? Because blockchain is generally immutable and the GDPR requires personal data to be deleted, many people conclude that it is impossible to store any kind of personal data on…

The Role of Big Data In The Maintenance Industry

Smart Data Collective

As industry buzzwords go, “Big Data” has become seemingly ubiquitous. Everyone wants to use big data to improve their operation, and the maintenance department is no exception. Maintenance teams are beginning to embrace big data and analytics to improve performance. By emphasizing “big data”, maintenance can establish predictive maintenance programs, which reduce downtime and save on maintenance costs.

Optimizing The Modern Developer Experience with Coder

Many software teams have migrated their testing and production workloads to the cloud, yet development environments often remain tied to outdated local setups, limiting efficiency and growth. This is where Coder comes in. In our 101 Coder webinar, you’ll explore how cloud-based development environments can unlock new levels of productivity. Discover how to transition from local setups to a secure, cloud-powered ecosystem with ease.

Become a Video Analysis Expert: A Simple Approach to Automatically Generating Highlights using Python

Analytics Vidhya

Overview: Build your own highlights package in Python using a simple approach. That’s right: learn how automatic highlight generation works without using machine…

Train sklearn 100x Faster

KDnuggets

As compute gets cheaper and time to market for machine learning solutions becomes more critical, we’ve explored options for speeding up model training. One of those solutions is to combine elements from Spark and scikit-learn into our own hybrid solution.

More Trending

How Big Data Is Transforming Social Media Marketing

Smart Data Collective

Big Data is one of the most impressive tech advancements to hit the marketing world in recent memory. While it has been tossed around as a buzzword in certain circles, Big Data is much more than a phrase. For a definition, Oracle recommends Gartner’s 2001 description of Big Data: data that contains greater variety, arriving in increasing volume and at ever-higher velocity.

4 Key Aspects of a Data Science Project Every Data Scientist and Leader Should Know

Analytics Vidhya

Overview: A data-science-driven product consists of multiple aspects every leader needs to be aware of. Machine learning algorithms are one part of a whole.

Classification vs Prediction

KDnuggets

It is important to distinguish prediction and classification. In many decision-making contexts, classification represents a premature decision, because classification combines prediction and decision making and usurps the decision maker in specifying costs of wrong decisions.
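
As a minimal sketch of that distinction (not taken from the linked article), the snippet below keeps the prediction, a probability, separate from the decision, which is made only once the decision maker supplies the costs of wrong decisions. The function name `decide` and all cost values are hypothetical.

```python
def decide(p_event: float, cost_false_positive: float, cost_false_negative: float) -> str:
    """Pick the action with the lower expected cost, given a predicted probability."""
    # Expected cost of acting: we pay the false-positive cost if the event does not occur.
    expected_cost_act = (1.0 - p_event) * cost_false_positive
    # Expected cost of waiting: we pay the false-negative cost if the event does occur.
    expected_cost_wait = p_event * cost_false_negative
    return "act" if expected_cost_act < expected_cost_wait else "wait"


# The same prediction leads to different decisions under different costs.
p = 0.2  # hypothetical predicted probability from some model
print(decide(p, cost_false_positive=1.0, cost_false_negative=1.0))
print(decide(p, cost_false_positive=1.0, cost_false_negative=10.0))
```

A classifier with a fixed 0.5 cutoff would return the same label in both cases, which is the premature decision the teaser warns about.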

3 Reasons to Ditch Excel for FP&A Data Consolidation & Validation

DataRobot Blog

Financial Planning and Analysis (FP&A) business professionals are responsible for mapping out a company’s financial future. They transform company goals into actionable plans by analyzing the current state of financial management affairs, then take the time to create a roadmap that details how to reach the destination. Creating those plans requires ingesting massive amounts of data resources, aggregating, cleansing, and standardizing that data, and then performing analysis on the finis…

15 Modern Use Cases for Enterprise Business Intelligence

Large enterprises face unique challenges in optimizing their Business Intelligence (BI) output due to the sheer scale and complexity of their operations. Unlike smaller organizations, where basic BI features and simple dashboards might suffice, enterprises must manage vast amounts of data from diverse sources. What are the top modern BI use cases for enterprise businesses to help you get a leg up on the competition?

5 Reasons Why You Should Store Big Data In The Cloud

Smart Data Collective

Gone are the days when information could only be stored on traditional remote servers in a secluded location. Today, the in-thing is cloud data storage, where information and data are stored electronically online. With this approach, you can store unlimited data online (in the cloud) and access it anywhere. Several essays and many articles have been written on storage clouds and the benefits of the cloud, but this piece puts forward five of the biggest benefits that yo…

WNS Analytics Wizard 2019: Top 3 Winners’ Solutions from our Biggest Data Science Hackathon

Analytics Vidhya

Overview: Here’s a unique data science challenge we don’t come across often: a marketing analytics hackathon! We bring you the top 3 inspiring…

Many Heads Are Better Than One: The Case For Ensemble Learning

KDnuggets

While ensembling techniques are notoriously hard to set up, operate, and explain, with the latest modeling, explainability and monitoring tools, they can produce more accurate and stable predictions. And better predictions can be better for business.

Talking with Coz: Pure Origins and the Future of Storage

DataCentric podcast

Want to hear a good origin story? Or about the future of data? You're in luck. As Pure Storage heads into its annual Pure Accelerate Conference in Austin next week, it's looking to celebrate its 10th anniversary: ten years in which Pure has grown from a seed-stage start-up to a ~$4B publicly traded company. And Pure continues to be a disrupter in the storage industry.

Marketing Operations in 2025: A New Framework for Success

Speaker: Mike Rizzo, Founder & CEO, MarketingOps.com and Darrell Alfonso, Director of Marketing Strategy and Operations, Indeed.com

Though rarely in the spotlight, marketing operations are the backbone of the efficiency, scalability, and alignment that define top-performing marketing teams. In this exclusive webinar led by industry visionaries Mike Rizzo and Darrell Alfonso, we’re giving marketing operations the recognition they deserve! We will dive into the 7 P Model —a powerful framework designed to assess and optimize your marketing operations function.

AI Drives The Inception Of Three Cutting-Edge Smart Home Products

Smart Data Collective

Artificial intelligence is coming to our homes. A growing number of people use smart devices that are developed with state-of-the-art AI technology. The market for smart homes is going to rise as new AI advances bring big changes to the industry. One survey from last year found that only 12-16% of homes in the United States are equipped with smart devices.

AI Simplified: Supervised Machine Learning

DataRobot

It is well-known that the AI revolution is transforming industries and businesses around the world. In this AI Simplified video, we define supervised machine learning and share some ways the military leverages this technology to maintain safety and ensure preparedness.

Scikit-Learn vs mlr for Machine Learning

KDnuggets

How does the scikit-learn machine learning library for Python compare to the mlr package for R? Follow along with a machine learning workflow through each approach, and see if you can gain a competitive advantage by knowing both frameworks.

The 5 Graph Algorithms That Data Scientists Should Know

KDnuggets

In this post, I am going to be talking about some of the most important graph algorithms you should know and how to implement them using Python.

Prepare Now: 2025’s Must-Know Trends For Product And Data Leaders

Speaker: Jay Allardyce, Deepak Vittal, and Terrence Sheflin

As we look ahead to 2025, business intelligence and data analytics are set to play pivotal roles in shaping success. Organizations are already starting to face a host of transformative trends as the year comes to a close, including the integration of AI in data analytics, an increased emphasis on real-time data insights, and the growing importance of user experience in BI solutions.

There is No Free Lunch in Data Science

KDnuggets

There is no such thing as a free lunch in life or data science. Here, we'll explore some science philosophy and discuss the No Free Lunch theorems to find out what they mean for the field of data science.

Common Machine Learning Obstacles

KDnuggets

In this blog, Seth DeLand of MathWorks discusses two of the most common obstacles, which relate to choosing the right classification model and eliminating data overfitting.

The State of Transfer Learning in NLP

KDnuggets

This post expands on the NAACL 2019 tutorial on Transfer Learning in NLP organized by Matthew Peters, Swabha Swayamdipta, Thomas Wolf, and Sebastian Ruder. This post highlights key insights and takeaways and provides updates based on recent work.

BERT is changing the NLP landscape

KDnuggets

BERT is changing the NLP landscape and making chatbots much smarter by enabling computers to better understand speech and respond intelligently in real-time.

The Cloud Development Environment Adoption Report

Cloud Development Environments (CDEs) are changing how software teams work by moving development to the cloud. Our Cloud Development Environment Adoption Report gathers insights from 223 developers and business leaders, uncovering key trends in CDE adoption. With 66% of large organizations already using CDEs, these platforms are quickly becoming essential to modern development practices.

Can graph machine learning identify hate speech in online social networks?

KDnuggets

Online hate speech is a complex subject. Follow this demonstration using state-of-the-art graph neural network models to detect hateful users based on their activities on the Twitter social network.

OpenStreetMap Data to ML Training Labels for Object Detection

KDnuggets

I am really interested in creating a tight, clean pipeline for disaster relief applications, where we can use something like crowd sourced building polygons from OSM to train a supervised object detector to discover buildings in an unmapped location.

Ensemble Methods for Machine Learning: AdaBoost

KDnuggets

It turns out that if we ask the weak algorithm to create a whole bunch of classifiers (all weak by definition) and then combine them all, what may emerge is a stronger classifier.
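
The idea of combining weak classifiers can be sketched in a few lines. This is an illustrative toy under stated assumptions, not the linked article's code: the weak learner is a one-dimensional threshold rule (a "stump"), labels are +1/-1, and the names `best_stump`, `adaboost`, and `predict` as well as the ten-point dataset are hypothetical.

```python
import math


def best_stump(xs, ys, weights):
    """Return (weighted_error, threshold, polarity) of the best 1-D threshold rule."""
    best = None
    for threshold in sorted(set(xs)):
        for polarity in (1, -1):
            # The rule predicts `polarity` for x < threshold and `-polarity` otherwise.
            preds = [polarity if x < threshold else -polarity for x in xs]
            err = sum(w for w, p, y in zip(weights, preds, ys) if p != y)
            if best is None or err < best[0]:
                best = (err, threshold, polarity)
    return best


def adaboost(xs, ys, rounds):
    """Fit `rounds` stumps, reweighting the data after each one."""
    n = len(xs)
    weights = [1.0 / n] * n
    ensemble = []  # list of (alpha, threshold, polarity)
    for _ in range(rounds):
        err, threshold, polarity = best_stump(xs, ys, weights)
        err = min(max(err, 1e-10), 1 - 1e-10)  # guard against log(0) and division by zero
        alpha = 0.5 * math.log((1 - err) / err)  # vote weight of this stump
        ensemble.append((alpha, threshold, polarity))
        # Up-weight misclassified points, down-weight correct ones, then renormalize.
        preds = [polarity if x < threshold else -polarity for x in xs]
        weights = [w * math.exp(-alpha * y * p) for w, y, p in zip(weights, ys, preds)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return ensemble


def predict(ensemble, x):
    """Weighted vote of all stumps in the ensemble."""
    score = sum(alpha * (pol if x < thr else -pol) for alpha, thr, pol in ensemble)
    return 1 if score >= 0 else -1


# Toy data that no single threshold rule can separate.
xs = list(range(10))
ys = [1, 1, 1, -1, -1, -1, 1, 1, 1, -1]

for rounds in (1, 3):
    model = adaboost(xs, ys, rounds)
    correct = sum(predict(model, x) == y for x, y in zip(xs, ys))
    print(rounds, "round(s):", correct, "of", len(xs), "training points correct")
```

Each stump alone misclassifies part of this data; the reweighting step forces later stumps to focus on the points earlier ones got wrong, and the weighted vote combines them.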

A 2019 Guide to Speech Synthesis with Deep Learning

KDnuggets

In this article, we’ll look at research and model architectures that have been written and developed to do just that using deep learning.

How to Drive Cost Savings, Efficiency Gains, and Sustainability Wins with MES

Speaker: Nikhil Joshi, Founder & President of Snic Solutions

Is your manufacturing operation reaching its efficiency potential? A Manufacturing Execution System (MES) could be the game-changer, helping you reduce waste, cut costs, and lower your carbon footprint. Join Nikhil Joshi, Founder & President of Snic Solutions, in this value-packed webinar as he breaks down how MES can drive operational excellence and sustainability.

How DeepMind and Waymo are Using Evolutionary Competition to Train Self-Driving Vehicles

KDnuggets

Recently, Alphabet’s subsidiaries Waymo and DeepMind partnered to find a more efficient process for training self-driving vehicle algorithms, and their work took them back to one of the cornerstones of our history as a species: evolution.

Discover Your Path Toward Data Science with ODSC’s Mini-Bootcamp

KDnuggets

ODSC has developed a mini-bootcamp, designed to reduce the time and monetary costs of discovering which pathway into data science you should take. In this article, we’ll discuss seven reasons why ODSC’s Mini-Bootcamp might be right for you.

Data Driven Government – Agenda, Washington, DC, Sep 25

KDnuggets

Data Driven Government is coming to Washington, DC, Sep 26, and includes a stellar lineup of experts who will share the emerging trends and best practices of government agencies in the current use of data analytics to enhance mission outcomes. Use code KDNUGGETS to get 15% off.

Clearsense chooses Io-Tahoe’s Smart Data Discovery to navigate healthcare data challenges

KDnuggets

Io-Tahoe, a pioneer in Smart Data Discovery and AI-Driven Data Catalog products, has announced that Clearsense, a scalable data platform as a service built for healthcare, has chosen the smart data discovery platform to automatically discover and catalog relationships across immense amounts of medical and clinical data.

Improving the Accuracy of Generative AI Systems: A Structured Approach

Speaker: Anindo Banerjea, CTO at Civio & Tony Karrer, CTO at Aggregage

When developing a Gen AI application, one of the most significant challenges is improving accuracy. This can be especially difficult when working with a large data corpus, and as the complexity of the task increases. The number of use cases/corner cases that the system is expected to handle essentially explodes. 💥 Anindo Banerjea is here to showcase his significant experience building AI/ML SaaS applications as he walks us through the current problems his company, Civio, is solving.