article thumbnail

Meet the finalists of the Pushback to the Future Challenge

DrivenData Labs

Summary of approach: Our solution for Phase 1 is a gradient boosted decision tree approach with a lot of feature engineering. We used the LightGBM library for boosted decision trees because it has absolute error as a built-in objective function and it is much faster for model training than similar tree ensemble based algorithms.

article thumbnail

Streaming Machine Learning Without a Data Lake

ODSC - Open Data Science

Commonly used technologies for data storage are the Hadoop Distributed File System (HDFS), Amazon S3, Google Cloud Storage (GCS), or Azure Blob Storage, as well as tools like Apache Hive, Apache Spark, and TensorFlow for data processing and analytics.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

Maximizing SaaS application analytics value with AI

IBM Journey to AI blog

Predictive analytics forecast future events based on historical data; AI and ML models—such as regression analysis , neural networks and decision trees —enhance the accuracy of these predictions.

article thumbnail

Artificial Intelligence Using Python: A Comprehensive Guide

Pickl AI

Decision Trees Decision trees recursively partition data into subsets based on the most significant attribute values. Python’s Scikit-learn provides easy-to-use interfaces for constructing decision tree classifiers and regressors, enabling intuitive model visualisation and interpretation.

article thumbnail

Understanding and Building Machine Learning Models

Pickl AI

For example, linear regression is typically used to predict continuous variables, while decision trees are great for classification and regression tasks. Decision trees are easy to interpret but prone to overfitting. predicting house prices), Linear Regression, Decision Trees, or Random Forests could be good choices.

article thumbnail

Efficient Machine Learning Pipelines with DVC and MLFlow

Mlearning.ai

DVC uses external storage such as Azure blob storage, Amazon’s S3, Google cloud storage or even a basic google drive folder in order to store the version history of large data. For the sake of this walkthrough, we will choose to use a decision tree which is a pretty basic regressor.

article thumbnail

How to Choose MLOps Tools: In-Depth Guide for 2024

DagsHub

It offers implementations of various machine learning algorithms, including linear and logistic regression , decision trees , random forests , support vector machines , clustering algorithms , and more.