Typically, these analytical operations are performed on structured data, using tools such as pandas or SQL engines. To overcome these limitations, we propose a solution that combines RAG with metadata and entity extraction, SQL querying, and LLM agents, as described in the following sections.
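As a hedged illustration of that last point, here is a minimal sketch of the same aggregation done both with pandas and with an in-process SQL engine (sqlite3); the sample data and column names are invented for the example.

```python
# A small aggregation done two ways: with pandas and with a SQL engine (sqlite3).
# The sample data is invented for illustration.
import sqlite3

import pandas as pd

df = pd.DataFrame({"region": ["east", "west", "east"], "sales": [100, 150, 200]})

# pandas: group and sum in memory.
print(df.groupby("region")["sales"].sum())

# SQL engine: load the frame into an in-memory database and query it.
conn = sqlite3.connect(":memory:")
df.to_sql("sales", conn, index=False)
print(conn.execute("SELECT region, SUM(sales) FROM sales GROUP BY region").fetchall())
```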
Prerequisites: a provisioned or serverless Amazon Redshift data warehouse, a SageMaker domain, and basic knowledge of a SQL query editor. Implementation steps: load data into the Amazon Redshift cluster, then connect to it using Query Editor v2. For this post, we'll use a provisioned Amazon Redshift cluster.
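A minimal sketch of the load step, assuming the open-source redshift_connector driver and a COPY from S3; the cluster endpoint, credentials, table, bucket, and IAM role below are all placeholders.

```python
# Hedged sketch: load a CSV from S3 into a Redshift table with COPY.
# Endpoint, credentials, table, bucket, and IAM role are placeholders.
import redshift_connector

conn = redshift_connector.connect(
    host="demo-redshift.xxxxxxxx.us-east-1.redshift.amazonaws.com",
    database="dev",
    user="awsuser",
    password="<password>",
)
cur = conn.cursor()
cur.execute("""
    COPY sales FROM 's3://my-bucket/sales/'
    IAM_ROLE 'arn:aws:iam::123456789012:role/RedshiftCopyRole'
    FORMAT AS CSV
    IGNOREHEADER 1;
""")
conn.commit()
```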
[link] Ahmad Khan, head of artificial intelligence and machine learning strategy at Snowflake, gave a presentation entitled “Scalable SQL + Python ML Pipelines in the Cloud” about his company’s Snowpark service at Snorkel AI’s Future of Data-Centric AI virtual conference in August 2022. Welcome everybody.
Amazon Redshift uses SQL to analyze structured and semi-structured data across data warehouses, operational databases, and data lakes, using AWS-designed hardware and ML to deliver the best price-performance at any scale. Save the stack's template YAML locally, then enter a stack name, such as Demo-Redshift.
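For reference, a hedged sketch of launching such a stack with boto3; the template filename demo-redshift.yaml is an assumption, not the post's actual artifact.

```python
# Hedged sketch: create the CloudFormation stack from a locally saved template.
# The filename demo-redshift.yaml is an assumption for illustration.
import boto3

cfn = boto3.client("cloudformation")
with open("demo-redshift.yaml") as f:
    template_body = f.read()

cfn.create_stack(
    StackName="Demo-Redshift",
    TemplateBody=template_body,
    Capabilities=["CAPABILITY_NAMED_IAM"],  # needed if the template creates IAM roles
)
```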
While this year's BI Bake Off is designed for BI vendors, we wanted to show how the Alation Data Catalog can help make the analysis of this important dataset more effective and efficient. Alation BI Bake Off Demo. With Alation, you can search for assets across the entire data pipeline.
Over the last month, we've been heavily focused on expanding support for SQL translations in our SQL Translations tool. Specifically, we've been introducing fixes and features for our Microsoft SQL Server to Snowflake translation. This is where the SQL Translation tool can be a massive accelerator for your migration.
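This is not phData's tool itself, but the same idea can be sketched with the open-source sqlglot library, which transpiles between SQL dialects:

```python
# Illustration only: transpile T-SQL to the Snowflake dialect with sqlglot.
import sqlglot

tsql = "SELECT TOP 5 name FROM dbo.users WHERE created_at < GETDATE()"
print(sqlglot.transpile(tsql, read="tsql", write="snowflake")[0])
# T-SQL-specific constructs such as TOP and GETDATE() are rewritten
# into their Snowflake equivalents.
```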
Snowpark, offered by the Snowflake AI Data Cloud, consists of libraries and runtimes that enable secure deployment and processing of non-SQL code, such as Python, Java, and Scala. Developers can seamlessly build data pipelines, ML models, and data applications with User-Defined Functions and Stored Procedures.
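A minimal Snowpark sketch, assuming placeholder connection parameters and an invented WEATHER table, showing a Python UDF applied inside a DataFrame pipeline:

```python
# Hedged sketch: register a Python UDF with Snowpark and apply it to a table.
# Connection parameters and the WEATHER table are placeholders.
from snowflake.snowpark import Session
from snowflake.snowpark.functions import col, udf
from snowflake.snowpark.types import FloatType

session = Session.builder.configs({
    "account": "<account>", "user": "<user>", "password": "<password>",
    "warehouse": "<warehouse>", "database": "<db>", "schema": "<schema>",
}).create()

@udf(return_type=FloatType(), input_types=[FloatType()])
def fahrenheit_to_celsius(f: float) -> float:
    return (f - 32.0) * 5.0 / 9.0

# The UDF runs inside Snowflake, next to the data.
session.table("WEATHER").with_column(
    "TEMP_C", fahrenheit_to_celsius(col("TEMP_F"))
).show()
```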
We've been focusing on two key areas: Microsoft SQL Server to Snowflake Data Cloud SQL translations and our new Advisor tool within the phData Toolkit. The Advisor tool's Operational Risks checks identify risks such as data loss or failures in the event of an unforeseen outage or disaster. Let's dive in.
” – James Tu, Research Scientist at Waabi. Play with this project live; for more, dive into the documentation, or get in touch if you'd like to go through a custom demo with your team. Comet ML: a cloud-based experiment tracking and optimization platform. Flyte: a platform for orchestrating ML pipelines at scale.
Most data warehouses hold terabytes of data, so data quality monitoring is often challenging and cost-intensive due to dependencies on multiple tools, and it eventually gets ignored. Over time, this erodes credibility and data consistency, leading businesses to mistrust their data pipelines and processes.
In this spirit, IBM introduced IBM Event Automation with an intuitive, easy-to-use, no-code format that enables users with little to no training in SQL, Java, or Python to leverage events, no matter their role. Request a live demo to see how working with real-time events can benefit your business. Hungry for more?
An optional CloudFormation stack to deploy a data pipeline that enables a conversation analytics dashboard. This is where the content for the demo solution will be stored. For the demo solution, choose the default (Claude V3 Sonnet). For the hotel-bot demo, try the default of 4. Do not specify an S3 prefix.
This functionality eliminates the need for manual schema adjustments, streamlining the data ingestion process and ensuring quicker access to data for consumers. As the demo shows, it is incredibly simple to use the INFER_SCHEMA and schema evolution features to speed up data ingestion into Snowflake.
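A hedged sketch of those two features, using the snowflake-connector-python driver; the stage, file format, and table names are placeholders:

```python
# Hedged sketch: create a table from staged Parquet files with INFER_SCHEMA,
# then enable schema evolution so new columns in later files are added automatically.
# Stage, file format, and table names are placeholders.
import snowflake.connector

conn = snowflake.connector.connect(
    account="<account>", user="<user>", password="<password>",
    warehouse="<warehouse>", database="<db>", schema="<schema>",
)
cur = conn.cursor()
cur.execute("""
    CREATE TABLE events USING TEMPLATE (
        SELECT ARRAY_AGG(OBJECT_CONSTRUCT(*))
        FROM TABLE(INFER_SCHEMA(
            LOCATION => '@events_stage',
            FILE_FORMAT => 'parquet_format'
        ))
    )
""")
cur.execute("ALTER TABLE events SET ENABLE_SCHEMA_EVOLUTION = TRUE")
```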
For a short demo on Snowpark, be sure to check out the video below. Utilizing Streamlit as a front end: at this point, we have all of our data processing, model training, inference, and model evaluation steps set up with Snowpark. What was once a SQL-based data warehousing tool is now so much more.
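As a hedged sketch of that front end, here is a small Streamlit app over a CSV of scored results; the file layout and the churn_probability column are invented for the example:

```python
# Hedged sketch: a minimal Streamlit front end over scored model output.
# The CSV layout and churn_probability column are invented for illustration.
# Run with: streamlit run app.py
import pandas as pd
import streamlit as st

st.title("Model Predictions")
uploaded = st.file_uploader("Upload scored results (CSV)", type="csv")
if uploaded is not None:
    df = pd.read_csv(uploaded)
    threshold = st.slider("Probability threshold", 0.0, 1.0, 0.5)
    st.dataframe(df[df["churn_probability"] >= threshold])
```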
These combinations of Python code and SQL play a crucial role, but keeping them robust for their entire lifetime can be challenging. Directives and architectural tricks for robust data pipelines: gain insights into an extensive array of directives and architectural strategies tailored for the development of highly dependable data pipelines.
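One such directive, sketched under simple assumptions (sqlite3 for portability, an illustrative statement): wrap each SQL step in an explicit transaction with bounded retries, so a transient failure never leaves half-written state.

```python
# Hedged sketch: run a SQL step inside a transaction, retrying with backoff.
# sqlite3 is used for portability; the statement itself is illustrative.
import sqlite3
import time

def run_step(db_path: str, sql: str, retries: int = 3) -> None:
    for attempt in range(1, retries + 1):
        try:
            # The connection context manager commits on success and rolls back
            # on error, so a failed step leaves no partial state behind.
            with sqlite3.connect(db_path) as conn:
                conn.execute(sql)
            return
        except sqlite3.OperationalError:
            if attempt == retries:
                raise
            time.sleep(2 ** attempt)  # exponential backoff before retrying

run_step("pipeline.db", "CREATE TABLE IF NOT EXISTS audit (run_at TEXT)")
```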
Consider a data pipeline that detects its own failures, diagnoses the issue, and recommends the fix, all automatically. This is the potential of self-healing pipelines, and this blog explores how to implement them using dbt, Snowflake Cortex, and GitHub Actions.
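A hedged sketch of the detect-and-diagnose loop, with placeholder credentials; the Cortex call uses the documented SNOWFLAKE.CORTEX.COMPLETE function, while the surrounding wiring (how the diagnosis is surfaced) is left to the CI workflow:

```python
# Hedged sketch: if dbt tests fail, ask Snowflake Cortex to diagnose the output.
# Credentials are placeholders; surfacing the diagnosis (e.g., as a CI comment)
# is left to the surrounding GitHub Actions workflow.
import subprocess

import snowflake.connector

result = subprocess.run(["dbt", "test"], capture_output=True, text=True)
if result.returncode != 0:
    conn = snowflake.connector.connect(
        account="<account>", user="<user>", password="<password>",
    )
    cur = conn.cursor()
    cur.execute(
        "SELECT SNOWFLAKE.CORTEX.COMPLETE('mistral-large', %s)",
        (f"Diagnose this dbt test failure and suggest a fix:\n{result.stdout[-4000:]}",),
    )
    print(cur.fetchone()[0])
```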
Snowpark, which is Snowflake’s developer framework that extends the benefits of the Data Cloud beyond SQL to Python, Scala, and Java, can be used to scale batch inference across your Snowflake data warehouse. Schedule a custom demo tailored to your use case with our ML experts today.
This use case highlights how large language models (LLMs) can act as translators between human languages (English, Spanish, Arabic, and more) and machine-interpretable languages (Python, Java, Scala, SQL, and so on), along with sophisticated internal reasoning.
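A minimal sketch of that translation pattern; llm_complete is a hypothetical stand-in for any chat-completion call, stubbed here so the sketch runs, and the orders schema is invented:

```python
# Hedged sketch: translate a natural-language question into SQL via an LLM prompt.
# llm_complete is a hypothetical stand-in for a chat-completion API, stubbed here;
# the orders schema is invented for illustration.
SCHEMA = "orders(order_id INT, customer TEXT, total NUMERIC, placed_at DATE)"

def llm_complete(prompt: str) -> str:
    # Stubbed response; swap in a real LLM client here.
    return "SELECT customer, SUM(total) FROM orders GROUP BY customer"

def question_to_sql(question: str) -> str:
    prompt = (
        f"Given the table {SCHEMA}, write one SQL query that answers: {question}. "
        "Return only the SQL."
    )
    return llm_complete(prompt)

print(question_to_sql("What is total revenue per customer?"))
```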
Tuesday is the first day of the AI Expo and Demo Hall, where you can connect with our conference partners and check out the latest developments and research from leading tech companies. Finally, get ready for some All Hallows' Eve fun with Halloween Data After Dark, featuring a costume contest, candy, and more. What's next?
Generative AI can be used to automate the data modeling process by generating entity-relationship diagrams or other types of data models, and to assist in the UI design process by generating wireframes or high-fidelity mockups. GPT-4 Data Pipelines: Transform JSON to SQL Schema Instantly. Blockstream's public Bitcoin API.
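A deterministic, hedged stand-in for the JSON-to-SQL-schema idea (the linked article presumably uses GPT-4; this sketch just maps JSON value types to column types):

```python
# Hedged sketch: derive a CREATE TABLE statement from one JSON record's value types.
# A deterministic stand-in for the LLM-driven approach named above.
import json

TYPE_MAP = {bool: "BOOLEAN", int: "INTEGER", float: "REAL", str: "TEXT"}

def json_to_ddl(record: dict, table: str) -> str:
    cols = ", ".join(
        f"{name} {TYPE_MAP.get(type(value), 'TEXT')}" for name, value in record.items()
    )
    return f"CREATE TABLE {table} ({cols});"

print(json_to_ddl(json.loads('{"id": 1, "name": "a", "price": 9.5}'), "products"))
# -> CREATE TABLE products (id INTEGER, name TEXT, price REAL);
```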
The rise of data lakes, IoT analytics, and big data pipelines has introduced a new world of fast, big data. With TrustCheck, best practices and compliance rules can be shared easily and embedded directly into the workflow of data consumers. To see the full capabilities of TrustCheck, watch the full demo below.
An ML platform standardizes the technology stack for your data team around best practices to reduce incidental complexities with machine learning and better enable teams across projects and workflows. We ask this during product demos, user and support calls, and on our MLOps LIVE podcast. Data engineers are mostly in charge of it.