These tables house complex domain-specific schemas, with instances of nested tables and multi-dimensional data that require complex database queries and domain-specific knowledge for data retrieval.
Retrieval Augmented Generation generally consists of three major steps, which I will explain briefly below. Information Retrieval: The very first step involves retrieving relevant information from a knowledge base, database, or vector database, where we store the embeddings of the data from which we will retrieve information.
Though both are great to learn, what gets left out of the conversation is a simple yet powerful programming language that everyone in the data science world can agree on: SQL. But why is SQL, or Structured Query Language, so important to learn? Let’s start with the first clause often learned by new SQL users, the WHERE clause.
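The WHERE clause filters rows before they are returned. A minimal sketch using Python's built-in sqlite3 module (the table and column names here are made up for illustration, not from the post):

```python
import sqlite3

# Hypothetical in-memory table for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, region TEXT, total REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "EMEA", 120.0), (2, "APAC", 75.5), (3, "EMEA", 42.0)],
)

# WHERE keeps only the rows matching the predicate.
rows = conn.execute(
    "SELECT id, total FROM orders WHERE region = 'EMEA'"
).fetchall()
print(rows)  # [(1, 120.0), (3, 42.0)]
```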
With the right underlying embedding model, capable of producing accurate semantic representations of the input document chunks and the input questions, and an efficient semantic search module, this solution is able to answer questions that require retrieving existing information from a database of documents.
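At its core, the semantic search module ranks document chunks by similarity between their embeddings and the question's embedding, typically with cosine similarity. A sketch with toy hand-written vectors standing in for a real embedding model's output:

```python
import math

# Toy 3-dimensional "embeddings"; a real model would produce these vectors.
docs = {
    "refunds": [0.9, 0.1, 0.0],
    "shipping": [0.1, 0.8, 0.1],
}

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def semantic_search(query_vec, index):
    # Return the chunk whose embedding is most similar to the query embedding.
    return max(index, key=lambda name: cosine(query_vec, index[name]))

best = semantic_search([0.85, 0.15, 0.0], docs)
print(best)  # refunds
```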
Data processing and SQL analytics Analyze, prepare, and integrate data for analytics and AI using Amazon Athena, Amazon EMR, AWS Glue, and Amazon Redshift. With the SQL editor, you can query data lakes, databases, data warehouses, and federated data sources. For Project name , enter a name (for example, demo).
From a broad perspective, the complete solution can be divided into four distinct steps: text-to-SQL generation, SQL validation, data retrieval, and data summarization. A pre-configured prompt template is used to call the LLM and generate a valid SQL query. The following diagram illustrates this workflow.
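Of the four steps, SQL validation is the easiest to sketch without an LLM in the loop: one common trick is to run the generated query through the database's query planner without executing it. A minimal stand-in using SQLite's EXPLAIN against a schema-only database (the actual post targets a different stack; this only illustrates the validation idea):

```python
import sqlite3

# Schema-only database: no data needed just to check that a query parses
# and references real tables/columns.
schema_db = sqlite3.connect(":memory:")
schema_db.execute("CREATE TABLE sales (region TEXT, amount REAL)")

def is_valid_sql(query: str) -> bool:
    # EXPLAIN compiles the query without running it; failures surface as errors.
    try:
        schema_db.execute("EXPLAIN " + query)
        return True
    except sqlite3.Error:
        return False

print(is_valid_sql("SELECT region, SUM(amount) FROM sales GROUP BY region"))  # True
print(is_valid_sql("SELEC region FROM sales"))                                # False
```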
Basic knowledge of a SQL query editor. Database name: Enter dev. Database user: Enter awsuser. A provisioned or serverless Amazon Redshift data warehouse. For this post we’ll use a provisioned Amazon Redshift cluster. A SageMaker domain. A QuickSight account (optional). Deploy the AWS CloudFormation template to your account.
Also, traditional database management tasks, including backups, upgrades and routine maintenance drain valuable time and resources, hindering innovation. By using fit-for-purpose databases, customers can efficiently run workloads, using the appropriate engine at the optimal cost to optimize analytics for the best price-performance.
This use case highlights how large language models (LLMs) are able to become a translator between human languages (English, Spanish, Arabic, and more) and machine interpretable languages (Python, Java, Scala, SQL, and so on) along with sophisticated internal reasoning.
In this post, we describe how CBRE partnered with AWS Prototyping to develop a custom query environment allowing natural language query (NLQ) prompts by using Amazon Bedrock, AWS Lambda , Amazon Relational Database Service (Amazon RDS), and Amazon OpenSearch Service. The wrapper function runs the SQL query using psycopg2.
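The post mentions a wrapper function that runs the SQL query using psycopg2. Both psycopg2 and the stdlib sqlite3 module follow Python's DB-API 2.0, so a self-contained sketch of such a wrapper can use sqlite3; the shape carries over to a psycopg2 connection (note psycopg2 uses %s placeholders rather than ?):

```python
import sqlite3

def run_query(conn, sql, params=()):
    # DB-API wrapper: execute a statement and return all rows,
    # closing the cursor whether or not the query succeeds.
    cur = conn.cursor()
    try:
        cur.execute(sql, params)
        return cur.fetchall()
    finally:
        cur.close()

conn = sqlite3.connect(":memory:")
run_query(conn, "CREATE TABLE t (x INTEGER)")
run_query(conn, "INSERT INTO t VALUES (?)", (7,))
print(run_query(conn, "SELECT x FROM t"))  # [(7,)]
```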
Visualizing graph data doesn’t necessarily depend on a graph database… Working on a graph visualization project? You might assume that graph databases are the way to go – they have the word “graph” in them, after all. Do I need a graph database? It depends on your project. Unstructured? Under construction?
In this post, we save the data in JSON format, but you can also choose to store it in your preferred SQL or NoSQL database. Run the Streamlit demo Now that you have the components in place and the invoices processed using Amazon Bedrock, it’s time to deploy the Streamlit application.
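Saving extracted results as JSON is the simplest of the storage options mentioned. A minimal sketch with the stdlib json module (the invoice field names here are invented for illustration):

```python
import json
import os
import tempfile

# Hypothetical extracted invoice fields.
invoice = {"invoice_id": "INV-001", "vendor": "Acme", "total": 199.99}

# Write to disk as JSON...
path = os.path.join(tempfile.mkdtemp(), "invoice.json")
with open(path, "w") as f:
    json.dump(invoice, f, indent=2)

# ...and read it back for the downstream app (e.g. a Streamlit view).
with open(path) as f:
    loaded = json.load(f)
print(loaded["total"])  # 199.99
```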
What if you could automatically shard your PostgreSQL database across any number of servers and get industry-leading performance at scale without any special data modelling steps? And if you want to see demos of some of this functionality, be sure to join us for the livestream of the Citus 12.0 Updates page. Let’s dive in!
“Vector Databases are completely different from your cloud data warehouse.” – You might have heard that statement if you are involved in creating vector embeddings for your RAG-based Gen AI applications. Enhanced Search and Retrieval Augmented Generation: Vector search systems work by matching queries with embeddings in a database.
[link] Ahmad Khan, head of artificial intelligence and machine learning strategy at Snowflake, gave a presentation entitled “Scalable SQL + Python ML Pipelines in the Cloud” about his company’s Snowpark service at Snorkel AI’s Future of Data-Centric AI virtual conference in August 2022. Welcome everybody.
Topics include Python fundamentals, SQL for data science, statistics for machine learning, and more. Deep Learning with TensorFlow 2 and PyTorch Originally recorded as a live training, this session serves as a primer on deep learning theory that will bring the revolutionary machine learning approach to life with hands-on demos.
Amazon Redshift uses SQL to analyze structured and semi-structured data across data warehouses, operational databases, and data lakes, using AWS-designed hardware and ML to deliver the best price-performance at any scale. Enter a stack name, such as Demo-Redshift. For Prepare template, select Template is ready.
The new Amazon Relational Database Service (Amazon RDS) for Db2 offering allows customers to migrate their existing, self-managed Db2 databases to the cloud and accelerate strategic modernization initiatives. AWS ran a live demo to show how to get started in just a few clicks. Where can I provide feedback?
Chris had earned an undergraduate computer science degree from Simon Fraser University and had worked as a database-oriented software engineer. In 2004, Tableau got both an initial series A of venture funding and Tableau’s first OEM contract with the database company Hyperion—that’s when I was hired. Release v1.0
Since its release on November 30, 2022 by OpenAI, the ChatGPT public demo has taken the world by storm. These models are the technology behind OpenAI’s DALL-E and GPT-3, and are powerful enough to understand natural language commands and generate high-quality code to instantly query databases.
This blog takes you on a journey into the world of Uber’s analytics and the critical role that Presto, the open source SQL query engine, plays in driving their success. This allowed them to focus on SQL-based query optimization to the nth degree. They stood up a file-based data lake alongside their analytical database.
It exposes new interfaces for development in Python, Scala, or Java to supplement Snowflake’s original SQL interface. SQL is, of course, the lingua franca for data, but there are many applications and development teams that rely heavily on other languages. What’s New with Snowpark? ” Of course! conda create -n snowpark python=3.8
A business event is represented by a change in state of the data flowing between your applications, systems and databases and, most importantly, the time it occurred. A business event can describe anything that happens which is significant to an enterprise’s operation.
From there, ChatGPT generates a SQL query which is then executed in the Snowflake Data Cloud , and the results are brought back into the application in a table format. In this case, after the SQL query is executed on Snowflake, it is converted into a Python dataframe, and basic graphic code is executed to generate the image.
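The post's round trip (query result into a dataframe-like table) can be sketched with the stdlib alone; here sqlite3 and plain dicts stand in for Snowflake and pandas, and the table contents are made up:

```python
import sqlite3

# Stand-in for the warehouse: a tiny in-memory table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE revenue (month TEXT, amount REAL)")
conn.executemany("INSERT INTO revenue VALUES (?, ?)",
                 [("Jan", 100.0), ("Feb", 150.0)])

# Execute the (here hand-written) SQL and shape the result like a dataframe:
# a list of row dicts keyed by column name from cursor.description.
cur = conn.execute("SELECT month, amount FROM revenue ORDER BY month")
columns = [d[0] for d in cur.description]
table = [dict(zip(columns, row)) for row in cur.fetchall()]
print(table)
```

In the real application this `table` step is where a pandas DataFrame would be built and handed to the charting code.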
We’ve been focusing on two key areas: Microsoft SQL Server to Snowflake Data Cloud SQL translations and our new Advisor tool within the phData Toolkit. SQL Translation Updates When customers are looking to migrate between platforms, there’s always a challenge in migrating existing code. Let’s dive in.
In this demo, an outbound call is made using the CreateSipMediaApplicationCall API. And what are you using for your database? spk_1: Oh, yeah, for a database, we currently have a 200 gigabyte database running MySQL and I can’t remember the version. But um, the thing about our database is sometimes it lags.
SQL databases might sound scary, but honestly, they’re not all that bad. And much of that is thanks to SQL (Structured Query Language). Believe it or not, SQL is about to celebrate its fiftieth birthday next year, as it was first developed in 1974 as part of IBM’s System R Project. Learning is learning.
Over the last month, we’ve been heavily focused on adding additional support for SQL translations to our SQL Translations tool. Specifically, we’ve been introducing fixes and features for our Microsoft SQL Server to Snowflake translation. This is where the SQL Translation tool can be a massive accelerator for your migration.
With the help of SQL and R, this tool analyzes your data and turns it into pretty interactive dashboards within minutes. You can add time-based and custom filters, write SQL, get charts, and share dashboards with the team. Its analytics can integrate with different SQL databases and different data warehouses.
To ensure security and JSON/pickle benefits, you can save your model to a dedicated database. Next, you will see how you can save an ML model in a database. Storing ML models in a database There is also scope for you to save your ML models in relational databases such as PostgreSQL, MySQL, Oracle SQL, etc.
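The usual pattern is to pickle the model and store the bytes as a BLOB column. The passage targets relational databases like PostgreSQL or MySQL; sqlite3 keeps this sketch self-contained, and the "model" here is just a stand-in dict:

```python
import pickle
import sqlite3

# Stand-in for a trained model object.
model = {"weights": [0.2, 0.8], "bias": 0.1}

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE models (name TEXT PRIMARY KEY, blob BLOB)")

# Serialize with pickle and store as a BLOB.
conn.execute("INSERT INTO models VALUES (?, ?)", ("demo", pickle.dumps(model)))

# Later: fetch the bytes and deserialize to recover the model.
blob = conn.execute(
    "SELECT blob FROM models WHERE name = ?", ("demo",)
).fetchone()[0]
restored = pickle.loads(blob)
print(restored == model)  # True
```

Note that unpickling executes arbitrary code from the stored bytes, so this only makes sense when the database contents are trusted.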
Data Wrangling: Data Quality, ETL, Databases, Big Data The modern data analyst is expected to be able to source and retrieve their own data for analysis. Competence in data quality, databases, and ETL (Extract, Transform, Load) is essential. SQL excels with big data and statistics, making it important for querying databases.
Many tools can help teams migrate SQL code more efficiently, such as Liquibase, Flyway, and schemachange. How do you deploy your SQL code into Production? To use these tools effectively, it is essential to store all SQL code in a version control system such as Git. schemachange names scripts by prefix—Always: A__[description].sql; Repeatable: R__[description].sql
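schemachange's naming convention (Versioned V<version>__, Repeatable R__, Always A__, each ending in .sql) is easy to check mechanically; a small sketch of such a check (the helper and regex here are my own, not from schemachange itself):

```python
import re

# Hypothetical validator for schemachange-style script names:
# V<version>__desc.sql, R__desc.sql, A__desc.sql
PATTERN = re.compile(r"^(A|R|V\d[\d.]*)__\w[\w ]*\.sql$")

def script_kind(filename):
    # Return "A", "R", or "V" for a conforming name, else None.
    m = PATTERN.match(filename)
    return m.group(1)[0] if m else None

print(script_kind("A__grant_permissions.sql"))  # A
print(script_kind("R__refresh_views.sql"))      # R
print(script_kind("V1.2__add_orders.sql"))      # V
print(script_kind("notes.txt"))                 # None
```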
Background on the Netezza Performance Server capability demo. Netezza Performance Server (NPS) has recently added the ability to access Parquet files by defining a Parquet file as an external table in the database. Prerequisites for the demo. Figure 1 – NPS database table definitions. The data definition.
It offers businesses the capability to capture and process real-time information from diverse sources, such as databases, software applications and cloud services. Instead of requiring skilled Flink structured query language (SQL) programmers, other business teams can immediately extract actionable insights from relevant events.
To be blunt, a lot of that hype is just some demo b t that would fall over the instant anyone tried to use it for a real task that their job depends on. It’s kinda like SQL injection, except worse and with no solution today. However, I want to talk about something else first. blog post that explains it.
Example template for an exploratory notebook | Source: Author How to organize code in Jupyter notebook For exploratory tasks, the code to produce SQL queries, pandas data wrangling, or create plots is not important for readers. In those cases, most of the data exploration and wrangling will be done through SQL. documentation.
The session included a demo on running a vLLM server for your own large language model (LLM) inference service using Modal. Dolt: Version Control For Databases Imagine Git and MySQL had a baby: that’s Dolt, the world’s first and only version-controlled SQL database.
Challenges associated with these stages involve not knowing all touchpoints where data is persisted, maintaining a data pre-processing pipeline for document chunking, choosing a chunking strategy, vector database, and indexing strategy, generating embeddings, and any manual steps to purge data from vector stores and keep it in sync with source data.
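Of the choices listed, the chunking strategy is the simplest to illustrate. A fixed-size, overlapping window is a common baseline; the window and overlap sizes below are arbitrary:

```python
# Fixed-size overlapping chunking sketch; `size` must exceed `overlap`.
def chunk(text, size=20, overlap=5):
    step = size - overlap
    return [text[i:i + size] for i in range(0, max(len(text) - overlap, 1), step)]

doc = "Retrieval Augmented Generation pipelines split documents into chunks."
pieces = chunk(doc)
# Adjacent chunks share `overlap` characters so context spanning a
# boundary is not lost when chunks are embedded independently.
print(len(pieces))
print(pieces[0])
```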
For example, they can enable access for SQL for annotators in one workspace, Snowflake for data scientists in another workspace, and lock down local file upload for all roles across all workspaces. For example, they can choose to only allow administrators and super administrators to add or edit configurations for the company’s SQL database.
We’ve seen significant interest in TigerGraph’s fast, scalable graph database platform recently. In response, I put together this TigerGraph tutorial to create a React graph visualization application that integrates with their cloud database. Now we’ll create a database query that we can use in ReGraph.