Data Engineering, Database and SQL - Data Science Current

SQL vs NoSQL Databases – A Key Concept Every Data Engineer Should Know

Analytics Vidhya

OCTOBER 17, 2020

Overview: Understand what SQL and NoSQL databases are. Go through the prominent differences between SQL and NoSQL databases. This is not an exhaustive […]

SQL

SQL Data Engineering Data Engineering Data Engineer

Introduction to SQL for Data Engineering

Analytics Vidhya

APRIL 23, 2022

Introduction: In this article, we will look at a very common yet very important topic: SQL, also pronounced “Ess-cue-ell”. This time I’ll be answering some of the factual questions about SQL that every beginner needs to know before getting […].

SQL

SQL Data Engineer Data Engineering Data Engineering

How to Normalize Relational Databases With SQL Code?

Analytics Vidhya

FEBRUARY 27, 2023

Introduction: Data is the new oil in this century. The database is the major element of a data science project. To generate actionable insights, the database must be centralized and organized efficiently. So, we are […]

Database

Database SQL Data Science Analytics

SQL Injection: The Cyber Attack Hiding in Your Database

Analytics Vidhya

FEBRUARY 2, 2023

Introduction: SQL injection is an attack in which a malicious user inserts arbitrary SQL code into a web application’s query, allowing them to gain unauthorized access to a database. An attacker can use this to steal sensitive information or make unauthorized changes to the data stored in the database.
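
As a minimal sketch of the attack described in this teaser (not taken from the linked article), the example below uses Python's built-in `sqlite3` module with an in-memory database. The `users` table, the helper names `find_user_unsafe` and `find_user_safe`, and the payload string are all hypothetical and chosen only for illustration. It contrasts a query built by string concatenation with a parameterized query.

```python
import sqlite3


def build_demo_db():
    """Create a throwaway in-memory database with a hypothetical users table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE users (name TEXT, secret TEXT)")
    conn.executemany(
        "INSERT INTO users VALUES (?, ?)",
        [("alice", "a-secret"), ("bob", "b-secret")],
    )
    return conn


def find_user_unsafe(conn, name):
    # Vulnerable: user input is pasted directly into the SQL text,
    # so quotes in the input can change the structure of the query.
    query = f"SELECT name, secret FROM users WHERE name = '{name}'"
    return conn.execute(query).fetchall()


def find_user_safe(conn, name):
    # Parameterized: the driver passes the input as a value,
    # so it is never interpreted as SQL.
    return conn.execute(
        "SELECT name, secret FROM users WHERE name = ?", (name,)
    ).fetchall()


conn = build_demo_db()
payload = "nobody' OR '1'='1"  # classic injection string

unsafe_rows = find_user_unsafe(conn, payload)  # condition becomes always true
safe_rows = find_user_safe(conn, payload)      # no user has this literal name

print(len(unsafe_rows), "rows leaked by the concatenated query")
print(len(safe_rows), "rows returned by the parameterized query")
```

With the concatenated query, the payload turns the `WHERE` clause into a condition that is always true, so every row is returned. The parameterized version treats the same string as an ordinary name and matches nothing.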

SQL

SQL Database Analytics Analytics

Hands-On Tutorial to Analyze Data using Spark SQL

Analytics Vidhya

FEBRUARY 5, 2020

Overview: Relational databases are ubiquitous, but what happens when you need to scale your infrastructure? We will discuss the role Spark SQL plays in […]

SQL

SQL Database Analytics Analytics

SQL and PL/SQL – An Unmissable Comparison

Analytics Vidhya

OCTOBER 12, 2022

Data is becoming more significant and gaining traction by the day, so such large amounts of data must be stored carefully. This brings up databases, and SQL and PL/SQL stand […].

SQL

SQL Data Science Database Analytics

A brief introduction to SQL Alchemy

Analytics Vidhya

JULY 30, 2022

Introduction: The structured data we generally deal with is stored in a tabular format in relational databases. Data stored in these databases can be accessed with a query language called SQL, often pronounced “sequel”.

SQL

SQL Database Data Science Analytics

How to screw SQL to anything with Apache Calcite

Analytics Vidhya

OCTOBER 1, 2021

This article was published as a part of the Data Science Blogathon. Overview of Apache Calcite: Making your own SQL database or running SQL queries against a NoSQL database seems to be a very daunting task. And if we are talking about a distributed database, then the complexity increases many times over.

SQL

SQL Database Data Science Analytics

A Beginner’s Guide to ClickHouse Database

KDnuggets

SEPTEMBER 13, 2024

Learn how to install ClickHouse DBMS, create a database, and run SQL queries using native and Python clients.

Database

Database SQL Python Data Engineering

Interacting with Remote Databases – PostgreSQL and DBAPIs

Analytics Vidhya

SEPTEMBER 22, 2022

Introduction: When creating data pipelines, Software Engineers and Data Engineers frequently work with databases using Database Management Systems like PostgreSQL.

Database

Database Data Pipeline Data Engineering Data Engineering

MSSQL vs MySQL: Comparing Powerhouses of Databases

Analytics Vidhya

AUGUST 30, 2023

Introduction In the bustling arena of database management systems, two heavyweight contenders emerge, each carrying its arsenal of features and capabilities. In one corner, we have the suave and sophisticated Microsoft SQL Server (MSSQL), donned in the elegance of enterprise-level prowess.

Database

Database SQL Analytics Analytics

Step-by-Step Roadmap to Learn SQL in 2023

Analytics Vidhya

FEBRUARY 28, 2023

Introduction Structured Query Language is a powerful language to manage and manipulate data stored in databases. SQL is widely used in the field of data science and is considered an essential skill to have if you work with data.

SQL

SQL Database Data Science Analytics

How is AWS Athena Different from other Databases

Analytics Vidhya

JULY 23, 2022

Introduction: Amazon Athena is an interactive query service based on the open-source Presto engine that allows you to analyze data stored in Amazon S3 directly using ANSI SQL.

AWS

AWS Database SQL Data Science

Getting Started with Graph Database Queries, with Cheat Sheet!

KDnuggets

NOVEMBER 6, 2023

Graph databases are quickly becoming a core part of the analytics toolset for enterprise IT organizations. If you know SQL, you can easily learn Cypher and open up a huge opportunity for data analysis.

Database

Database SQL Data Analysis Data Analysis

Apache Cassandra Data Model(CQL) – Schema and Database Design

Analytics Vidhya

SEPTEMBER 11, 2021

Manipulation of data in this manner was inconvenient and required knowing the API’s intricacies. Although the Cassandra query language is like SQL, its data modeling approaches are entirely […].

Data Models

Data Models Data Modeling Database SQL

Understand the ACID and BASE in Morden Data Engineering

Analytics Vidhya

DECEMBER 12, 2022

Introduction: Dear Data Engineers, this article covers a very interesting topic. Let me give a flashback: a few years ago, someone in a discussion brought up the ACID and BASE properties of data.

Data Engineering

Data Engineering Data Engineering Data Engineering Data Engineer

An Introduction to MongoDB

Analytics Vidhya

NOVEMBER 1, 2022

This article was published as a part of the Data Science Blogathon. Introduction When we hear the word “DATABASE”, the first thought that comes to our mind is SQL! No doubt, SQL and relational databases are widely popular and used extensively for storing data.

SQL

SQL Database Data Science Analytics

Database Normalization- A Step-by-Step Guide with Examples

Analytics Vidhya

AUGUST 16, 2022

This article was published as a part of the Data Science Blogathon. Introduction: As an SQL Developer, you regularly work with enormous amounts of data stored in different tables that are present inside databases.

Database

Database SQL Data Science Analytics

Most Essential 2023 Interview Questions on Data Engineering

Analytics Vidhya

FEBRUARY 7, 2023

Introduction: Data engineering is the field of study that deals with the design, construction, deployment, and maintenance of data processing systems. The goal of this domain is to collect, store, and process data efficiently so that it can be used to support business decisions and power data-driven applications.

Data Engineering

Data Engineering Data Engineering Data Engineering Data Engineer

Understanding the Basics of Database Normalization

Analytics Vidhya

MARCH 2, 2023

Introduction Data normalization is the process of building a database according to what is known as a canonical form, where the final product is a relational database with no data redundancy. More specifically, normalization involves organizing data according to attributes assigned as part of a larger data model.
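
To make the idea of removing redundancy concrete, here is a small sketch using Python's built-in `sqlite3` module. It is an illustration under assumed data, not the linked article's example: the `customers` and `orders` tables, their columns, and the sample rows are all hypothetical. A flat set of order rows that repeats each customer's name and email is split into two tables linked by a key.

```python
import sqlite3

# Hypothetical denormalized rows: customer details repeat on every order.
flat_rows = [
    (1, "alice", "alice@example.com", "keyboard"),
    (2, "alice", "alice@example.com", "mouse"),
    (3, "bob", "bob@example.com", "monitor"),
]

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE customers (
        customer_id INTEGER PRIMARY KEY,
        name        TEXT NOT NULL,
        email       TEXT NOT NULL UNIQUE
    );
    CREATE TABLE orders (
        order_id    INTEGER PRIMARY KEY,
        customer_id INTEGER NOT NULL REFERENCES customers(customer_id),
        item        TEXT NOT NULL
    );
    """
)

for order_id, name, email, item in flat_rows:
    # Store each customer once; the UNIQUE email makes repeats a no-op.
    conn.execute(
        "INSERT OR IGNORE INTO customers (name, email) VALUES (?, ?)",
        (name, email),
    )
    (customer_id,) = conn.execute(
        "SELECT customer_id FROM customers WHERE email = ?", (email,)
    ).fetchone()
    # Orders keep only a reference to the customer, not a copy of the details.
    conn.execute(
        "INSERT INTO orders (order_id, customer_id, item) VALUES (?, ?, ?)",
        (order_id, customer_id, item),
    )

customer_count = conn.execute("SELECT COUNT(*) FROM customers").fetchone()[0]

# A join reconstructs the original flat view without storing it redundantly.
joined = conn.execute(
    """
    SELECT o.order_id, c.name, c.email, o.item
    FROM orders AS o
    JOIN customers AS c ON o.customer_id = c.customer_id
    ORDER BY o.order_id
    """
).fetchall()

print(customer_count, "customers stored for", len(joined), "orders")
```

Each customer's email now lives in exactly one row, so an update touches a single place, while the join still recovers the original flat rows on demand.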

Database

Database Data Modeling Data Models Analytics

What is relational about Relational Databases?

Analytics Vidhya

AUGUST 14, 2021

This article was published as a part of the Data Science Blogathon. Pretty much everything or all sorts of information available online is […]

Database

Database Data Science Analytics Analytics

Remote Data Science Jobs: 5 High-Demand Roles for Career Growth

Data Science Dojo

OCTOBER 31, 2024

Top Employers Microsoft, Facebook, and consulting firms like Accenture are actively hiring in this field of remote data science jobs, with salaries generally ranging from $95,000 to $140,000. Their role is crucial in understanding the underlying data structures and how to leverage them for insights.

Data Science

Data Science Data Scientist Machine Learning Machine Learning

Introduction to Partitioned hive table and PySpark

Analytics Vidhya

OCTOBER 28, 2021

The official description of Hive is: ‘Apache Hive is a data warehouse software project built on top of Apache Hadoop for providing data query and analysis. Hive gives an SQL-like interface to query data stored in various databases and […]’.

Apache Hadoop

Apache Hadoop Data Warehouse Hadoop SQL

How To Create An Aggregation Pipeline In MongoDB

Analytics Vidhya

APRIL 12, 2021

This article was published as a part of the Data Science Blogathon. Introduction: MongoDB is a free, open-source NoSQL document database.

SQL

SQL Data Science Database Analytics

CI/CD for Data Pipelines: A Game-Changer with AnalyticsCreator

Data Science Blog

MAY 20, 2024

Continuous Integration and Continuous Delivery (CI/CD) for Data Pipelines: It is a Game-Changer with AnalyticsCreator! The need for efficient and reliable data pipelines is paramount in data science and data engineering. It offers full BI-Stack Automation, from source to data warehouse through to frontend.

Data Pipeline

Data Pipeline Data Warehouse Azure Data Lakes

Navigate your way to success – Top 10 data science careers to pursue in 2023

Data Science Dojo

MAY 10, 2023

Data Engineer Data engineers are responsible for building, maintaining, and optimizing data infrastructures. They require strong programming skills, expertise in data processing, and knowledge of database management.

Data Science

Data Science Data Scientist Database Administration Machine Learning

Big data engineering simplified: Exploring roles of distributed systems

Data Science Dojo

JULY 24, 2023

They allow data processing tasks to be distributed across multiple machines, enabling parallel processing and scalability. Its characteristics can be summarized as follows: Volume : Big Data involves datasets that are too large to be processed by traditional database management systems. databases), semi-structured data (e.g.,

Big Data

Big Data Big Data Data Engineer Data Engineering

Becoming a Data Engineer: 7 Tips to Take Your Career to the Next Level

Data Science Connect

JANUARY 27, 2023

Data engineering is a crucial field that plays a vital role in the data pipeline of any organization. It is the process of collecting, storing, managing, and analyzing large amounts of data, and data engineers are responsible for designing and implementing the systems and infrastructure that make this possible.

Data Engineering

Data Engineering Data Engineering Data Engineering Data Engineer

Unify structured data in Amazon Aurora and unstructured data in Amazon S3 for insights using Amazon Q

AWS Machine Learning Blog

NOVEMBER 20, 2024

In today’s data-intensive business landscape, organizations face the challenge of extracting valuable insights from diverse data sources scattered across their infrastructure. The solution combines data from an Amazon Aurora MySQL-Compatible Edition database and data stored in an Amazon Simple Storage Service (Amazon S3) bucket.

Database

Database AWS SQL ETL

Why using Infrastructure as Code for developing Cloud-based Data Warehouse Systems?

Data Science Blog

SEPTEMBER 19, 2023

So why using IaC for Cloud Data Infrastructures? For Data Warehouse Systems that often require powerful (and expensive) computing resources, this level of control can translate into significant cost savings. The following Terraform script will create an Azure Resource Group, a SQL Server, and a SQL Database.

Data Warehouse

Data Warehouse Azure SQL Database

How to Get Started as a Data Engineer

Smart Data Collective

OCTOBER 11, 2021

If you enjoy working with data, or if you’re just interested in a career with a lot of potential upward trajectory, you might consider a career as a data engineer. But what exactly does a data engineer do, and how can you begin your career in this niche? What Is a Data Engineer?

Data Engineering

Data Engineering Data Engineering Data Engineer Data Engineering

Exploring the fundamentals of online transaction processing databases

Dataconomy

APRIL 27, 2023

What is an online transaction processing database (OLTP)? OLTP is the backbone of modern data processing, a critical component in managing large volumes of transactions quickly and efficiently. This approach allows businesses to efficiently manage large amounts of data and leverage it to their advantage in a highly competitive market.

Database

Database Data Scientist Data Mining Data Mining

Imperva optimizes SQL generation from natural language using Amazon Bedrock

AWS Machine Learning Blog

JUNE 20, 2024

The data is stored in a data lake and retrieved by SQL using Amazon Athena. The following figure shows a search query that was translated to SQL and run. The problem Making data accessible to users through applications has always been a challenge. Constructing SQL queries from natural language isn’t a simple task.

SQL

SQL Database AWS Machine Learning

MongoDB Replication and Sharding- A Complete Introduction

Analytics Vidhya

DECEMBER 26, 2022

This article was published as a part of the Data Science Blogathon. Introduction A NoSQL database is a non-relational database that does not use the traditional table-based schema of a relational database. NoSQL databases are often used for big data and real-time web applications.

Database

Database Big Data Big Data Data Science

How Twilio generated SQL using Looker Modeling Language data with Amazon Bedrock

AWS Machine Learning Blog

AUGUST 8, 2024

Managing and retrieving the right information can be complex, especially for data analysts working with large data lakes and complex SQL queries. This tool converts questions from data analysts asked in natural language (such as “Which table contains customer address information?”)

SQL

SQL Data Lakes Data Analyst AWS

A Deep Dive into Data Replication: Most Effective Way to Protect Your Data

Analytics Vidhya

FEBRUARY 22, 2023

Introduction: Data replication, also known as database replication, is the copying of data to ensure that all information remains consistent across all data resources in real time. Data replication is like a safety net that keeps your information safe from disappearing or falling through the cracks.

Database

Database Analytics Analytics SQL

Python and MySQL: A Practical Introduction for Data Analysis

Analytics Vidhya

AUGUST 25, 2021

This article was published as a part of the Data Science Blogathon. Introduction: Let’s look at a practical example of how to make SQL queries to a MySQL server from Python code: CREATE, SELECT, UPDATE, JOIN, etc. Most applications interact with data in some form. Therefore, programming languages (Python […]

Data Analysis

Data Analysis Data Analysis Python SQL

How to Develop Serverless Code Using Azure Functions?

Analytics Vidhya

JANUARY 30, 2023

Whether we are analyzing IoT data streams, managing scheduled events, processing document uploads, or responding to database changes, Azure Functions allow developers […]

Azure

Azure Database Analytics Analytics

Understanding the need for DBMS

Analytics Vidhya

AUGUST 20, 2022

This article was published as a part of the Data Science Blogathon. Introduction A Database is a collection of inter-related data, and a Database Management System is a set of programs that helps users create and maintain this data. DBMS is a computer-based data record-keeping system.

Database

Database Data Science Analytics Analytics

A Beginner’s Guide to MySQL: Part 2

Analytics Vidhya

JULY 10, 2021

This article was published as a part of the Data Science Blogathon. Pre-requisites: basic knowledge of any database; basic understanding of […]

Database

Database Data Science Analytics Analytics
