Data Engineering, Database and Python

Interacting with Remote Databases – PostgreSQL and DBAPIs

Analytics Vidhya

SEPTEMBER 22, 2022

Introduction When creating data pipelines, Software Engineers and Data Engineers frequently work with databases using Database Management Systems like PostgreSQL. The post Interacting with Remote Databases – PostgreSQL and DBAPIs appeared first on Analytics Vidhya.

Database

Database Data Pipeline Data Engineer Data Engineering

Introduction to Redis OM in Python

Analytics Vidhya

JANUARY 25, 2023

Introduction Redis is a widely used in-memory data store deployed as a cache, database, and message broker. It is well-suited for high-performance, real-time applications that need low-latency data access. Redis supports several data types, including strings, lists, sets, and hyperloglogs.

Python

Python Database Analytics Analytics

Introduction to Apache CouchDB using Python

Analytics Vidhya

JULY 23, 2022

Introduction Apache CouchDB is an open-source, document-based NoSQL database developed by the Apache Software Foundation and used by big companies like Apple, GenCorp Technologies, and Wells Fargo. CouchDB is similar to MongoDB and uses JSON (JavaScript Object Notation) to store data, […].

Python

Python Database Data Science Analytics

A Beginner’s Guide to ClickHouse Database

KDnuggets

SEPTEMBER 13, 2024

Learn how to install ClickHouse DBMS, create a database, and run SQL queries using native and Python clients.

Database

Database SQL Python Data Engineer

Web Scrapping- Tool for Data Engineering

Analytics Vidhya

SEPTEMBER 26, 2022

The post Web Scrapping- Tool for Data Engineering appeared first on Analytics Vidhya. The usefulness of the topic is one that easily helps other disciplines. Web content could be required in a way that makes it less effective to visit and use a website […].

Data Engineering

Data Engineering Data Engineering Data Engineer Data Engineering

Learning Database for Data Science Tutorial – Perform MongoDB Indexing using PyMongo

Analytics Vidhya

SEPTEMBER 15, 2020

Overview Indexing in MongoDB is a key aspect of managing and executing your database queries efficiently in data science. Learn how indexing works in […]. The post Learning Database for Data Science Tutorial – Perform MongoDB Indexing using PyMongo appeared first on Analytics Vidhya.

Data Science

Data Science Database Analytics Analytics

Introduction to Google Firebase Cloud Storage using Python

Analytics Vidhya

JULY 16, 2022

It aims to replace conventional backend servers for web and mobile applications by offering multiple services on the same platform like authentication, real-time database, Firestore (NoSQL database), cloud functions, […]. The post Introduction to Google Firebase Cloud Storage using Python appeared first on Analytics Vidhya.

Python

Python Database Data Science Analytics

One-stop-shop for Connecting Snowflake to Python!

Analytics Vidhya

MAY 25, 2021

This article was published as a part of the Data Science Blogathon. In this article, we will learn to connect the Snowflake database. The post One-stop-shop for Connecting Snowflake to Python! appeared first on Analytics Vidhya.

Python

Python Data Science Database Analytics

What are Data Access Object and Data Transfer Object in Python?

Analytics Vidhya

FEBRUARY 6, 2023

Especially while working with databases, it is often considered good practice to follow a design pattern. This ensures easy […]. The pattern is not actual code but a template that can be used to solve problems in different situations. The post What are Data Access Object and Data Transfer Object in Python? appeared first on Analytics Vidhya.

Python

Python Database Analytics Analytics
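
The DAO/DTO teaser above describes the pattern only as a template. A minimal sketch of what that template can look like in Python follows; it is not taken from the linked article. The names `UserDTO` and `UserDAO`, the `users` table, and the choice of the standard-library `sqlite3` module with an in-memory database are all assumptions made for illustration.

```python
import sqlite3
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class UserDTO:
    """Data Transfer Object: a plain, immutable carrier of one user's fields."""
    id: int
    name: str
    email: str


class UserDAO:
    """Data Access Object: the only place that knows how users are stored."""

    def __init__(self, conn: sqlite3.Connection) -> None:
        self._conn = conn
        # Hypothetical schema, created on first use for this sketch.
        self._conn.execute(
            "CREATE TABLE IF NOT EXISTS users ("
            "id INTEGER PRIMARY KEY, name TEXT NOT NULL, email TEXT NOT NULL)"
        )

    def add(self, name: str, email: str) -> UserDTO:
        cur = self._conn.execute(
            "INSERT INTO users (name, email) VALUES (?, ?)", (name, email)
        )
        self._conn.commit()
        return UserDTO(cur.lastrowid, name, email)

    def get(self, user_id: int) -> Optional[UserDTO]:
        row = self._conn.execute(
            "SELECT id, name, email FROM users WHERE id = ?", (user_id,)
        ).fetchone()
        return UserDTO(*row) if row else None


# Callers work with DTOs and never see SQL.
dao = UserDAO(sqlite3.connect(":memory:"))
created = dao.add("Ada", "ada@example.com")
fetched = dao.get(created.id)
missing = dao.get(999)
```

Keeping the SQL inside the DAO means the storage backend could be swapped without touching the code that consumes `UserDTO` objects, which is the usual argument for the pattern.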

SQL Injection: The Cyber Attack Hiding in Your Database

Analytics Vidhya

FEBRUARY 2, 2023

Introduction SQL injection is an attack in which a malicious user can insert arbitrary SQL code into a web application’s query, allowing them to gain unauthorized access to a database. An attacker can use this to steal sensitive information or make unauthorized changes to the data stored in the database.

SQL

SQL Database Analytics Analytics
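
The SQL injection teaser above describes the attack in words. As a small illustration (not taken from the linked article), the sketch below contrasts a query built by string interpolation with a parameterized query. It assumes the standard-library `sqlite3` module, an in-memory database, and a hypothetical `users` table.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (name TEXT, secret TEXT)")
conn.executemany(
    "INSERT INTO users VALUES (?, ?)",
    [("alice", "a-secret"), ("bob", "b-secret")],
)

# Attacker-controlled input that closes the string literal and adds a condition.
malicious = "nobody' OR '1'='1"

# Vulnerable: the input is pasted into the SQL text and changes the query logic.
unsafe_sql = f"SELECT name, secret FROM users WHERE name = '{malicious}'"
unsafe_rows = conn.execute(unsafe_sql).fetchall()

# Safer: the input is passed as a bound parameter and treated purely as data.
safe_rows = conn.execute(
    "SELECT name, secret FROM users WHERE name = ?", (malicious,)
).fetchall()

print(len(unsafe_rows), len(safe_rows))
```

In this sketch the interpolated query matches every row because the injected `OR '1'='1'` is always true, while the parameterized query matches none because no user has that literal name.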

Apache Cassandra Data Model(CQL) – Schema and Database Design

Analytics Vidhya

SEPTEMBER 11, 2021

Manipulating data in this manner was inconvenient and required knowing the API’s intricacies. Although the Cassandra query language is like SQL, its data modeling approaches are entirely […]. The post Apache Cassandra Data Model(CQL) – Schema and Database Design appeared first on Analytics Vidhya.

Data Modeling

Data Modeling Data Models Database SQL

Remote Data Science Jobs: 5 High-Demand Roles for Career Growth

Data Science Dojo

OCTOBER 31, 2024

Top Employers Microsoft, Facebook, and consulting firms like Accenture are actively hiring in this field of remote data science jobs, with salaries generally ranging from $95,000 to $140,000. Strong analytical skills and the ability to work with large datasets are critical, as is familiarity with data modeling and ETL processes.

Data Science

Data Science Data Scientist Machine Learning Machine Learning

Python and MySQL: A Practical Introduction for Data Analysis

Analytics Vidhya

AUGUST 25, 2021

This article was published as a part of the Data Science Blogathon. Introduction Let’s look at a practical example of how to make SQL queries to a MySQL server from Python code: CREATE, SELECT, UPDATE, JOIN, etc. Most applications interact with data in some form. Programming languages (Python is no exception) provide tools for storing […].

Data Analysis

Data Analysis Data Analysis Python SQL

Big data engineering simplified: Exploring roles of distributed systems

Data Science Dojo

JULY 24, 2023

They allow data processing tasks to be distributed across multiple machines, enabling parallel processing and scalability. Its characteristics can be summarized as follows: Volume: Big Data involves datasets that are too large to be processed by traditional database management systems. […] databases), semi-structured data (e.g., […]

Big Data

Big Data Big Data Data Engineering Data Engineering

How to connect MongoDB database with Django

Analytics Vidhya

JUNE 16, 2021

This article was published as a part of the Data Science Blogathon. Introduction Let’s consider a scenario where you are working on a […]. The post How to connect MongoDB database with Django appeared first on Analytics Vidhya.

Database

Database Data Science Analytics Analytics

Getting Started with MongoDB database for Data Science

Analytics Vidhya

APRIL 26, 2021

This article was published as a part of the Data Science Blogathon. Data Science without data is similar to fishing without fish. The post Getting Started with MongoDB database for Data Science appeared first on Analytics Vidhya.

Data Science

Data Science Database Analytics Analytics

Building a Formula 1 Streaming Data Pipeline With Kafka and Risingwave

KDnuggets

SEPTEMBER 5, 2023

Build a streaming data pipeline using Formula 1 data, Python, Kafka, RisingWave as the streaming database, and visualize all the real-time data in Grafana.

Data Pipeline

Data Pipeline Database Python Data Engineer

Navigate your way to success – Top 10 data science careers to pursue in 2023

Data Science Dojo

MAY 10, 2023

Data Engineer Data engineers are responsible for building, maintaining, and optimizing data infrastructures. They require strong programming skills, expertise in data processing, and knowledge of database management.

Data Science

Data Science Data Scientist Database Administration Machine Learning

Redis Interview Questions: Preparing You for Your First Job

Analytics Vidhya

FEBRUARY 2, 2023

Introduction Year after year, the intake for either freshers or experienced in the fields dealing with Data Science, AI/ML, and Data Engineering has been increasing rapidly. And one […] The post Redis Interview Questions: Preparing You for Your First Job appeared first on Analytics Vidhya.

ML

ML ML Database Data Engineer

Show HN: A local Python prototyping tool for Jupyter and Streamlit

Hacker News

OCTOBER 27, 2023

A Kurtosis package for Python data engineers, deploying a Jupyter notebook along with a configurable set of databases and a visualization tool (Streamlit). GitHub: galenmarchetti/jupyter-notebook-package.

Python

Python Data Engineering Data Engineer Data Engineering

Hands-On Tutorial to Analyze Data using Spark SQL

Analytics Vidhya

FEBRUARY 5, 2020

Overview Relational databases are ubiquitous, but what happens when you need to scale your infrastructure? We will discuss the role Spark SQL plays in […]. The post Hands-On Tutorial to Analyze Data using Spark SQL appeared first on Analytics Vidhya.

SQL

SQL Database Analytics Analytics

Becoming a Data Engineer: 7 Tips to Take Your Career to the Next Level

Data Science Connect

JANUARY 27, 2023

Data engineering is a crucial field that plays a vital role in the data pipeline of any organization. It is the process of collecting, storing, managing, and analyzing large amounts of data, and data engineers are responsible for designing and implementing the systems and infrastructure that make this possible.

Data Engineering

Data Engineering Data Engineering Data Engineering Data Engineer

How to Get Started as a Data Engineer

Smart Data Collective

OCTOBER 11, 2021

If you enjoy working with data, or if you’re just interested in a career with a lot of potential upward trajectory, you might consider a career as a data engineer. But what exactly does a data engineer do, and how can you begin your career in this niche? What Is a Data Engineer?

Data Engineering

Data Engineering Data Engineering Data Engineer Data Engineering

Why using Infrastructure as Code for developing Cloud-based Data Warehouse Systems?

Data Science Blog

SEPTEMBER 19, 2023

So why use IaC for cloud data infrastructures? For Data Warehouse Systems that often require powerful (and expensive) computing resources, this level of control can translate into significant cost savings. […] using for loops in Python). IaC allows these teams to collaborate more effectively.

Data Warehouse

Data Warehouse Azure SQL Database

MongoDB Replication and Sharding- A Complete Introduction

Analytics Vidhya

DECEMBER 26, 2022

This article was published as a part of the Data Science Blogathon. Introduction A NoSQL database is a non-relational database that does not use the traditional table-based schema of a relational database. NoSQL databases are often used for big data and real-time web applications.

Database

Database Big Data Big Data Data Science

Navigating the World of Data Engineering: A Beginners Guide.

Towards AI

MARCH 21, 2023

Data or data? No matter how you read or pronounce it, data always tells you a story directly or indirectly. Data engineering can be interpreted as learning the moral of the story.

Data Engineering

Data Engineering Data Engineering Data Engineering Data Engineer

How to Develop Serverless Code Using Azure Functions?

Analytics Vidhya

JANUARY 30, 2023

Whether we are analyzing IoT data streams, managing scheduled events, processing document uploads, or responding to database changes, Azure Functions allow developers […]. The post How to Develop Serverless Code Using Azure Functions? appeared first on Analytics Vidhya.

Azure

Azure Database Analytics Analytics

A Brief Introduction to Apache HBase and it’s Architecture

Analytics Vidhya

OCTOBER 12, 2022

This article was published as a part of the Data Science Blogathon. Introduction Since the 1970s, relational database management systems have solved the problems of storing and maintaining large volumes of structured data.

Hadoop

Hadoop Big Data Big Data Data Science

Introduction to Partitioned hive table and PySpark

Analytics Vidhya

OCTOBER 28, 2021

The official description of Hive is: ‘Apache Hive is a data warehouse software project built on top of Apache Hadoop for providing data query and analysis. Hive gives an SQL-like interface to query data stored in various databases and […]’.

Apache Hadoop

Apache Hadoop Data Warehouse Hadoop SQL

How To Create An Aggregation Pipeline In MongoDB

Analytics Vidhya

APRIL 12, 2021

This article was published as a part of the Data Science Blogathon. Introduction MongoDB is a free, open-source NoSQL document database. The post How To Create An Aggregation Pipeline In MongoDB appeared first on Analytics Vidhya.

SQL

SQL Data Science Database Analytics

Introduction to Apache Spark and its Datasets

Analytics Vidhya

AUGUST 17, 2022

This article was published as a part of the Data Science Blogathon. Introduction In this article, we will introduce you to the big data ecosystem and the role of Apache Spark in Big data. We will also cover the Distributed database system, the backbone of big data. In today’s world, data is the fuel.

Big Data

Big Data Big Data Data Science Database

Understanding Dask in Depth

Analytics Vidhya

FEBRUARY 5, 2023

Introduction Many different datasets are available for data scientists, machine learning engineers, and data engineers. Finding the best tools to evaluate each dataset […] The post Understanding Dask in Depth appeared first on Analytics Vidhya.

Data Scientist

Data Scientist Machine Learning Machine Learning Data Engineer

Data Science Career Paths: Analyst, Scientist, Engineer – What’s Right for You?

How to Learn Machine Learning

APRIL 26, 2025

Data science is now one of the most sought-after and lucrative career options in the data field, because businesses increasingly depend on data for decision-making, which pushes demand for data science hires to a peak. Data Sources and Collection: Everything in data science begins with data.

Data Science

Data Science Data Analyst Data Scientist Machine Learning

What Does a Data Engineer’s Career Path Look Like?

Smart Data Collective

NOVEMBER 8, 2020

Forging a Career Path in the Field of Data Science. With advancing technology, the data science space is rapidly evolving. Unlike the old days where data was readily stored and available from a single database and data scientists only needed to learn a few programming languages, data has grown with technology.

Data Engineering

Data Engineering Data Engineer Data Engineering Data Engineering

Future of Data and AI – March 2023 Edition

Data Science Dojo

MAY 18, 2023

Introduction to Python for Data Science: This lecture introduces the tools and libraries used in Python for data science and engineering. It covers basic concepts such as data processing, feature engineering, data visualization, modeling, and model evaluation. Want to dive deep into Python?

Data Science

Data Science AI AI SQL

Azure Data Engineer Jobs

Pickl AI

APRIL 6, 2023

Accordingly, one of the most in-demand roles you might be interested in is that of Azure Data Engineer. The following blog will help you learn about the Azure Data Engineering job description, salary, and certification course. How to Become an Azure Data Engineer?

Azure

Azure Data Engineering Data Engineering Data Engineering

Discover the Most Important Fundamentals of Data Engineering

Pickl AI

NOVEMBER 4, 2024

Summary: The fundamentals of Data Engineering encompass essential practices like data modelling, warehousing, pipelines, and integration. Understanding these concepts enables professionals to build robust systems that facilitate effective data management and insightful analysis. What is Data Engineering?

Data Engineering

Data Engineering Data Engineering Data Engineering Data Engineer

10 Best Data Engineering Books [Beginners to Advanced]

Pickl AI

AUGUST 1, 2023

Aspiring and experienced Data Engineers alike can benefit from a curated list of books covering essential concepts and practical techniques. These 10 Best Data Engineering Books for beginners encompass a range of topics, from foundational principles to advanced data processing methods. What is Data Engineering?

Data Engineering

Data Engineering Data Engineering Data Engineer Data Engineering

The Data Dilemma: Exploring the Key Differences Between Data Science and Data Engineering

Pickl AI

JULY 25, 2023

Unfolding the difference between data engineer, data scientist, and data analyst. Data engineers are essential professionals responsible for designing, constructing, and maintaining an organization’s data infrastructure. Read more to know.

Data Engineering

Data Engineering Data Engineering Data Engineer Data Engineering

An integrated experience for all your data and AI with Amazon SageMaker Unified Studio (preview)

Flipboard

DECEMBER 11, 2024

Organizations are building data-driven applications to guide business decisions, improve agility, and drive innovation. Many of these applications are complex to build because they require collaboration across teams and the integration of data, tools, and services. Choose the plus sign and for Notebook, choose Python 3.

SQL

SQL AWS Data Lakes AI

A Guide to Choose the Best Data Science Bootcamp

Data Science Dojo

JULY 3, 2024

Data science bootcamps are intensive short-term educational programs designed to equip individuals with the skills needed to enter or advance in the field of data science. They cover a wide range of topics, ranging from Python, R, and statistics to machine learning and data visualization.

Data Science

Data Science Machine Learning Machine Learning Data Visualization

Exploring Open-Source Innovations: 13 Companies Offering Cutting-Edge Solutions

ODSC - Open Data Science

MARCH 21, 2025

Plotly: Interactive Data Visualization. Plotly is a leader in interactive data visualization tools, offering open-source graphing libraries in Python, R, JavaScript, and more. Their solutions, including Dash, make it easier for developers and data scientists to build analytical web applications with minimal coding.

Data Scientist

Data Scientist Data Visualization Data Science Data Lakes

What Does a Data Engineering Job Involve in 2024?

ODSC - Open Data Science

JANUARY 30, 2024

Data engineering is a hot topic in the AI industry right now. And as data’s complexity and volume grow, its importance across industries will only become more noticeable. But what exactly do data engineers do? So let’s do a quick overview of the job of data engineer, and maybe you might find a new interest.

Data Engineering

Data Engineering Data Engineering Data Engineer Data Engineering

The Full Stack Data Scientist Part 6: Automation with Airflow

Applied Data Science

MAY 6, 2021

To keep myself sane, I use Airflow to automate tasks with simple, reusable pieces of code for frequently repeated elements of projects, for example: web scraping, ETL, database management, feature building and data validation, and much more! Note that we can use the core Python package datetime to help us define our DAGs.

Data Scientist

Data Scientist Python Data Science Database
