Streaming Machine Learning Without a Data Lake
ODSC - Open Data Science
MAY 31, 2023
Commonly used technologies for data storage include the Hadoop Distributed File System (HDFS), Amazon S3, Google Cloud Storage (GCS), and Azure Blob Storage, alongside tools like Apache Hive, Apache Spark, and TensorFlow for data processing and analytics. Yes, many people still need a data lake, but only for their relevant data, not for all enterprise data.