Data Engineering for Machine Learning Pipelines

book24h

Power User
LV
5
 
Csatlakozás
2024.09.10.
Üzenetek
20,532
Reakció pontszám
2
Díjak
5
Kor
37
ff23817521828e89f6377297bd26a209.webp

Free Download Data Engineering for Machine Learning Pipelines: From Python Libraries to ML Pipelines and Cloud Platforms
English | 2024 | ASIN : B0D8XR4NJJ | 631 Pages | PDF | 33 MB
The book begins by explaining data analytics and transformation, delving into the Pandas library, its capabilities, and nuances. It then explores emerging libraries such as Polars and CuDF, providing insights into GPU-based computing and cutting-edge data manipulation techniques. The text discusses the importance of data validation in engineering processes, introducing tools such as Great Expectations and Pandera to ensure data quality and reliability. The book delves into API design and development, with a specific focus on leveraging the power of FastAPI. It covers authentication, authorization, and real-world applications, enabling you to construct efficient and secure APIs using FastAPI. Also explored is concurrency in data engineering, examining Dask's capabilities from basic setup to crafting advanced machine learning pipelines. The book includes development and delivery of data engineering pipelines using leading cloud platforms such as AWS, Google Cloud, and Microsoft Azure. The concluding chapters concentrate on real-time and streaming data engineering pipelines, emphasizing Apache Kafka and workflow orchestration in data engineering. Workflow tools such as Airflow and Prefect are introduced to seamlessly manage and automate complex data workflows.


Buy Premium From My Links To Get Resumable Support,Max Speed & Support Me
Code:
            
                
                
                    
                   
                    A kód megtekintéséhez jelentkezz be.
					Please log in to view the code.
                
            
        
Links are Interchangeable - Single Extraction
 
Top Alul