Build Python-based Machine Learning and Deep Learning Models
Book, English, 210 pages, Format (W × H): 155 mm × 235 mm, Weight: 353 g
ISBN: 978-1-4842-4960-4
Publisher: Apress
You'll start by reviewing PySpark fundamentals, such as Spark's core architecture, and see how to use PySpark for big data processing tasks such as data ingestion, cleaning, and transformation. This is followed by building workflows for analyzing streaming data using PySpark and a comparison of various streaming platforms.
You'll then see how to schedule different Spark jobs using Airflow with PySpark and examine tuning machine and deep learning models for real-time predictions. The book concludes with a discussion of graph frames and performing network analysis using graph algorithms in PySpark. All the code presented in the book will be available as Python scripts on GitHub.
What You'll Learn
- Develop pipelines for streaming data processing using PySpark
- Build Machine Learning & Deep Learning models using PySpark's latest offerings
- Perform graph analytics using PySpark
- Create sequence embeddings from text data
Data scientists and machine learning and deep learning engineers who want to learn and use PySpark for real-time analysis of streaming data.
Target audience
Professional/practitioner
Authors/Editors
Subject areas
- Mathematics | Computer Science > IT | Computer Science > Computer Science > Operating Systems > Linux Operating Systems, Open Source Operating Systems
- Mathematics | Computer Science > IT | Computer Science > Computer Science > Data / Databases > Big Data
- Mathematics | Computer Science > IT | Computer Science > Computer Science > Computer Science > Artificial Intelligence > Machine Learning
- Mathematics | Computer Science > IT | Computer Science > Computer Science > Programming | Software Development > Programming and Scripting Languages
Further Information & Material
Chapter 1: Introduction to PySpark.- Chapter 2: Data Processing.- Chapter 3: Spark Structured Streaming.- Chapter 4: Airflow.- Chapter 5: Machine Learning Library (MLlib).- Chapter 6: Supervised Machine Learning.- Chapter 7: Unsupervised Machine Learning.- Chapter 8: Deep Learning Using PySpark.