Data Algorithms with Spark: Recipes and Design Patterns for Scaling Up using PySpark | Agenda Bookshop Skip to content
Selected Colleen Hoover Books at €9.99c | In-store & Online
Selected Colleen Hoover Books at €9.99c | In-store & Online
A01=Mahmoud Parsian
Age Group_Uncategorized
Age Group_Uncategorized
Author_Mahmoud Parsian
automatic-update
Category1=Non-Fiction
Category=UN
COP=United States
Delivery_Delivery within 10-20 working days
Language_English
PA=Available
Price_€50 to €100
PS=Active
softlaunch

Data Algorithms with Spark: Recipes and Design Patterns for Scaling Up using PySpark

English

By (author): Mahmoud Parsian

Apache Spark's speed, ease of use, sophisticated analytics, and multilanguage support makes practical knowledge of this cluster-computing framework a required skill for data engineers and data scientists. With this hands-on guide, anyone looking for an introduction to Spark will learn practical algorithms and examples using PySpark. In each chapter, author Mahmoud Parsian shows you how to solve a data problem with a set of Spark transformations and algorithms. You'll learn how to tackle problems involving ETL, design patterns, machine learning algorithms, data partitioning, and genomics analysis. Each detailed recipe includes PySpark algorithms using the PySpark driver and shell script. With this book, you will: Learn how to select Spark transformations for optimized solutions Explore powerful transformations and reductions including reduceByKey(), combineByKey(), and mapPartitions() Understand data partitioning for optimized queries Build and apply a model using PySpark design patterns Apply motif-finding algorithms to graph data Analyze graph data by using the GraphFrames API Apply PySpark algorithms to clinical and genomics data Learn how to use and apply feature engineering in ML algorithms Understand and use practical and pragmatic data design patterns See more
Current price €65.44
Original price €76.99
Save 15%
A01=Mahmoud ParsianAge Group_UncategorizedAuthor_Mahmoud Parsianautomatic-updateCategory1=Non-FictionCategory=UNCOP=United StatesDelivery_Delivery within 10-20 working daysLanguage_EnglishPA=AvailablePrice_€50 to €100PS=Activesoftlaunch
Delivery/Collection within 10-20 working days
Product Details
  • Dimensions: 178 x 232mm
  • Publication Date: 30 Apr 2022
  • Publisher: O'Reilly Media
  • Publication City/Country: United States
  • Language: English
  • ISBN13: 9781492082385

About Mahmoud Parsian

Mahmoud Parsian Ph.D. in Computer Science is a practicing software professional with 30 years of experience as a developer designer architect and author. For the past 15 years he has been involved in Java server-side databases MapReduce Spark PySpark and distributed computing.

Customer Reviews

No reviews yet
0%
(0)
0%
(0)
0%
(0)
0%
(0)
0%
(0)
We use cookies to ensure that we give you the best experience on our website. If you continue we'll assume that you are understand this. Learn more
Accept