Spark

Name: Spark
Brand: John Wiley & Sons Inc
SKU: 9781119254010
Price: 52.99 EUR
Availability: InStock

Ilya Ganelin | Ema Orhian | Kai Sasaki | Brennon York

In stock with our UK publisher. 14-28 days. 14 days return policy.

Delivery/Collection within 10-20 working days

Product details

ISBN 9781119254010
Weight: 372g
Dimensions: 188 x 236mm
Publication Date: 29 Apr 2016
Publisher: John Wiley & Sons Inc
Publication City/Country: US
Product Form: Paperback

Production-targeted Spark guidance with real-world use cases

Spark: Big Data Cluster Computing in Production goes beyond general Spark overviews to provide targeted guidance on using lightning-fast big data cluster computing in production. Written by an expert team well known in the big data community, this book walks you through the challenges of moving from proof-of-concept or demo Spark applications to live Spark in production. Real use cases provide deep insight into common problems, limitations, challenges, and opportunities, while expert tips and tricks help you get the most out of Spark performance. Coverage includes Spark SQL, Tachyon, Kerberos, MLlib, YARN, and Mesos, with clear, actionable guidance on resource scheduling, database connectors, streaming, security, and much more.

Spark has become the tool of choice for many big data problems, with more active contributors than any other Apache Software Foundation project. General introductory books abound, but this book is the first to provide deep insight and real-world advice on using Spark in production. Specific guidance, expert tips, and invaluable foresight make this guide an incredibly useful resource for real production settings.

Review Spark hardware requirements and estimate cluster size
Gain insight from real-world production use cases
Tighten security, schedule resources, and fine-tune performance
Overcome common problems encountered using Spark in production

Spark works with other big data tools, including MapReduce and Hadoop, and uses languages you already know, such as Java, Scala, Python, and R. Lightning speed makes Spark too good to pass up, but understanding limitations and challenges in advance goes a long way toward easing actual production implementation. Spark: Big Data Cluster Computing in Production tells you everything you need to know, with real-world production insight and expert guidance, tips, and tricks.

Ilya Ganelin is a data engineer working at the Capital One Data Innovation Lab. Ilya is an active contributor to the core components of Apache Spark and a committer to Apache Apex.

Ema Orhian is a Big Data Engineer interested in scaling algorithms. She is the main committer on jaws-spark-sql-rest, a data warehouse explorer on top of Spark SQL.

Kai Sasaki is a software engineer working in distributed computing and machine learning. He is a Spark contributor who mainly develops the MLlib and ML libraries.

Brennon York has been a core contributor to Apache Spark since 2014, including development on GraphX and the core build environment.
