Learning Spark

Author: Holden Karau
Publisher: O'Reilly Media, Inc.
ISBN: 144935906X
Size: 63.88 MB
Format: PDF
View: 4377
This book introduces Apache Spark, the open source cluster computing system that makes data analytics fast to write and fast to run.

Learning Spark

Author: Holden Karau
Publisher: O'Reilly Media, Inc.
ISBN: 1449359051
Size: 27.76 MB
Format: PDF, ePub
View: 4282
This edition includes new information on Spark SQL, Spark Streaming, setup, and Maven coordinates. Written by the developers of Spark, this book will have data scientists and engineers up and running in no time.

Learning Spark

Author: Mark Hamstra
Publisher: O'Reilly Media
ISBN: 9781449358624
Size: 55.40 MB
Format: PDF, ePub, Mobi
View: 5605
Subtitle on cover: Lightning-fast data analysis.

High Performance Spark

Author: Holden Karau
Publisher: O'Reilly Media, Inc.
ISBN: 1491943173
Size: 14.92 MB
Format: PDF
View: 3327
With this book, you’ll explore: how Spark SQL’s new interfaces improve performance over SQL’s RDD data structure; the choice between data joins in Core Spark and Spark SQL; techniques for getting the most out of standard RDD ...

Big Data Analytics With Spark

Author: Mohammed Guller
Publisher: Apress
ISBN: 1484209648
Size: 42.26 MB
Format: PDF, Kindle
View: 1114
So reading this book and absorbing its principles will provide a boost—possibly a big boost—to your career.

Advanced Analytics With Spark

Author: Sandy Ryza
Publisher: O'Reilly Media, Inc.
ISBN: 1491912731
Size: 33.88 MB
Format: PDF
View: 3625
In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark.

Machine Learning With Spark

Author: Nick Pentreath
Publisher: Packt Publishing Ltd
ISBN: 1783288523
Size: 16.39 MB
Format: PDF, ePub
View: 1997
If you are a Scala, Java, or Python developer with an interest in machine learning and data analysis and are eager to learn how to apply common machine learning techniques at scale using the Spark framework, this is the book for you.

Programming Pig

Author: Alan Gates
Publisher: O'Reilly Media, Inc.
ISBN: 1491937068
Size: 52.77 MB
Format: PDF, ePub
View: 3577
You’ll find comprehensive coverage of key features such as the Pig Latin scripting language and the Grunt shell. When you need to analyze terabytes of data, this book shows you how to do it efficiently with Pig.

Big Data Smack

Author: Raul Estrada
Publisher: Apress
ISBN: 1484221753
Size: 72.57 MB
Format: PDF, ePub
View: 5942
This book covers the five main concepts of data pipeline architecture and how to integrate, replace, and reinforce every layer: the engine (Apache Spark), the container (Apache Mesos), the model (Akka), the storage (Apache Cassandra), the ...

Data Analytics With Hadoop

Author: Benjamin Bengfort
Publisher: O'Reilly Media, Inc.
ISBN: 1491913762
Size: 20.54 MB
Format: PDF, ePub
View: 692
Ready to use statistical and machine-learning techniques across large data sets? This practical guide shows you why the Hadoop ecosystem is perfect for the job.