Apache Spark 2 for Beginners by Rajanarayanan Thottuvaikkatumana


Key Features

  • This book offers an easy introduction to the Spark framework, based on the latest version of Apache Spark 2
  • Perform efficient data processing, machine learning, and graph processing using various Spark components
  • A practical guide aimed at beginners to get them up and running with Spark

Book Description

Spark is one of the most widely used large-scale data processing engines and runs extremely fast. It is a framework whose tools are equally useful for application developers and data scientists.

This book starts with the fundamentals of Spark 2 and covers the core data processing framework and API, installation, and application development setup. Then the Spark programming model is introduced through real-world examples, followed by Spark SQL programming with DataFrames. An introduction to SparkR is covered next. Later, we cover the charting and plotting features of Python in conjunction with Spark data processing. After that, we take a look at Spark's stream processing, machine learning, and graph processing libraries. The last chapter combines all the skills you learned from the preceding chapters to develop a real-world Spark application.

By the end of this book, you will have all the knowledge you need to develop efficient large-scale applications using Apache Spark.

What you'll learn

  • Get to know the fundamentals of Spark 2 and the Spark programming model using Scala and Python
  • Know how to use Spark SQL and DataFrames using Scala and Python
  • Get an introduction to Spark programming using R
  • Perform Spark data processing, charting, and plotting using Python
  • Get acquainted with Spark stream processing using Scala and Python
  • Be introduced to machine learning using Spark MLlib
  • Get started with graph processing using Spark GraphX
  • Bring together all that you have learned and develop a complete Spark application

About the Author

Rajanarayanan Thottuvaikkatumana, Raj, is a seasoned technologist with more than 23 years of software development experience at various multinational companies. He has lived and worked in India, Singapore, and the USA, and is presently based out of the UK. His experience includes architecting, designing, and developing software applications. He has worked on various technologies including major databases, application development platforms, web technologies, and big data technologies. Since 2000, he has been working mainly in Java-related technologies, and does heavy-duty server-side programming in Java and Scala. He has worked on very highly concurrent, highly distributed, and high transaction volume systems. Currently he is building a next-generation Hadoop YARN-based data processing platform and an application suite built with Spark using Scala.

Raj holds one master's degree in mathematics and one master's degree in computer information systems, and has many certifications in ITIL and cloud computing to his credit. Raj is the author of Cassandra Design Patterns - Second Edition, published by Packt.

When not working on the assignments his day job demands, Raj is an avid listener to classical music and watches a lot of tennis.

Table of Contents

  1. Spark Fundamentals
  2. Spark Programming Model
  3. Spark SQL
  4. Spark Programming with R
  5. Spark Data Analysis with Python
  6. Spark Stream Processing
  7. Spark Machine Learning
  8. Spark Graph Processing
  9. Designing Spark Applications


Similar programming algorithms books

GPU Solutions to Multi-scale Problems in Science and Engineering (Lecture Notes in Earth System Sciences)

This book covers the new topic of GPU computing with many applications involved, taken from diverse fields such as networking, seismology, fluid mechanics, nano-materials, data mining, earthquakes, mantle convection, and visualization. It will show the public why GPU computing is important and easy to use.

Modeling and Optimization of Air Traffic

This book combines the research activities of the authors, both of whom are researchers at Ecole Nationale de l’Aviation Civile (French National School of Civil Aviation), and presents their findings from the last 15 years. Their work uses air transport as its focal point, within the realm of mathematical optimization, looking at real-life problems and theoretical models in tandem, and the challenges that accompany studying both approaches.

Integer Algorithms in Cryptology and Information Assurance

Integer Algorithms in Cryptology and Information Assurance is a collection of the author's own innovative approaches in algorithms and protocols for secret and reliable communication. It concentrates on the “what” and “how” behind implementing the proposed cryptographic algorithms rather than on formal proofs of “why” these algorithms work.

Understanding and Using Rough Set Based Feature Selection: Concepts, Techniques and Applications

The book will provide: 1) an in-depth explanation of rough set theory along with examples of the concepts; 2) a detailed discussion of the idea of feature selection; 3) details of various representative and state-of-the-art feature selection techniques along with algorithmic explanations; 4) a critical review of state-of-the-art rough set based feature selection methods covering the strengths and weaknesses of each.

