Gratis Versand ab € 16,99. Mehr Infos.
Bookbot

Guide to High Performance Distributed Computing

Case Studies with Hadoop, Scalding and Spark

Buchbewertung

Mehr zum Buch

This timely text/reference describes the development and implementation of large-scale distributed processing systems using open source tools and technologies. Comprehensive in scope, the book presents state-of-the-art material on building high performance distributed computing systems, providing practical guidance and best practices as well as describing theoretical software frameworks. Features: describes the fundamentals of building scalable software systems for large-scale data processing in the new paradigm of high performance distributed computing; presents an overview of the Hadoop ecosystem, followed by step-by-step instruction on its installation, programming and execution; Reviews the basics of Spark, including resilient distributed datasets, and examines Hadoop streaming and working with Scalding; Provides detailed case studies on approaches to clustering, data classification and regression analysis; Explains the process of creating a working recommender system using Scalding and Spark.

Buchkauf

Guide to High Performance Distributed Computing, M. Srinivasa Sarma

Sprache
Erscheinungsdatum
2015
product-detail.submit-box.info.binding
(Hardcover)
Wir benachrichtigen dich per E-Mail.

Lieferung

  • Gratis Versand ab 16,99 € in ganz Österreich! Mehr Infos.

Zahlungsmethoden

4,0
Sehr gut
1 Bewertung

Hier könnte deine Bewertung stehen.

Titel
Guide to High Performance Distributed Computing
Untertitel
Case Studies with Hadoop, Scalding and Spark
Sprache
Englisch
Verlag
Springer
Erscheinungsdatum
2015
Einband
Hardcover
Seitenzahl
321
ISBN10
3319134965
ISBN13
9783319134963
Reihe
Bewertung
4 von 5 Sternen
Beschreibung
This timely text/reference describes the development and implementation of large-scale distributed processing systems using open source tools and technologies. Comprehensive in scope, the book presents state-of-the-art material on building high performance distributed computing systems, providing practical guidance and best practices as well as describing theoretical software frameworks. Features: describes the fundamentals of building scalable software systems for large-scale data processing in the new paradigm of high performance distributed computing; presents an overview of the Hadoop ecosystem, followed by step-by-step instruction on its installation, programming and execution; Reviews the basics of Spark, including resilient distributed datasets, and examines Hadoop streaming and working with Scalding; Provides detailed case studies on approaches to clustering, data classification and regression analysis; Explains the process of creating a working recommender system using Scalding and Spark.