Mastering Spark for Data Science

Mastering Spark for Data Science
Author : Andrew Morgan
Publisher :
Total Pages : 541
Release : 2017-01-31
ISBN 10 : 1785882147
ISBN 13 : 9781785882142
Language : EN, FR, DE, ES & NL

Mastering Spark for Data Science Book Description:

Unlock the complexities of lightning fast data scienceAbout This Book*Develop and apply advanced analytical techniques with Spark*Learn how to tell a compelling story in data science using Spark's ecosystem*Explore data at a scale and work with cutting edge data science methodsWho This Book Is ForThis book is for those who have beginner-level familiarity with the Spark architecture and data science applications, who are looking for a challenge and want to learn cutting edge techniques. This book assumes working knowledge of data science, common machine learning methods, and popular data science tools, and assumes you have previously run proof of concept studies and built prototypes.What You Will Learn*Learn the design patterns that integrate Spark into with industrialized data science pipelines*Understand how commercial data scientists design scalable code and reusable code for data science services*Get a grasp of the new cutting edge data science methods so you can study trends and causality*Find out how to use Spark as a universal ingestion engine tool and as a web scraper*Practice the implementation of advanced topics in graph processing, such as community detection and contact chaining*Get to know the best practices when performing Extended Exploratory Data Analysis, commonly used in commercial data science teams*Grasp advanced Spark concepts, as well as solution design patterns and integration architectures*Demonstrate powerful data science pipelines*Get detailed guidance on how to run Spark in productionIn DetailThe purpose of data science is to transform the world using data, and this goal is mainly achieved through disrupting and changing real processes in real industries. To operate at this level, you need to be able to build data science solutions of substance; ones that solve real problems, and that can run reliably enough for people to trust and act on. Spark has emerged as the big data platform of choice for data scientists.This book deep dives into Spark to deliver production-grade data science solutions that are innovative, disruptive, and reliable enough to be trusted. We demonstrate the process through exploring the construction of a sophisticated global news analysis service that uses Spark to generate continuous geopolitical and current affairs insights. We use the core Spark APIs and take a deep-dive into advanced libraries including: Spark SQL, visual streaming, MLlib, and more.We introduce advanced techniques and methods to help you build data science solutions, and show you how to construct commercial grade data products. Using a sequence of tutorials that deliver a working news intelligence service, we explain advanced Spark architectures, unveil sophisticated data science methods, demonstrate how to work with geographic data in Spark, and explain how to tune Spark algorithms so they scale linearly.

Mastering Spark for Data Science
Language: en
Pages: 541
Authors: Andrew Morgan
Categories:
Type: BOOK - Published: 2017-01-31 - Publisher:

Unlock the complexities of lightning fast data scienceAbout This Book*Develop and apply advanced analytical techniques with Spark*Learn how to tell a compelling
Mastering Spark for Data Science
Language: en
Pages: 560
Authors: Andrew Morgan
Categories: Computers
Type: BOOK - Published: 2017-03-29 - Publisher: Packt Publishing Ltd

Master the techniques and sophisticated analytics used to construct Spark-based solutions that scale to deliver production-grade data science products About Thi
Mastering Spark with R
Language: en
Pages: 296
Authors: Javier Luraschi
Categories: Computers
Type: BOOK - Published: 2019-10-07 - Publisher: O'Reilly Media

If you’re like most R users, you have deep knowledge and love for statistics. But as your organization continues to collect huge amounts of data, adding tools
Mastering Spark with R
Language: en
Pages: 296
Authors: Javier Luraschi
Categories: Big data
Type: BOOK - Published: 2019-10-18 - Publisher:

"Authors Javier Luraschi, Kevin Kuo, and Edgar Ruiz show you how to combine R with Spark to analyze data at scale. This book covers relevant data science topics
Mastering Java for Data Science
Language: en
Pages: 364
Authors: Alexey Grigorev
Categories: Computers
Type: BOOK - Published: 2017-04-27 - Publisher: Packt Publishing Ltd

Use Java to create a diverse range of Data Science applications and bring Data Science into production About This Book An overview of modern Data Science and Ma
Mastering Python for Data Science
Language: en
Pages: 294
Authors: Samir Madhavan
Categories: Computers
Type: BOOK - Published: 2015-08-31 - Publisher: Packt Publishing Ltd

Explore the world of data science through Python and learn how to make sense of data About This Book Master data science methods using Python and its libraries
Mastering Apache Spark 2.x
Language: en
Pages: 354
Authors: Romeo Kienzler
Categories: Computers
Type: BOOK - Published: 2017-07-26 - Publisher: Packt Publishing Ltd

Advanced analytics on your Big Data with latest Apache Spark 2.x About This Book An advanced guide with a combination of instructions and practical examples to
Mastering Machine Learning with Spark 2.x
Language: en
Pages: 340
Authors: Alex Tellez
Categories: Computers
Type: BOOK - Published: 2017-08-31 - Publisher: Packt Publishing Ltd

Unlock the complexities of machine learning algorithms in Spark to generate useful data insights through this data analysis tutorial About This Book Process and
Apache Spark 2: Data Processing and Real-Time Analytics
Language: en
Pages: 616
Authors: Romeo Kienzler
Categories: Computers
Type: BOOK - Published: 2018-12-21 - Publisher: Packt Publishing Ltd

Build efficient data flow and machine learning programs with this flexible, multi-functional open-source cluster-computing framework Key Features Master the art
Apache Spark 2
Language: en
Pages: 616
Authors: Romeo Kienzler
Categories: Computers
Type: BOOK - Published: 2018-12-18 - Publisher:

Build efficient data flow and machine learning programs with this flexible, multi-functional open-source cluster-computing framework Key Features Master the art
Back to Top