Hadoop in Practice, 2nd Edition

Publisher Manning
Pages 512
Year 2014
Language English
ISBN 9781617292224
File size 9.9 MB
File format pdf

Hadoop in Practice, Second Edition provides over 100 tested, instantly useful techniques that will help you conquer big data using Hadoop. This revised edition covers changes and new features in the Hadoop core architecture, including MapReduce 2. Brand-new chapters cover YARN and integrating Kafka, Impala, and Spark SQL with Hadoop. You'll also get new and updated techniques for Flume, Sqoop...

Fast Data Processing with Spark

Publisher Packt Publishing
Pages 120
Year 2013
Language English
ISBN 9781782167068
File size 11.0 MB
File format pdf

Spark is a framework for writing fast, distributed programs. Spark solves problems similar to those Hadoop MapReduce addresses, but with a fast in-memory approach and a clean functional-style API. With its ability to integrate with Hadoop and built-in tools for interactive query analysis (Shark), large-scale graph processing and analysis (Bagel), and real-time analysis (Spark Streaming), it can be interactivel...

Programming Elastic MapReduce

Publisher O'Reilly Media
Author Kevin Schmidt, Christopher Phillips
Pages 174
Year 2013
Language English
ISBN 9781449363628
File size 19.2 MB
File format pdf

Although you don't need a large computing infrastructure to process massive amounts of data with Apache Hadoop, it can still be difficult to get started. This practical guide shows you how to quickly launch data analysis projects in the cloud by using Amazon Elastic MapReduce (EMR), the hosted Hadoop framework in Amazon Web Services (AWS). Authors Kevin Schmidt and Christopher Phillips demo...

Fast Data Processing with Spark, 2nd Edition

Publisher Packt Publishing
Pages 184
Year 2015
Language English
ISBN 9781784392574
File size 14.2 MB
File format pdf

Spark is a framework for writing fast, distributed programs. Spark solves problems similar to those Hadoop MapReduce addresses, but with a fast in-memory approach and a clean functional-style API. With its ability to integrate with Hadoop and built-in tools for interactive query analysis (Spark SQL), large-scale graph processing and analysis (GraphX), and real-time analysis (Spark Streaming), it can be ...

Writing and Querying MapReduce Views in CouchDB

Publisher O'Reilly Media
Pages 76
Year 2011
Language English
ISBN 9781449303129
File size 1.9 MB
File format pdf

Learn how to create MapReduce views in CouchDB that let you query the document-oriented database for meaningful data. With this concise ebook, you'll get step-by-step instructions and plenty of sample code to create and explore several MapReduce views, using an example database you construct.

HBase in Action

Publisher Manning
Pages 360
Year 2012
Language English
ISBN 9781617290527
File size 9.8 MB
File format pdf

HBase is a NoSQL storage system designed for fast, random access to large volumes of data. It runs on commodity hardware and scales smoothly from modest datasets to billions of rows and millions of columns. HBase in Action is an experience-driven guide that shows you how to design, build, and run applications using HBase. First, it introduces you to the fundamentals of handling big data. Th...

Seven Concurrency Models in Seven Weeks

Publisher The Pragmatic Programmers
Pages 300
Year 2014
Language English
ISBN 9781937785659
File size 4.4 MB
File format pdf

Your software needs to leverage multiple cores, handle thousands of users and terabytes of data, and continue working in the face of both hardware and software failure. Concurrency and parallelism are the keys, and Seven Concurrency Models in Seven Weeks equips you for this new world. See how emerging technologies such as actors and functional programming address issues with traditional threads an...

MapReduce Design Patterns

Publisher O'Reilly Media
Pages 252
Year 2012
Language English
ISBN 9781449327170
File size 9.5 MB
File format pdf

Until now, design patterns for the MapReduce framework have been scattered among various research papers, blogs, and books. This handy guide brings together a unique collection of valuable MapReduce patterns that will save you time and effort regardless of the domain, language, or development framework you're using. Each pattern is explained in context, with pitfalls and caveats clearly ide...

Hadoop MapReduce Cookbook

Publisher Packt Publishing
Pages 300
Year 2013
Language English
ISBN 9781849517287
File size 4.1 MB
File format pdf

Learn to process large and complex data sets, starting simply and then diving deep. Solve complex big data problems such as classification, finding relationships, online marketing, and recommendations. The book offers more than 50 Hadoop MapReduce recipes, presented in a simple and straightforward manner, with step-by-step instructions and real-world examples.

Hadoop: The Definitive Guide, 3rd Edition

Publisher O'Reilly Media
Pages 630
Year 2012
Language English
ISBN 9781449311520
File size 9.1 MB
File format pdf

With this digital Early Release edition of Hadoop: The Definitive Guide, you get the entire book bundle in its earliest form (the author's raw and unedited content), so you can take advantage of this content long before the book's official release. You'll also receive updates when significant changes are made. Ready to unleash the power of your massive dataset? With the latest edition of this com...