Hadoop in Practice, 2nd Edition

Hadoop in Practice, 2nd Edition

ناشر Manning
نویسنده
تعداد صفحه 512
سال 2014
زبان انگلیسی
شابک 9781617292224
حجم فایل 9.9 MB
نوع فایل pdf
دانلود   و   ادامه مطلب

Hadoop in Practice, Second Edition provides over 100 tested, instantly useful techniques that will help you conquer big data, using Hadoop. This revised new edition covers changes and new features in the Hadoop core architecture, including MapReduce 2. Brand new chapters cover YARN and integrating Kafka, Impala, and Spark SQL with Hadoop. You'll also get new and updated techniques for Flume, Sqoop...

Fast Data Processing with Spark

Fast Data Processing with Spark

ناشر Packt Publishing
نویسنده
تعداد صفحه 120
سال 2013
زبان انگلیسی
شابک 9781782167068
حجم فایل 11.0 MB
نوع فایل pdf
دانلود   و   ادامه مطلب

Spark is a framework for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and inbuilt tools for interactive query analysis (Shark), large-scale graph processing and analysis (Bagel), and real-time analysis (Spark Streaming), it can be interactivel...

Programming Elastic MapReduce

Programming Elastic MapReduce

ناشر O'Reilly Media
نویسنده
تعداد صفحه 174
سال 2013
زبان انگلیسی
شابک 9781449363628
حجم فایل 19.2 MB
نوع فایل pdf
دانلود   و   ادامه مطلب

Although you don't need a large computing infrastructure to process massive amounts of data with Apache Hadoop, it can still be difficult to get started. This practical guide shows you how to quickly launch data analysis projects in the cloud by using Amazon Elastic MapReduce (EMR), the hosted Hadoop framework in Amazon Web Services (AWS). Authors Kevin Schmidt and Christopher Phillips demo...

Fast Data Processing with Spark, 2nd Edition

Fast Data Processing with Spark, 2nd Edition

ناشر Packt Publishing
نویسنده
تعداد صفحه 184
سال 2015
زبان انگلیسی
شابک 9781784392574
حجم فایل 14.2 MB
نوع فایل pdf
دانلود   و   ادامه مطلب

Spark is a framework used for writing fast, distributed programs. Spark solves similar problems as Hadoop MapReduce does, but with a fast in-memory approach and a clean functional style API. With its ability to integrate with Hadoop and built-in tools for interactive query analysis (Spark SQL), large-scale graph processing and analysis (GraphX), and real-time analysis (Spark Streaming), it can be ...

Writing and Querying MapReduce Views in CouchDB

Writing and Querying MapReduce Views in CouchDB

ناشر O'Reilly Media
نویسنده
تعداد صفحه 76
سال 2011
زبان انگلیسی
شابک 9781449303129
حجم فایل 1.9 MB
نوع فایل pdf
دانلود   و   ادامه مطلب

Learn how to create MapReduce views in CouchDB that let you query the document-oriented database for meaningful data. With this short and concise ebook, you'll get step-by-step instructions and lots of sample code to create and explore several MapReduce views, using an example database you construct.

HBase in Action

HBase in Action

ناشر Manning
نویسنده
تعداد صفحه 360
سال 2012
زبان انگلیسی
شابک 9781617290527
حجم فایل 9.8 MB
نوع فایل pdf
دانلود   و   ادامه مطلب

HBase is a NoSQL storage system designed for fast, random access to large volumes of data. It runs on commodity hardware and scales smoothly from modest datasets to billions of rows and millions of columns. HBase in Action is an experience-driven guide that shows you how to design, build, and run applications using HBase. First, it introduces you to the fundamentals of handling big data. Th...

Seven Concurrency Models in Seven Weeks

Seven Concurrency Models in Seven Weeks

ناشر The Pragmatic Programmers
نویسنده
تعداد صفحه 300
سال 2014
زبان انگلیسی
شابک 9781937785659
حجم فایل 4.4 MB
نوع فایل pdf
دانلود   و   ادامه مطلب

Your software needs to leverage multiple cores, handle thousands of users and terabytes of data, and continue working in the face of both hardware and software failure. Concurrency and parallelism are the keys, and Seven Concurrency Models in Seven Weeks equips you for this new world. See how emerging technologies such as actors and functional programming address issues with traditional threads an...

MapReduce Design Patterns

MapReduce Design Patterns

ناشر O'Reilly Media
نویسنده
تعداد صفحه 252
سال 2012
زبان انگلیسی
شابک 9781449327170
حجم فایل 9.5 MB
نوع فایل pdf
دانلود   و   ادامه مطلب

Until now, design patterns for the MapReduce framework have been scattered among various research papers, blogs, and books. This handy guide brings together a unique collection of valuable MapReduce patterns that will save you time and effort regardless of the domain, language, or development framework you're using. Each pattern is explained in context, with pitfalls and caveats clearly ide...

Hadoop MapReduce Cookbook

Hadoop MapReduce Cookbook

ناشر Packt Publishing
نویسنده
تعداد صفحه 300
سال 2013
زبان انگلیسی
شابک 9781849517287
حجم فایل 4.1 MB
نوع فایل pdf
دانلود   و   ادامه مطلب

Learn to process large and complex data sets, starting simply, then diving in deep. Solve complex big data problems such as classifications, finding relationships, online marketing and recommendations. More than 50 Hadoop MapReduce recipes, presented in a simple and straightforward manner, with step-by-step instructions and real world examples.

Hadoop: The Definitive Guide, 3rd Edition

Hadoop: The Definitive Guide, 3rd Edition

ناشر O'Reilly Media
نویسنده
تعداد صفحه 630
سال 2012
زبان انگلیسی
شابک 9781449311520
حجم فایل 9.1 MB
نوع فایل pdf
دانلود   و   ادامه مطلب

With this digital Early Release edition of Hadoop: The Definitive Guide, you get the entire book bundle in its earliest form - the author's raw and unedited content - so you can take advantage of this content long before the book's official release. You'll also receive updates when significant changes are made. Ready to unleash the power of your massive dataset? With the latest edition of this com...