Programming Elastic MapReduce
(eBook)

Book Cover
Average Rating
Published
Sebastopol, CA : O'Reilly Media, ©2014.
Format
eBook
ISBN
1449363628, 9781449363628, 9781449364045, 1449364047
Physical Desc
1 online resource (1 volume) : illustrations
Status

Description

Loading Description...

Also in this Series

Checking series information...

More Like This

Loading more titles like this title...

Syndetics Unbound

More Details

Language
English

Notes

Description
Although you don't need a large computing infrastructure to process massive amounts of data with Apache Hadoop, it can still be difficult to get started. This practical guide shows you how to quickly launch data analysis projects in the cloud by using Amazon Elastic MapReduce (EMR), the hosted Hadoop framework in Amazon Web Services (AWS). Authors Kevin Schmidt and Christopher Phillips demonstrate best practices for using EMR and various AWS and Apache technologies by walking you through the construction of a sample MapReduce log analysis application. Using code samples and example configurations, you'll learn how to assemble the building blocks necessary to solve your biggest data analysis problems. Get an overview of the AWS and Apache software tools used in large-scale data analysis Go through the process of executing a Job Flow with a simple log analyzer Discover useful MapReduce patterns for filtering and analyzing data sets Use Apache Hive and Pig instead of Java to build a MapReduce Job Flow Learn the basics for using Amazon EMR to run machine learning algorithms Develop a project cost model for using Amazon EMR and other AWS tools.
Local note
O'Reilly,O'Reilly Online Learning: Academic/Public Library Edition

Reviews from GoodReads

Loading GoodReads Reviews.

Citations

APA Citation, 7th Edition (style guide)

Schmidt, K. J., & Phillips, C. (2014). Programming Elastic MapReduce . O'Reilly Media.

Chicago / Turabian - Author Date Citation, 17th Edition (style guide)

Schmidt, Kevin J and Chris Phillips. 2014. Programming Elastic MapReduce. O'Reilly Media.

Chicago / Turabian - Humanities (Notes and Bibliography) Citation, 17th Edition (style guide)

Schmidt, Kevin J and Chris Phillips. Programming Elastic MapReduce O'Reilly Media, 2014.

MLA Citation, 9th Edition (style guide)

Schmidt, Kevin J., and Chris Phillips. Programming Elastic MapReduce O'Reilly Media, 2014.

Note! Citations contain only title, author, edition, publisher, and year published. Citations should be used as a guideline and should be double checked for accuracy. Citation formats are based on standards as of August 2021.

Staff View

Grouped Work ID
51b557da-9ad9-3e1b-0f99-9150be6a4781-eng
Go To Grouped Work

Grouping Information

Grouped Work ID51b557da-9ad9-3e1b-0f99-9150be6a4781-eng
Full titleprogramming elastic mapreduce
Authorschmidt kevin j
Grouping Categorybook
Last Update2024-04-16 12:23:35PM
Last Indexed2024-04-17 02:50:35AM

Book Cover Information

Image Sourcesyndetics
First LoadedNov 23, 2022
Last UsedDec 15, 2023

Marc Record

First DetectedNov 09, 2022 03:44:24 PM
Last File Modification TimeApr 16, 2024 12:31:12 PM

MARC Record

LEADER04156cam a2200661 a 4500
001ocn870275289
003OCoLC
00520240405112445.0
006m     o  d        
007cr unu||||||||
008140210s2014    caua    o     001 0 eng d
019 |a 968071670|a 969048298
020 |a 1449363628
020 |a 9781449363628
020 |a 9781449364045
020 |a 1449364047
020 |z 9781449363628
0291 |a DEBBG|b BV041783842
0291 |a DEBSZ|b 404335535
0291 |a GBVCP|b 882725556
035 |a (OCoLC)870275289|z (OCoLC)968071670|z (OCoLC)969048298
037 |a CL0500000380|b Safari Books Online
040 |a UMI|b eng|e pn|c UMI|d COO|d DEBBG|d CUS|d DEBSZ|d OCLCQ|d OCLCF|d OCLCQ|d FEM|d OCLCQ|d CEF|d UAB|d AU@|d OCLCO|d OCLCQ|d OCLCO|d OCLCL
049 |a TKLA
050 4|a QA76.9.D5|b S36 2014
08204|a 004|q OCoLC
1001 |a Schmidt, Kevin J.|q (Kevin James)|1 https://id.oclc.org/worldcat/entity/E39PCjvrdTp6htf3Vwj93rvY6C|0 http://id.loc.gov/authorities/names/no2001084779
24510|a Programming Elastic MapReduce /|c Kevin Schmidt and Christopher Phillips.
2461 |i Subtitle on cover:|a Using AWS services to build an end-to-end application
260 |a Sebastopol, CA :|b O'Reilly Media,|c ©2014.
300 |a 1 online resource (1 volume) :|b illustrations
336 |a text|b txt|2 rdacontent
337 |a computer|b c|2 rdamedia
338 |a online resource|b cr|2 rdacarrier
347 |a text file
520 |a Although you don't need a large computing infrastructure to process massive amounts of data with Apache Hadoop, it can still be difficult to get started. This practical guide shows you how to quickly launch data analysis projects in the cloud by using Amazon Elastic MapReduce (EMR), the hosted Hadoop framework in Amazon Web Services (AWS). Authors Kevin Schmidt and Christopher Phillips demonstrate best practices for using EMR and various AWS and Apache technologies by walking you through the construction of a sample MapReduce log analysis application. Using code samples and example configurations, you'll learn how to assemble the building blocks necessary to solve your biggest data analysis problems. Get an overview of the AWS and Apache software tools used in large-scale data analysis Go through the process of executing a Job Flow with a simple log analyzer Discover useful MapReduce patterns for filtering and analyzing data sets Use Apache Hive and Pig instead of Java to build a MapReduce Job Flow Learn the basics for using Amazon EMR to run machine learning algorithms Develop a project cost model for using Amazon EMR and other AWS tools.
5880 |a Online resource; title from title page (Safari, viewed January 30, 2014).
590 |a O'Reilly|b O'Reilly Online Learning: Academic/Public Library Edition
63000|a Apache Hadoop.|0 http://id.loc.gov/authorities/names/n2013024279
63007|a Apache Hadoop.|2 blmlsh
63007|a Apache Hadoop|2 fast
650 0|a Electronic data processing|x Distributed processing.|0 http://id.loc.gov/authorities/subjects/sh85042293
650 0|a Big data.|0 http://id.loc.gov/authorities/subjects/sh2012003227
650 0|a Web services.|0 http://id.loc.gov/authorities/subjects/sh2003001435
650 0|a Internet programming.|0 http://id.loc.gov/authorities/subjects/sh96009904
650 6|a Traitement réparti.
650 6|a Données volumineuses.
650 6|a Services Web.
650 6|a Programmation Internet.
65017|a Internet programming.|2 bisacsh
650 7|a Big data|2 fast
650 7|a Electronic data processing|x Distributed processing|2 fast
650 7|a Internet programming|2 fast
650 7|a Web services|2 fast
7001 |a Phillips, Chris,|d 1971-|1 https://id.oclc.org/worldcat/entity/E39PCjvBd6Hr8DJk3JcpHfKJ9P|0 http://id.loc.gov/authorities/names/n2013003142
758 |i has work:|a Programming elastic mapreduce (Text)|1 https://id.oclc.org/worldcat/entity/E39PCGDQwhyGVWV93cWtbcKMmq|4 https://id.oclc.org/worldcat/ontology/hasWork
85640|u https://ezproxy.knoxlib.org/login?url=https://learning.oreilly.com/library/view/~/9781449364038/?ar
994 |a 92|b TKL