Venner, Jason
Pro Hadoop / - New Edition - New York : Apress L. P., June 2009.
Annotation You’ve heard the hype about Hadoop: it runs petabyte–scale data mining tasks insanely fast, it runs gigantic tasks on clouds for absurdly cheap, it’s been heavily committed to by tech giants like IBM, Yahoo!, and the Apache Project, and it’s completely open-source (thusfree). But what exactly is it, and more importantly, how do you even get a Hadoop cluster up and running?From Apress, the name you’ve come to trust for hands–on technical knowledge,Pro Hadoopbrings you up to speed on Hadoop. You learn the ins and outs of MapReduce; how to structure a cluster, design, and implement the Hadoop file system; and how to build your first cloud–computing tasks using Hadoop. Learn how to let Hadoop take care of distributing and parallelizing your software—you just focus on the code, Hadoop takes care of the rest.Best of all, you’ll learn from a tech professional who’s been in the Hadoop scene since day one. Written from the perspective of a principal engineer with down–in–the–trenches knowledge of what to dowrongwith Hadoop, you learn how to avoid the common, expensive first errors that everyone makes with creating their own Hadoop system or inheriting someone else’s.Skip the novice stage and the expensive, hard–to–fix mistakes...go straight to seasoned pro on the hottest cloud–computing framework withPro Hadoop. Your productivity will blow your managers away.What you’ll learnSet up a stand–alone Hadoop cluster the smart way, laid out simply and step by step so you can get up and running quickly to build your next data center, collaborative, data–intensive Internet services application, Software as a Service (SaaS), and more.Optimize your Hadoop production tasks like an experienced pro.Work with time–proven, bulletproof standard patterns that have been tested and debugged in high–volume production.Understand just enough theoretical knowledge to know why something works in Hadoop, without getting bogged down in abstruse walls of theory.Get detailed explanations of not onlyhowto do something with Hadoop, but alsowhy, from a front–line coder with years in the Hadoop game.Turn someone else’s expensive cluster–wide “wrong” into an orderly, productive "right" with professional–level debugging and testing.Who this book is forIT professionals interested in investigating Hadoop and implementing it in their organizations, and existing Hadoop users who want to deepen their professional toolkits.Table of ContentsGetting Started with Hadoop CoreThe Basics of a MapReduce JobThe Basics of Multimachine ClustersHDFS Details for Multimachine ClustersMapReduce Details for Multimachine ClustersTuning Your MapReduce JobsUnit Testing and DebuggingAdvanced and Alternate MapReduce TechniquesSolving Problems with HadoopProjects Based On Hadoop and Future Directions
9781430219422 1430219424 (Trade Paper) USD 39.99 Retail Price (Publisher)
9781430219422
00152887
005.3 / VEP
Pro Hadoop / - New Edition - New York : Apress L. P., June 2009.
Annotation You’ve heard the hype about Hadoop: it runs petabyte–scale data mining tasks insanely fast, it runs gigantic tasks on clouds for absurdly cheap, it’s been heavily committed to by tech giants like IBM, Yahoo!, and the Apache Project, and it’s completely open-source (thusfree). But what exactly is it, and more importantly, how do you even get a Hadoop cluster up and running?From Apress, the name you’ve come to trust for hands–on technical knowledge,Pro Hadoopbrings you up to speed on Hadoop. You learn the ins and outs of MapReduce; how to structure a cluster, design, and implement the Hadoop file system; and how to build your first cloud–computing tasks using Hadoop. Learn how to let Hadoop take care of distributing and parallelizing your software—you just focus on the code, Hadoop takes care of the rest.Best of all, you’ll learn from a tech professional who’s been in the Hadoop scene since day one. Written from the perspective of a principal engineer with down–in–the–trenches knowledge of what to dowrongwith Hadoop, you learn how to avoid the common, expensive first errors that everyone makes with creating their own Hadoop system or inheriting someone else’s.Skip the novice stage and the expensive, hard–to–fix mistakes...go straight to seasoned pro on the hottest cloud–computing framework withPro Hadoop. Your productivity will blow your managers away.What you’ll learnSet up a stand–alone Hadoop cluster the smart way, laid out simply and step by step so you can get up and running quickly to build your next data center, collaborative, data–intensive Internet services application, Software as a Service (SaaS), and more.Optimize your Hadoop production tasks like an experienced pro.Work with time–proven, bulletproof standard patterns that have been tested and debugged in high–volume production.Understand just enough theoretical knowledge to know why something works in Hadoop, without getting bogged down in abstruse walls of theory.Get detailed explanations of not onlyhowto do something with Hadoop, but alsowhy, from a front–line coder with years in the Hadoop game.Turn someone else’s expensive cluster–wide “wrong” into an orderly, productive "right" with professional–level debugging and testing.Who this book is forIT professionals interested in investigating Hadoop and implementing it in their organizations, and existing Hadoop users who want to deepen their professional toolkits.Table of ContentsGetting Started with Hadoop CoreThe Basics of a MapReduce JobThe Basics of Multimachine ClustersHDFS Details for Multimachine ClustersMapReduce Details for Multimachine ClustersTuning Your MapReduce JobsUnit Testing and DebuggingAdvanced and Alternate MapReduce TechniquesSolving Problems with HadoopProjects Based On Hadoop and Future Directions
9781430219422 1430219424 (Trade Paper) USD 39.99 Retail Price (Publisher)
9781430219422
00152887
005.3 / VEP