Hadoop: The Definitive Guide 在线电子书 图书标签: Hadoop 大数据 BigData 计算机 分布式 hadoop 机器学习 O'Reilly
发表于2025-01-22
Hadoop: The Definitive Guide 在线电子书 pdf 下载 txt下载 epub 下载 mobi 下载 2025
经典
评分读了前3部分,该看源码去了。
评分第四版全面基于hadoop2,相比前版进行一些重要增添和顺序调整,之前版本就不要看了。继续那么全面而透彻实用。
评分真尼玛长。介绍了生态圈里的大部分工具,用来总结回顾比较适合,没有实践过的读者看前两部分mr和yarn核心,扫一遍后面所有工具是做什么用的就可以了。
评分经典
Tom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He works for Cloudera, a company set up to offer Hadoop support and training. Previously he was as an independent Hadoop consultant, working with companies to set up, use, and extend Hadoop. He has written numerous articles for O'Reilly, java.net and IBM's developerWorks, and has spoken at several conferences, including at ApacheCon 2008 on Hadoop. Tom has a Bachelor's degree in Mathematics from the University of Cambridge and a Master's in Philosophy of Science from the University of Leeds, UK.
Get ready to unlock the power of your data. With the fourth edition of this comprehensive guide, you’ll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters.
Using Hadoop 2 exclusively, author Tom White presents new chapters on YARN and several Hadoop-related projects such as Parquet, Flume, Crunch, and Spark. You’ll learn about recent changes to Hadoop, and explore new case studies on Hadoop’s role in healthcare systems and genomics data processing.
Learn fundamental components such as MapReduce, HDFS, and YARN
Explore MapReduce in depth, including steps for developing applications with it
Set up and maintain a Hadoop cluster running HDFS and MapReduce on YARN
Learn two data formats: Avro for data serialization and Parquet for nested data
Use data ingestion tools such as Flume (for streaming data) and Sqoop (for bulk data transfer)
Understand how high-level data processing tools like Pig, Hive, Crunch, and Spark work with Hadoop
Learn the HBase distributed database and the ZooKeeper distributed configuration service
很好的Hadoop教程,比Apache和Yahoo !网页版guide详细很多,很多想不明白的Hadoop实现细节都可以在这本书里找到。
评分Cobub Razor APP数据统计分析工具官网上有篇文章是讲Hadoop Yarn调度器的选择和使用的,我觉得写的挺好的,推荐http://www.cobub.com/the-selection-and-use-of-hadoop-yarn-scheduler/
评分你的履历添了一笔<hadoop权威指南>译者,但是你不配 这是我见过的最不用心的翻译, 字里行间行文不通顺, 请别勉强自己,map reduce shuffle机制都没翻译的好 虽然原作者写作功底也实在是一般 第 1 2 5 6 7 这几章 翻译的实在是太烂了 请不要呐Google翻译糊弄人阿 误人子弟 ...
评分详见:http://www.cnblogs.com/aprilrain/archive/2013/03/07/2947664.html
评分-- china-pub 赠书活动 -- http://www.douban.com/group/topic/20965935/ 一直比较忙,整本书还没读完,只是粗略翻了个大概,其中有两三章细读了一遍。先做个大体评价吧,有时间全部细读后再评论。 从书的内容上来讲,大致上与网上该书的内容介绍一致。简单点概括:这本书对...
Hadoop: The Definitive Guide 在线电子书 pdf 下载 txt下载 epub 下载 mobi 下载 2025