Hadoop: The Definitive Guide
Tags: Hadoop, Big Data, Computing, Distributed Systems, Machine Learning, O'Reilly
Published on 2025-03-12
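Damn, this is long. It covers most of the tools in the ecosystem, so it works well for summary and review. Readers without hands-on experience should read the first two parts, on the MapReduce and YARN core, and then skim the rest just to learn what each tool is for.
Very comprehensive. The main value is in the first two parts, especially the MapReduce section. The later material on clusters and the various related projects can simply be skimmed; it is not covered in much detail, and the Apache documentation is enough when you actually need it.
T^T I bought the reprint edition, and it is very thick.
I read the first half, on principles, in the English fourth edition, and coasted through the second half, on related projects and case studies, in the Chinese third edition. It is typical Definitive Guide style: plenty of substance and plenty of padding. Hadoop itself is complex and hard to use, so it would be only natural for Spark to overthrow it.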
Tom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He works for Cloudera, a company set up to offer Hadoop support and training. Previously he was an independent Hadoop consultant, working with companies to set up, use, and extend Hadoop. He has written numerous articles for O'Reilly, java.net, and IBM's developerWorks, and has spoken at several conferences, including ApacheCon 2008, where he spoke on Hadoop. Tom has a Bachelor's degree in Mathematics from the University of Cambridge and a Master's in Philosophy of Science from the University of Leeds, UK.
Get ready to unlock the power of your data. With the fourth edition of this comprehensive guide, you’ll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters.
Using Hadoop 2 exclusively, author Tom White presents new chapters on YARN and several Hadoop-related projects such as Parquet, Flume, Crunch, and Spark. You’ll learn about recent changes to Hadoop, and explore new case studies on Hadoop’s role in healthcare systems and genomics data processing.
Learn fundamental components such as MapReduce, HDFS, and YARN
Explore MapReduce in depth, including steps for developing applications with it
Set up and maintain a Hadoop cluster running HDFS and MapReduce on YARN
Learn two data formats: Avro for data serialization and Parquet for nested data
Use data ingestion tools such as Flume (for streaming data) and Sqoop (for bulk data transfer)
Understand how high-level data processing tools like Pig, Hive, Crunch, and Spark work with Hadoop
Learn the HBase distributed database and the ZooKeeper distributed configuration service
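I was lucky enough to win the second Chinese edition of Hadoop: The Definitive Guide in a Douban China-pub prize draw. Compared with the first edition, it adds chapters on Hive and Sqoop, the translation quality has improved considerably, and the English index has been retained. The book's coverage of Hadoop is fairly comprehensive; readers itching to get hands-on can pretty much take the book, along with Google and Baidu, and get started right away. My personal impression is "...
I logged in specifically to post this review. The translation is terrible. I strongly recommend that anyone who reads English well get the original instead of wasting money on this edition. Besides the errors in the text, even the figures are wrong: in the figure on page 260, for example, the last two years should be 1901, but here they are 1900. I am speechless. Seeing a classic translated like this would infuriate the author. zsbd zsbd zsbd...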