MapReduce Design Patterns 在線電子書 圖書標籤: MapReduce 大數據 O'Reilly 數據挖掘 計算機科學 Patterns Design 計算機
發表於2024-11-22
MapReduce Design Patterns 在線電子書 pdf 下載 txt下載 epub 下載 mobi 下載 2024
還算有點實用,不過隨著pig的成熟,很多東東其實不需要瞭解瞭
評分花瞭大概3-4個小時快速看完,溫習瞭一下Input/OutputFormat, RecordReader/Writer, InputSplit,基本沒收獲,比較適閤剛會寫MapReduce的碼農們快速瀏覽一遍
評分如何編寫一個真正的不依賴於in-memory sort的find median算法?單機版本的max-N都已經是O(N)的瞭,Hadoop版本的(作者這裏描述的)就有點弱瞭
評分如何編寫一個真正的不依賴於in-memory sort的find median算法?單機版本的max-N都已經是O(N)的瞭,Hadoop版本的(作者這裏描述的)就有點弱瞭
評分就告訴你如何用MR實現SQL中的JOIN、聚閤函數等
Design patterns for the MapReduce framework, until now, have been scattered among various research papers, blogs, and books. This handy guide brings together a unique collection of valuable MapReduce patterns that will save you time and effort regardless of the domain, language, or development framework you're using. Each pattern is explained in context, with pitfalls and caveats clearly identified - so you can avoid some of the common design mistakes when modeling your Big Data architecture. This book also provides a complete overview of MapReduce that explains its origins and implementations, and why design patterns are so important. Hadoop MapReduce code is provided to help you learn how to apply the design patterns by example. Topics include: Basic patterns, including map-only filter, group by, aggregation, distinct, and limit Joins: traditional reduce-side join, reduce-side join with Bloom filter, replicated join with distributed cache, merge join, Cartesian products, and intersections Binning, sharding for other systems, sorting, sampling, unions, and other patterns for organizing data Job optimization patterns, including multi-job map-only job folding, and overloading the key grouping to perform two jobs at once
評分
評分
評分
評分
MapReduce Design Patterns 在線電子書 pdf 下載 txt下載 epub 下載 mobi 下載 2024