MapReduce Design Patterns
Tags: MapReduce, Big Data, O'Reilly, Data Mining, Computer Science, Patterns, Design, Computing
Posted on 2024-11-22
It got me started, though it drags a little.
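I read this book around 2013 and got a great deal out of it at the time; it covers most of the common ways of processing data with MapReduce. These days, though, it seems Hive is enough.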
It shows you how to implement SQL JOINs, aggregate functions, and the like in MapReduce.
Still mulling it over; it needs more time to digest…
Reasonably practical, but now that Pig has matured, much of this is no longer something you need to understand.
Design patterns for the MapReduce framework, until now, have been scattered among various research papers, blogs, and books. This handy guide brings together a unique collection of valuable MapReduce patterns that will save you time and effort regardless of the domain, language, or development framework you're using.

Each pattern is explained in context, with pitfalls and caveats clearly identified, so you can avoid some of the common design mistakes when modeling your Big Data architecture. This book also provides a complete overview of MapReduce that explains its origins and implementations, and why design patterns are so important. Hadoop MapReduce code is provided to help you learn how to apply the design patterns by example.

Topics include:
Basic patterns, including map-only filter, group by, aggregation, distinct, and limit
Joins: traditional reduce-side join, reduce-side join with Bloom filter, replicated join with distributed cache, merge join, Cartesian products, and intersections
Binning, sharding for other systems, sorting, sampling, unions, and other patterns for organizing data
Job optimization patterns, including multi-job map-only job folding, and overloading the key grouping to perform two jobs at once