Apache Flume: Distributed Log Collection for Hadoop 在线电子书 pdf 下载 txt下载 epub 下载 mobi 下载 2024


Apache Flume: Distributed Log Collection for Hadoop

简体网页||繁体网页
Steve Hoffman 作者
Packt Publishing Ltd
译者
2013-7 出版日期
108 页数
0 价格
丛书系列
9781782167914 图书编码

Apache Flume: Distributed Log Collection for Hadoop 在线电子书 图书标签: 分布式   


喜欢 Apache Flume: Distributed Log Collection for Hadoop 在线电子书 的读者还喜欢




点击这里下载
    

想要找书就要到 图书目录大全
立刻按 ctrl+D收藏本页
你会得到大惊喜!!

发表于2024-06-30


Apache Flume: Distributed Log Collection for Hadoop 在线电子书 epub 下载 mobi 下载 pdf 下载 txt 下载 2024

Apache Flume: Distributed Log Collection for Hadoop 在线电子书 epub 下载 mobi 下载 pdf 下载 txt 下载 2024

Apache Flume: Distributed Log Collection for Hadoop 在线电子书 pdf 下载 txt下载 epub 下载 mobi 下载 2024



Apache Flume: Distributed Log Collection for Hadoop 在线电子书 用户评价

评分

数据

评分

数据

评分

工具书籍

评分

数据

评分

数据

Apache Flume: Distributed Log Collection for Hadoop 在线电子书 著者简介

Steve Hoffman has 30 years of software development experience and holds

a B.S. in computer engineering from the University of Illinois Urbana-Champaign

and a M.S. in computer science from the DePaul University. He is currently

a Principal Engineer at Orbitz Worldwide.

More information on Steve can be found at http://bit.ly/bacoboy or on

Twitter @bacoboy .

This is Steve's first book.


Apache Flume: Distributed Log Collection for Hadoop 在线电子书 图书目录


Apache Flume: Distributed Log Collection for Hadoop 在线电子书 pdf 下载 txt下载 epub 下载 mobi 在线电子书下载

Apache Flume: Distributed Log Collection for Hadoop 在线电子书 图书描述

Hadoop is a great open source tool for sifting tons of unstructured data into something

manageable, so that your business can gain better insight into your customers, needs.

It is cheap (can be mostly free), scales horizontally as long as you have space and

power in your data center, and can handle problems your traditional data warehouse

would be crushed under. That said, a little known secret is that your Hadoop cluster

requires you to feed it with data; otherwise, you just have a very expensive heat

generator. You will quickly find, once you get past the “playing around” phase

with Hadoop, that you will need a tool to automatically feed data into your cluster.

In the past, you had to come up with a solution for this problem, but no more! Flume

started as a project out of Cloudera when their integration engineers had to keep

writing tools over and over again for their customers to import data automatically.

Today the project lives with the Apache Foundation, is under active development,

and boasts users who have been using it in their production environments for years.

In this book I hope to get you up and running quickly with an architectural overview

of Flume and a quick start guide. After that we’ll deep-dive into the details on many

of the more useful Flume components, including the very important File Channel

for persistence of in-flight data records and the HDFS Sink for buffering and writing

data into HDFS, the Hadoop Distributed File System. Since Flume comes with

a wide variety of modules, chances are that the only tool you’ll need to get started

is a text editor for the configuration file.

By the end of the book, you should know enough to build out a highly available,

fault tolerant, streaming data pipeline feeding your Hadoop cluster.

Apache Flume: Distributed Log Collection for Hadoop 在线电子书 下载 mobi epub pdf txt 在线电子书下载

想要找书就要到 图书目录大全
立刻按 ctrl+D收藏本页
你会得到大惊喜!!

Apache Flume: Distributed Log Collection for Hadoop 在线电子书 读后感

评分

评分

评分

评分

评分

类似图书 点击查看全场最低价

Apache Flume: Distributed Log Collection for Hadoop 在线电子书 pdf 下载 txt下载 epub 下载 mobi 下载 2024


分享链接





Apache Flume: Distributed Log Collection for Hadoop 在线电子书 相关图书




本站所有内容均为互联网搜索引擎提供的公开搜索信息,本站不存储任何数据与内容,任何内容与数据均与本站无关,如有需要请联系相关搜索引擎包括但不限于百度google,bing,sogou

友情链接

© 2024 book.wenda123.org All Rights Reserved. 图书目录大全 版权所有