Apache Sqoop Cookbook

Publisher: O'Reilly Media
Author: Kathleen Ting
Pages: 94
Publication date: 2013-7-26
Price: USD 14.99
Binding: Paperback
ISBN: 9781449364625
Tags:
  • sqoop
  • hadoop
  • Hadoop
  • Programming
  • English original edition
  • Data analysis
  • tech
  • rdbms
  • Sqoop
  • Big Data
  • Data Integration
  • Data Migration
  • Database
  • Java
  • ETL
  • Cookbook
  • Apache
Description

Integrating data from multiple sources is essential in the age of big data, but it can be a challenging and time-consuming task. This handy cookbook provides dozens of ready-to-use recipes for using Apache Sqoop, the command-line interface application that optimizes data transfers between relational databases and Hadoop.

Sqoop is both powerful and bewildering, but with this cookbook’s problem-solution-discussion format, you’ll quickly learn how to deploy and then apply Sqoop in your environment. The authors provide MySQL, Oracle, and PostgreSQL database examples on GitHub that you can easily adapt for SQL Server, Netezza, Teradata, or other relational systems.

  • Transfer data from a single database table into your Hadoop ecosystem
  • Keep table data and Hadoop in sync by importing data incrementally
  • Import data from more than one database table
  • Customize transferred data by calling various database functions
  • Export generated, processed, or backed-up data from Hadoop to your database
  • Run Sqoop within Oozie, Hadoop’s specialized workflow scheduler
  • Load data into Hadoop’s data warehouse (Hive) or database (HBase)
  • Handle installation, connection, and syntax issues common to specific database vendors
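
To give a rough sense of the first two topics in the list above (single-table import and incremental import), here is a minimal sketch that assembles a `sqoop import` command line in Python. It is not taken from the book. The helper name `build_sqoop_import`, the JDBC URL, the table name, and the paths are all hypothetical placeholders. The flags used (`--connect`, `--username`, `--password-file`, `--table`, `--target-dir`, `--num-mappers`, `--incremental append`, `--check-column`, `--last-value`) are standard Sqoop 1 import options, but check them against the Sqoop version you run.

```python
import shlex


def build_sqoop_import(connect, table, target_dir, username,
                       password_file=None, check_column=None,
                       last_value=None, num_mappers=None):
    """Build an argument list for a single-table `sqoop import`.

    Hypothetical helper for illustration only; it does not run Sqoop.
    """
    argv = [
        "sqoop", "import",
        "--connect", connect,        # JDBC URL of the source database
        "--username", username,
        "--table", table,            # the single table to transfer
        "--target-dir", target_dir,  # HDFS directory for the output files
    ]
    if password_file is not None:
        # Read the password from a file instead of putting it on the command line.
        argv += ["--password-file", password_file]
    if check_column is not None:
        # Incremental append: only rows whose check column exceeds last_value.
        start = 0 if last_value is None else last_value
        argv += ["--incremental", "append",
                 "--check-column", check_column,
                 "--last-value", str(start)]
    if num_mappers is not None:
        argv += ["--num-mappers", str(num_mappers)]
    return argv


# Placeholder connection details (assumptions, not from the source).
full = build_sqoop_import(
    connect="jdbc:mysql://mysql.example.com/sqoop",
    table="cities",
    target_dir="/etl/input/cities",
    username="sqoop",
)
incr = build_sqoop_import(
    connect="jdbc:mysql://mysql.example.com/sqoop",
    table="visits",
    target_dir="/etl/input/visits",
    username="sqoop",
    check_column="id",
    last_value=100,
)

# Print shell-quoted commands that could be pasted into a terminal.
print(shlex.join(full))
print(shlex.join(incr))
```

Building the command as a list rather than a single string keeps each argument separate, so it can be passed to `subprocess.run` without shell quoting problems on a machine where Sqoop is installed.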

Table of Contents

Chapter 1 Getting Started
Downloading and Installing Sqoop
Installing JDBC Drivers
Installing Specialized Connectors
Starting Sqoop
Getting Help with Sqoop
Chapter 2 Importing Data
Transferring an Entire Table
Specifying a Target Directory
Importing Only a Subset of Data
Protecting Your Password
Using a File Format Other Than CSV
Compressing Imported Data
Speeding Up Transfers
Overriding Type Mapping
Controlling Parallelism
Encoding NULL Values
Importing All Your Tables
Chapter 3 Incremental Import
Importing Only New Data
Incrementally Importing Mutable Data
Preserving the Last Imported Value
Storing Passwords in the Metastore
Overriding the Arguments to a Saved Job
Sharing the Metastore Between Sqoop Clients
Chapter 4 Free-Form Query Import
Importing Data from Two Tables
Using Custom Boundary Queries
Renaming Sqoop Job Instances
Importing Queries with Duplicated Columns
Chapter 5 Export
Transferring Data from Hadoop
Inserting Data in Batches
Exporting with All-or-Nothing Semantics
Updating an Existing Data Set
Updating or Inserting at the Same Time
Using Stored Procedures
Exporting into a Subset of Columns
Encoding the NULL Value Differently
Exporting Corrupted Data
Chapter 6 Hadoop Ecosystem Integration
Scheduling Sqoop Jobs with Oozie
Specifying Commands in Oozie
Using Property Parameters in Oozie
Installing JDBC Drivers in Oozie
Importing Data Directly into Hive
Using Partitioned Hive Tables
Replacing Special Delimiters During Hive Import
Using the Correct NULL String in Hive
Importing Data into HBase
Importing All Rows into HBase
Improving Performance When Importing into HBase
Chapter 7 Specialized Connectors
Overriding Imported boolean Values in PostgreSQL Direct Import
Importing a Table Stored in Custom Schema in PostgreSQL
Exporting into PostgreSQL Using pg_bulkload
Connecting to MySQL
Using Direct MySQL Import into Hive
Using the upsert Feature When Exporting into MySQL
Importing from Oracle
Using Synonyms in Oracle
Faster Transfers with Oracle
Importing into Avro with OraOop
Choosing the Proper Connector for Oracle
Exporting into Teradata
Using the Cloudera Teradata Connector
Using Long Column Names in Teradata
Colophon

User Reviews

A reference book.

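A very short, overview-style introductory book. Sqoop is a compact and practical SQL-to-Hadoop tool that makes it easy to exchange data between relational databases or enterprise data warehouses and data stored in Hadoop. It feels like Cloudera will gradually come to stand out in the big data tooling field!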

Compact and practical, concise and easy to read.

It solves problems in a question-and-answer format and is very brief; personally, I think it is quite good.
