Loading from Multiple Data Sources with Oracle Loader for Hadoop

Posted by mannamal on Oracle Blogs
Published on Wed, 16 Oct 2013 15:20:06 +0000

Oracle Loader for Hadoop can load data from multiple data sources (for example, Hive and HBase) and in multiple formats (for example, Apache weblogs and JSON files). There are two ways to do this:

(1) Use an input format implementation.  Oracle Loader for Hadoop includes several input format implementations.  In addition, a user can develop their own input format implementation for proprietary data sources and formats.

(2) Leverage the capabilities of Hive, and use Oracle Loader for Hadoop to load from Hive.
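
To make the idea behind approach (1) concrete, here is a minimal sketch of what an input format does conceptually: it turns raw records from differently formatted sources into rows with the same named fields, ready to be loaded. This is only an illustration in Python and is not the Oracle Loader for Hadoop API; real input formats are Java classes built on Hadoop's InputFormat. The field list, the function names `parse_weblog` and `parse_json`, and the sample log line are all assumptions made for this example.

```python
import json
import re

# Apache Common Log Format, for example:
# 127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] "GET /apache_pb.gif HTTP/1.0" 200 2326
CLF_PATTERN = re.compile(
    r'(?P<host>\S+) (?P<ident>\S+) (?P<user>\S+) '
    r'\[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" '
    r'(?P<status>\d{3}) (?P<size>\d+|-)'
)

# Hypothetical target columns; a real load would match the target table.
FIELDS = ("host", "time", "request", "status")


def parse_weblog(line):
    """Parse one Apache weblog line into a dict of FIELDS, or None if it does not match."""
    match = CLF_PATTERN.match(line)
    if match is None:
        return None
    return {name: match.group(name) for name in FIELDS}


def parse_json(line):
    """Parse one JSON object per line into the same dict shape, or None if invalid."""
    try:
        obj = json.loads(line)
    except json.JSONDecodeError:
        return None
    if not isinstance(obj, dict):
        return None
    # Normalize every present value to a string so both sources yield the same types.
    return {
        name: (None if obj.get(name) is None else str(obj[name]))
        for name in FIELDS
    }


if __name__ == "__main__":
    weblog_line = (
        '127.0.0.1 - frank [10/Oct/2000:13:55:36 -0700] '
        '"GET /apache_pb.gif HTTP/1.0" 200 2326'
    )
    json_line = '{"host": "10.0.0.5", "time": "t", "request": "GET / HTTP/1.1", "status": 404}'
    print(parse_weblog(weblog_line))
    print(parse_json(json_line))
```

Both parsers return records with identical keys, which is the property that lets a single loading step handle several source formats. Approach (2) reaches the same result by letting Hive present the data as a table, so no per-format parsing code has to be written.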

These approaches are discussed in our Oracle OpenWorld 2013 presentation.

© Oracle Blogs or respective owner