ETL for Oracle to PostgreSQL 1 - Oracle Data Integrator (ODI)

本文涉及的产品
云数据库 RDS SQL Server,基础系列 2核4GB
云原生数据库 PolarDB 分布式版,标准版 2核8GB
RDS PostgreSQL Serverless,0.5-4RCU 50GB 3个月
推荐场景:
对影评进行热评分析
简介:

标签

PostgreSQL , Oracle , ETL , Oracle Data Integrator , ODI


背景

原文

https://www.cdata.com/kb/tech/postgresql-jdbc-odi.rst

正文

ETL PostgreSQL in Oracle Data Integrator

This article shows how to transfer PostgreSQL data into a data warehouse using Oracle Data Integrator.

Leverage existing skills by using the JDBC standard to read and write to PostgreSQL: Through drop-in integration into ETL tools like Oracle Data Integrator (ODI), the CData JDBC Driver for PostgreSQL connects real-time PostgreSQL data to your data warehouse, business intelligence, and Big Data technologies.

JDBC connectivity enables you to work with PostgreSQL just as you would any other database in ODI. As with an RDBMS, you can use the driver to connect directly to the PostgreSQL APIs in real time instead of working with flat files.

This article walks through a JDBC-based ETL -- PostgreSQL to Oracle. After reverse engineering a data model of PostgreSQL entities, you will create a mapping and select a data loading strategy -- since the driver supports SQL-92, this last step can easily be accomplished by selecting the built-in SQL to SQL Loading Knowledge Module.

Install the Driver

To install the driver, copy the driver JAR and .lic file, located in the installation folder, into the ODI userlib directory:

On Unix:

~/.odi/oracledi/userlib  

On Windows:

%APPDATA%\Roaming\odi\oracledi\userlib  

Restart ODI to complete the installation.

Reverse Engineer a Model

Reverse engineering the model retrieves metadata about the driver's relational view of PostgreSQL data. After reverse engineering, you can query real-time PostgreSQL data and create mappings based on PostgreSQL tables.

1、In ODI, connect to your repository and click New -> Model and Topology Objects.

2、On the Model screen of the resulting dialog, enter the following information:

  • Name: Enter PostgreSQL.
  • Technology: Select Generic SQL.
  • Logical Schema: Enter PostgreSQL.
  • Context: Select Global.

3、On the Data Server screen of the resulting dialog, enter the following information:

  • Technology: Select Oracle.
  • Name: Enter PostgreSQL.
  • Driver List: Select Oracle JDBC Driver.
  • Driver: Enter cdata.jdbc.postgresql.PostgreSQLDriver
  • URL: Enter the JDBC URL containing the connection string. Below is a typical connection string:
jdbc:postgresql:User=postgres;Password=admin;Database=postgres;Server=127.0.0.1;Port=5432  

To connect to PostgreSQL, set the Server, Port (the default port is 5432), and Database connection properties and set the User and Password you wish to use to authenticate to the server. If the Database property is not specified, the data provider connects to the user's default database.

4、On the Physical Schema screen, enter the following information:

  • Schema (Schema): Enter PostgreSQL.
  • Schema (Work Schema): Enter PostgreSQL.
    pic

5、In the opened model click Reverse Engineer to retrieve the metadata for PostgreSQL tables.

Edit and Save PostgreSQL Data

After reverse engineering you can now work with PostgreSQL data in ODI. To edit and save PostgreSQL data, expand the Models accordion in the Designer navigator, right-click a table, and click Data. Click Refresh to pick up any changes to the data. Click Save Changes when you are finished making changes.

pic

Create an ETL Project

Follow the steps below to create an ETL from PostgreSQL. You will load Orders entities into the sample data warehouse included in the ODI Getting Started VM.

1、Open SQL Developer and connect to your Oracle database. Right-click the node for your database in the Connections pane and click new SQL Worksheet.

Alternatively you can use SQLPlus. From a command prompt enter the following:

sqlplus / as sysdba  

2、Enter the following query to create a new target table in the sample data warehouse, which is in the ODI_DEMO schema. The following query defines a few columns that match the Orders table in PostgreSQL:

CREATE TABLE ODI_DEMO.TRG_ORDERS (SHIPCITY NUMBER(20,0),ShipName VARCHAR2(255));  

3、In ODI expand the Models accordion in the Designer navigator and double-click the Sales Administration node in the ODI_DEMO folder. The model is opened in the Model Editor.

4、Click Reverse Engineer. The TRG_ORDERS table is added to the model.

5、Right-click the Mappings node in your project and click New Mapping. Enter a name for the mapping and clear the Create Empty Dataset option. The Mapping Editor is displayed.

6、Drag the TRG_ORDERS table from the Sales Administration model onto the mapping.

7、Drag the Orders table from the PostgreSQL model onto the mapping.

8、Click the source connector point and drag to the target connector point. The Attribute Matching dialog is displayed. For this example, use the default options. The target expressions are then displayed in
the properties for the target columns.

9、Open the Physical tab of the Mapping Editor and click ORDERS_AP in TARGET_GROUP.

10、In the ACCOUNT_AP properties, select LKM SQL to SQL (Built-In) on the Loading Knowledge Module tab.

pic

You can then run the mapping to load PostgreSQL data into Oracle. (反之亦可)

相关实践学习
使用PolarDB和ECS搭建门户网站
本场景主要介绍如何基于PolarDB和ECS实现搭建门户网站。
阿里云数据库产品家族及特性
阿里云智能数据库产品团队一直致力于不断健全产品体系,提升产品性能,打磨产品功能,从而帮助客户实现更加极致的弹性能力、具备更强的扩展能力、并利用云设施进一步降低企业成本。以云原生+分布式为核心技术抓手,打造以自研的在线事务型(OLTP)数据库Polar DB和在线分析型(OLAP)数据库Analytic DB为代表的新一代企业级云原生数据库产品体系, 结合NoSQL数据库、数据库生态工具、云原生智能化数据库管控平台,为阿里巴巴经济体以及各个行业的企业客户和开发者提供从公共云到混合云再到私有云的完整解决方案,提供基于云基础设施进行数据从处理、到存储、再到计算与分析的一体化解决方案。本节课带你了解阿里云数据库产品家族及特性。
目录
相关文章
|
4月前
|
Oracle 关系型数据库 数据库
【赵渝强老师】在PostgreSQL中访问Oracle
本文介绍了如何在PostgreSQL中使用oracle_fdw扩展访问Oracle数据库数据。首先需从Oracle官网下载三个Instance Client安装包并解压,设置Oracle环境变量。接着从GitHub下载oracle_fdw扩展,配置pg_config环境变量后编译安装。之后启动PostgreSQL服务器,在数据库中创建oracle_fdw扩展及外部数据库服务,建立用户映射。最后通过创建外部表实现对Oracle数据的访问。文末附有具体操作步骤与示例代码。
146 6
【赵渝强老师】在PostgreSQL中访问Oracle
|
12月前
|
Oracle NoSQL 关系型数据库
主流数据库对比:MySQL、PostgreSQL、Oracle和Redis的优缺点分析
主流数据库对比:MySQL、PostgreSQL、Oracle和Redis的优缺点分析
2048 3
|
分布式计算 DataWorks 关系型数据库
DataWorks操作报错合集之使用连接串模式新增PostgreSQL数据源时遇到了报错"not support data sync channel, error code: 0001",该怎么办
DataWorks是阿里云提供的一站式大数据开发与治理平台,支持数据集成、数据开发、数据服务、数据质量管理、数据安全管理等全流程数据处理。在使用DataWorks过程中,可能会遇到各种操作报错。以下是一些常见的报错情况及其可能的原因和解决方法。
|
SQL Oracle 关系型数据库
关系型数据库Oracle Data Guard
【7月更文挑战第11天】
107 1
|
SQL 监控 Oracle
关系型数据库Oracle 的Data Guard:
【7月更文挑战第7天】
220 3
|
Oracle 关系型数据库 数据库
|
人工智能 Oracle 关系型数据库
一篇文章弄懂Oracle和PostgreSQL的Database Link
一篇文章弄懂Oracle和PostgreSQL的Database Link
|
SQL Oracle 关系型数据库
常用数据库的分页语句(mySQL、oracle、PostgreSQL、SQL Server)
常用数据库的分页语句(mySQL、oracle、PostgreSQL、SQL Server)
|
Cloud Native 关系型数据库 OLAP
从0~1,基于DMS面向AnalyticDB PostgreSQL的数据ETL链路开发
在传统数仓中,往往采用资源预购的方式,缺少面向业务的资源调整灵活性。 在数据分析这种存在明显业务波峰波谷或分时请求的场景下,实例无法按需使用,造成了大量成本浪费。云原生数仓AnalyticDB PostgreSQL产品自2022年2月正式发布了Serverless版之后,依托于内核强大的资源管理能力...
|
Oracle 关系型数据库 数据库

相关产品

  • 云原生数据库 PolarDB
  • 推荐镜像

    更多