Thanks for the question, Doug.
Asked: April 27, 2009 - 9:09 pm UTC
Last updated: April 28, 2009 - 10:17 am UTC
Version: 10.2
Viewed 1000+ times
You Asked
Tom,
I'm looking for a way to decrease the total run time for a set of ETL processes that, by one measure, operate on about 60 GB of information in each, notionally daily, processing cycle. The source system is a billing application on an IBM z/OS "mainframe," and the billing information is not in a relational database, but in files organized for index access that are known to users of IBM mainframes as VSAM files. The destination system is an Oracle 10gR2 database on an AIX host.
Our existing ETL processes involve extracting from the source system to "flat" files and transferring those files to the database's host by FTP. We present the files to the database as external tables and accomplish the initial transformation and load tasks with multi-table INSERTs from the external tables. Formerly, we were able to write our extract files once, rather than twice, because we made the files available to the Oracle database via NFS. However, we've had to abandon NFS and adopt FTP in something of a rush, because the mainframe's NFS server took a large share of CPU resources that have somewhat suddenly become much more scarce and valuable.
I'd like to find and consider ways to load directly into the Oracle database across the network, without having to read and write the extract files an extra time in the course of transferring them by FTP. At first, putting an Oracle client home on the mainframe and using SQL*Loader seemed to be a possibility worth looking into. However, I was brought up short when I learned that Oracle Database 10gR2 is the terminal release of the Oracle Database on the z/OS platform. I doubt I could persuade anyone (including myself) that using an Oracle 10gR2 client home on z/OS would be a good idea, while we move on to Oracle 11g, "12h", and "13i". I'm hoping you have information that will get me started off in a more fruitful direction.
Thanks.
and Tom said...
I asked Marc Connolly (one of our mainframe experts) to comment and he wrote:
...
Tom,
OK, given the desire to "load directly into the Oracle database across the network", there is an alternative to sqlldr on z/OS from Oracle, and that is the "Oracle® Database Gateway for IMS, VSAM, and Adabas":
http://docs.oracle.com/cd/B28359_01/gateways.111/b32526/toc.htm Architecturally, the gateway is simple in design:
http://docs.oracle.com/cd/B28359_01/gateways.111/b32526/gettingstarted.htm#i1005661 Beyond the Oracles' Heterogenous Services, there are two components which need to be installed. The first is Oracle Connect, which is installed under z/OS as a started task. Oracle Connect is responsible for access (read/write) directly (or indirectly via CICS) against VSAM files. The second component is Oracle Studio which is stand alone Java GUI application which attaches itself to Oracle Connect running on z/OS and is used to describe the layout of and access to VSAM files on the mainframe:
http://docs.oracle.com/cd/B28359_01/gateways.111/b32526/vsamdata_access.htm#CIHFDIGG Once both components are installed and configured, access to VSAM files and the records they contain is relatively straight forward. A good reference guide is the "Oracle® Database Gateway for VSAM User's Guide 11g Release 1 (11.1) Part Number B31054-01 here:
http://docs.oracle.com/cd/B28359_01/gateways.111/b31054/title.htm Hope this helps....
=======================================
Oracle Sales Support - IBM
Oracle Global Sales Support
.....
Rating
(2 ratings)
Is this answer out of date? If it is, please let us know via a Comment