Skip to Main Content
  • Questions
  • How Oracle Warehouse Builder compares to Informatica?

Breadcrumb

Question and Answer

Tom Kyte

Thanks for the question, Alex .

Asked: December 04, 2002 - 5:20 pm UTC

Last updated: April 20, 2010 - 10:02 am UTC

Version: 9i

Viewed 1000+ times

You Asked

Dear Tom,

I don't know if you ever worked with Oracle's OWB tool, but here is my question.
I've worked in the past with Informatica, Datastage, and MS DTS (Microsoft Data Transformation Services). I consider Informatica the most complex and harder to learn, followed by Datastage, and having DTS to be the easiest to learn but also offering much less functionality than the other 2. I have an opportunity to work on a project that utilizes Oracle Warehouse Builder and I would like to know how it compares to the above 3 ETL tools, as far as easiness to learn (can you learn it on the job or it is advised to go to the 4-day training), complexity, and functionality.

I'll appreciate your feedback.

Thank you,
Alex Kourtis

and Tom said...

I asked Derrick Cameron, our resident BI/DW/Tools expert and he says:

...
I can't make a direct comparison between OWB and these products without first hand experience using them. However, they all have a similar common goal, and as such would have some common characteristics. So learning OWB would be easier than not having used any tool at all.

In many ways OWB should be relatively easier due to its support for a single target - that is the Oracle database (in fact flat files too, but not other dbs). This reduces ( and optimizes) the possible ways it would generate code, which would simplify diagnostic and deployment activities. For example, OWB uses pl/sql to process data (platform independent) while others use C, which introduces OS compatibility issues. If you are famliar with sql/plsql (most developers are) you can expose the code OWB generates and see what it is doing. This is rarely required, but it's a good thing.

The tool is all GUI and you can easily define sources, targets, and mappings to move data in a few minutes. Do you need a course? No. Of course it would accelerate your understanding of how best to use the tool. You'll have to decide that for yourself.
.......

Rating

  (42 ratings)

Is this answer out of date? If it is, please let us know via a Comment

Comments

A reader, December 05, 2002 - 1:15 pm UTC

For the benefit of a wider audience, can we have a more basic discussion here? Sorry if my questions are too naive.

What is the purpose of these ETL tools in the first place? For loading data into datawarehouse databases(whether oracle or any other d/b), each of the databases have provided tools that are most appropriate for that d/b (that is what they claim anyway. Tom has repeatedly said he prefers sqlldr to C + OCI calls for dataloads). Similarly extracting data from the database to flat files also is supported by all the databases. Whatmore, many databases even offer connectivity to other databases using interfaces like ODBC or transparent gateways. Mainframes and other Cobol type systems too can download data to Text files. And for transforming data, what better tool than SQL? Or if you want you can do it on a Unix platform using sed/awk etc most efficiently. And Oracle database has a vast array of features like MVs, dimensions, Transportable Tablespaces, sqlldr with direct and parallel etc which are great tools for extraction transformation and loading. I don't really know if an ETL tool can do anything more efficiently.

I have not used any of the ETL tools myself. But I have worked on a couple of datawarehouse projects where they just 'play cat and mouse' with the data using ETL tools. On many occasions it just adds up the cost and cause delays. I have seen data extracted into text files from and Oracle ODS database using an ETL tool and loaded into another Oracle warehouse database using sqlldr. I have also seen cases where ETL tools directly interfacing with the database using OCI. But I have not seen a situation where a ETL tool was really indespensable.

I heard that Oracle's ETL tool (warehouse builder) is now made part of Oracle 9i database as the 'ETL engine'. I have no clue what that means!. Can someone throw some light on this whole ETL thing. Even after working on 3 datawarehouse projects for nearly 3 years, I don't really understand the rationale of their existence! I must be really stupid!


Tom Kyte
December 06, 2002 - 6:54 am UTC

Here is my 2 cents -- yes, any tool can get in the way, any tool can be useful -- it is 100% a two way street there. Taken to illogical extremes, anything can be bad like that.

On of the important things in a data warehouse is documentation, "where did this bit of data come from", "what was done to this data over time", "why do you give me X as the answer when this other thing gives me Y as the answer". In order to answer those questions AND keep all of the books as to what comes from where and when - I think you'll find OWB pretty good at that. It is to the point where you can be sitting looking at a report and because our stuff knows about OWB -- you can click on a data element in the report and trace its lineage back through the system to see where it came from. So, I can see this data is actually merged data from two systems where system1 was added to system2 and system1 was my mainfram VSAM files and system2 was some relational data from my new web interface on the internet. This is more then just a "neat" function -- this will save the analyst hours when asked to justify their numbers. This will save days verifying the system is getting stuff from where it should be. In short -- one of the predominant "features" of OWB is the repository -- the bookkeeping it does. It automates many of the mundane tasks you would otherwise.

By "ETL" engine we mean much of what you actually said. We have the gateways to everything (part of iAS). That gives us great "E" capability. Inside the database with things like merge, multi-table inserts, external tables, pipelined functions, transportable tablespaces and the like -- we have the "TL" parts, the transform and load. Whereas most ETL tools take a pure "dump", "transform", "load" approach -- hitting the file system 2, 3, 4 times along the way -- we just extract, tranform and load in as few steps as we can - you even have the ability to do the ETL in one continous step -- no intermediate files, no intermediate steps.


And I asked Derrick Cameron, our resident BI/DW/Tools guru to respond as well and he says:

Firstly, I think it is important to recognize there are a great many
successful data warehouse implementation projects that did not use an
ETL tool. Not surprising considering ETL tools are still early in the
evolution of this technology. I totally agree about the use of
sqlloader (actually, external tables now), sql, plsql as the optimal way
to load and process data in an Oracle database, as well as your other
comments about database features. I too have worked on data warehousing
projects without the use of an ETL tool so your comments have merit.

So what does an ETL tool, and in particular OWB, bring to a data
warehousing project? Many good things.

1. Automated code generation. OK, we're coders. We can do that.
There's more to it than that. In a team environment you varying levels
of expertise, techniques, and approaches, etc. OWB generates all this
code (sql, plsql, tcl scripts, etc.) in a standardized way that so
implementation and maintenance is consistent, and much easier to
maintain, than trying to figure out how someone else coded the
transformations. Some projects may have only semi technical staff and
coding may be difficult for them. A non-issue with OWB. It is also
quite common for development staff to continue to do things the way they
know how rather than adopting the latest best practices, such as
leveraging the 9i ETL functions such as external tables, merge
processing, pipelined transformations, etc. To address one of the
questions about OWB and the 9i ETL engine - the point is OWB generates
code that takes advantage of database ETL features such as merge, multi
table insert, external tables, etc. The OWB development group work VERY
closely with the server technologies group to ensure optimal techniques
are used. Its a very collaborative development environment. They are
right up to date with the latest server features, leveraging all the
things you raise in your comments, but implemented consistently in a
standardized way.

2. Utilities. In our custom development project we built a whole
infrastructure around process management and control. I'm talking about
auditing the load jobs, monitoring, reporting, tracking, etc. jobs and
processes. This is part of what a tool brings to a project. It's one
thing to code a transformation and another to log these and know when
the jobs were run, how long they took, what the results were, how the
exceptions were handled, etc. In a large production warehouse these
become important.

3. Documentation. Ever used Excel spreadsheets to document flows from
source to target? What a pain. I have. Dreary work. Never
maintained. OWB provides extensive reporting of all the metadata in the
repository, including lineage tracking. Better yet, end users can drill
from a workbook in Discoverer (end user ad hoc query tool) back to the
source - target mapping to determine where the data came from. This is
not developed for each workbook but come as a result of building the
mappings in the tools and exploiting the tool integration.

4. This brings me to inter-tool integration. ETL is not the end goal.
It's getting information to end users. Without an ETL tool you start
from scratch in reporting. OWB allow you to build your reporting End
User metadata layer (EUL) as you develop ETL processes. In fact,
updates to the EUL are now (OK, maybe in a month) real time. The saves
a lot of effort building the business metadata reporting layer. This
also extends to OLAP. OWB generates multidimensional OLAP Cubes in
Oracle 9i, and provides the support for cube loading and OLAP metadata.
Again, you're building from scratch there without the ETL tool. Same
goes for Designer. We're really just scratching the surface here. The
bottom line - Datawarehousing is a lot more than just ETL, and tools
(those that are integrated that is), save a lot of development and
maintenance time and effort down stream.

Maybe two or three years ago a lot of this was not there. Its come a
long way and should now be a serious consideration for anyone building a
data warehouse.

How Oracle Warehouse Builder compares to Informatica

Joe Kaeding, December 06, 2002 - 6:24 pm UTC

Great information, but it doesn't answer the question posed. As a resident expert in ETL (Derrick Cameron), I would hope that he should have first hand experience with the major competing tools or at least be versed enough to provide a substantive comparison.

Very good explanation of why an ETL tool

Ramakrishna, January 02, 2003 - 11:32 pm UTC

Dear Tom/Derrick,
Thanks, I too was looking at the ETL tool as just a hindrance to loading data into the warehouse the fastest way. So, the use of an ETL tool like DataStage or Informatica will be justified PROVIDED we use their metadata management and reporting features, right? Unfortunately (at least in DataStage), it is possible to get away with doing just the plain ETL without really capturing all the metadata about data lineage, transformation steps, etc (in fact, there is a separate plugin called MetaBroker which needs to be purchased to do some of this metadata tracking).
It is good that OWB seems to be doing all of this automatically so the metadata is readily available. A great advantage (which the other tools lack) is linking to this metadata from the end-user reports. And, of course, OWB will be optimized for Oracle better than the other tools so it makes sense to use it when the db is Oracle.

regards
Ramakrishna

Fantastic discussion...

Kashif, April 30, 2003 - 11:42 pm UTC

Thanks to Tom and Derrick for the fantastic discussion on the merits of an ETL tool, as compared to homegrown ETL scripts. I myself have never really used an ETL tool (we work off only pl/sql and shell scripts, and it works fine for us), but I can see the value in using one. Especially as the market for BI matures, it will become even more important to have documentation and information about the integrity and source of your data, based on which you make multiple analyses and decisions, and an ETL tool can provide just that. Thanks again.

Kashif

I was exactly looking for this kind of information

Sami, May 13, 2003 - 8:22 am UTC

Thanks Tom & Derrick Cameron.

OWB

judy, July 15, 2003 - 10:38 pm UTC

I was convinced to use the OWB and I recommended my company to purchase the OWB. Now I have the OWB and I have hard time to use it. It has many components and the documentation is like hundreds of pages. Can you point to me a good demo or tutoring step by step how to use it?

I have created my Star schema using ERwin and I tried to import the meta data to the repository and failed.

Any recommendation for a good training sites will be appreciated.

Tom Kyte
July 17, 2003 - 11:18 am UTC

ETL To migrate..

vj, October 29, 2003 - 12:22 am UTC

I am new to Oracle Warehouse builder..

i wanted to know if Oracle Warehouse Builder can be used as an ETL tool in order to migrate data from DB2 to Oracle. i want some assistance in gathering more information on how to use Oracle Warehouse Builder and achieve this..

Any help highly appreciated

Tom Kyte
October 29, 2003 - 6:38 am UTC

Yes it can, you have gateways to pretty much all relational and many non-relational data stores. OWB works with them all. If you have OWB as part of iAS, you have the gateways as well

Q on OWB

Anil Pant, October 30, 2003 - 3:52 am UTC

Hi,
is OWB client 9.2 is supported in Win 98 ?

Tom Kyte
October 30, 2003 - 7:12 am UTC

good question for support -- i've never run win98, no experience with it.

Q on OWB

A reader, October 30, 2003 - 11:58 pm UTC

Hi,
Can anyone tell me does OWB 9.2 client runs on Win98 ?

Tom Kyte
October 31, 2003 - 7:37 am UTC

support says......


O/S Information:
Certification of this product on MS Windows 98 is not planned.

Licensing of OWB

Jasbir Kular, November 04, 2003 - 10:14 am UTC

Is OWB included with the Enterprise Oracle Database license?

Tom Kyte
November 04, 2003 - 10:23 am UTC

in 10g, it'll be.

in 9i timeframe, it is part of iAS licensing

Licensing of OWB

Jasbir Kular, November 04, 2003 - 11:00 am UTC

Do you mean iDS instead of iAS?

Tom Kyte
November 04, 2003 - 4:06 pm UTC

hmm, doh, yes -- ids, sorry about that.

in 10g, DB
in 9i and before ids

BI components?

A reader, March 20, 2004 - 9:43 am UTC

Tom,
When and in what order might one be using
Oracle Warehouse Builder, Oracle Discoverer, Oracle Data Mining, Oracle BI Beans
I have seen documentation which explains each of the above. I would like to all how all of the above would fit in or be used as an integrated tool.

Thank you

Tom Kyte
March 20, 2004 - 10:13 am UTC

OWB -> builds the warehouse that
end users query with Disco
power users use jdev with data mining/bi beans wizards to mine
developers build olap apps against with jdev/bi beans

How can i Install oracle in Windows XP

Haron, March 20, 2004 - 7:57 pm UTC

Hi tom,

Tom,please help me how to install oracle 8i and 8 and forms in windows XP thanks

Tom Kyte
March 21, 2004 - 9:52 am UTC

please contact support for installation and configuration issues (but good luck with really old software on really new os's -- if you want a bit of "old" you have to go "all old" like windows NT as both 8i and 8 predate XP by many many years)

Dave, March 20, 2004 - 9:27 pm UTC

Haron, help yourself by reading the documentation

How to install oracle 8i and 8 and forms in Windows XP

Haron Palala, March 21, 2004 - 8:36 pm UTC

hI TOM,

Please help me how can i install ORACLE 8 AND 8I AND FORMS IN WINDOWS XP THANKS A LOT

Lincencing OWB

Martin Guillen, January 03, 2005 - 9:45 am UTC

Tom:
It seems that we need and Oracle Database EE license AND and DS license in order to use OWB. I have read:
</code> http://www.oracle.com/technology/products/warehouse/htdocs/oracle_warehouse_builder_10g_faq.htm#179 <code>
Could it be an error?

Haron:
I don't want to be crude but this is not a support site. You will not get information on support issues. Not even if you write upper case.
In metalink.oracle.com there is people that work on what you are looking for.
Good luck,
Martin.

Tom Kyte
January 03, 2005 - 9:58 am UTC

it is saying "you need a EE database (not much of a data warehouse without EE)" and the developer suite, that looks correct.

Oracle 9i

Balaa, February 11, 2005 - 6:16 am UTC

Hi tom

It is posible to load oracle9i in windows98 ?

Tom Kyte
February 11, 2005 - 8:18 pm UTC

windows98 is a single user, desktop operating system. it is not a server platform.

You might have been able to get personal edition to load way back when (no server capabilities really), but given that MS itself doesn't support this, neither do we.

I believe it would have worked for personal edition, but I never used that OS so I cannot verify. As a server -- no, not at all.

A reader, February 11, 2005 - 6:19 am UTC


Further Information

John Gilmore, May 11, 2005 - 4:28 am UTC

I'm somewhat new to OWB and I was wondering if you or Derrick know of any sources for further information on this product. Something like Ask Tom but specific to OWB issues, perhaps a forum or such. I've read the OWB User's Guide but I find that it doesn't always go into sufficient depth.

For example, I'm unsure how to use advanced joins such as correlated subqueries, existential queries etc. Suppose I want to include a table of statements in my join but only want the most recent statement for each account to appear in the result set. In SQL I could use something like the following.

select ...
from accounts a, statements s
where a.acct_num = s.acct_num
and s.stmt_num = (
select max(stmt_num)
from statements inner_s
where inner_s.acct_num = s.acct_num)

How would you do something like this in OWB without resorting to, say, a view of the most recent statements for all accounts?

Tom Kyte
May 11, 2005 - 7:32 am UTC

otn.oracle.com -> discussion forums

Forums Home
Technologies
Data Warehousing and Business Intelligence

Datawarehouse Migration

Navaz, May 17, 2005 - 3:08 am UTC

My client oracle 8i database but they are using sybase IQ with DataStage 4.2(ETL) , Now they are going migrate to oracle 9i. Now my client is asking whether they can use the same DataStage 4.2 for build datawarehouse. Please explain me .

Tom Kyte
May 17, 2005 - 8:37 am UTC

I'd ask the makers of datastage what products they support?

OWB is quite misunderstood...

Nicholas Goodman, June 24, 2005 - 12:40 pm UTC

I can't tell you the number of times that I've seen very smart, capable, Oracle DBAs and Developers scratching their heads when trying to get going with OWB. Oracle needs to improve the ease of "getting going" and is making headway in their Paris release, but still smart people shouldn't be perplexed.

I've worked with OWB extensively (I'm a consultant specializing in OWB/Oracle DW/BI) and find it is one of the absolute best ETL tools in the market. Why is it best? Because it provides so much VALUE for it's price. Informatica/DataStage are all REALLY expensive; OWB is almost "free" at 5k/seat if you're already deploying your database on Oracle.

Informatica/Datastage are engines, as opposed to code generators. They retrieve data from servers X and Y, perform a transformation on their server, and then send the data back over to X or Y or Z, depending. This is the "PERL" equivalent to ETL; you are writing a specification that will be interepretted by their engines at runtime to process data. OWB is more like C++ in some respects. You've built your ETL and then it generates "PL/SQL" code that is "compiled" and run in the database (not precisely, but a close comparison).

The questions re: why use an ETL tool for moving data from X to Y. Sure, you get metadata but that's not the strong suit of the tool. If all you REALLY need to do is the Extract(E) and the Load(L) part of the data movement there will be quicker non-OWB methods (transportable tablespaces anyone?). However, when you get into the Transformation(T) business, then you start seeing immense value with the tool. Making 100 normalized tables into one cube and 4 dimensions is a real pain by hand. :)

If you're processing a few million records daily that "weigh" approximately 50GB, what circumstance would you prefer? Inf/DS pulling 50GB over the wire to their server, performing a value add transformation, and then returning the data over the wire. (100GB for one transformation).
-or-
Sending a small (20k) PL/SQL package to the database and having the transformation run right next to the data inside of Oracle.

OWB has a huge architectural advantage as long as your going to use Oracle for your database anyhow.

re: correlated subqueries. They are not supported in mapping building, but can be accomplished through views. Sorry bout that. :(

re: Quickstart. There is not, unfortunately, immense amount of resources for getting up and running quickly. Perhaps Oracle should start distributing a fully "cooked" OWB/DW/Disco stack as part of their desktop data centers. I'm working with two customers that have called me after their first attempts have been difficult. And these are smart smart DBAs and developers.

re: resources. Check out rittman.net. Mark Rittman regularly posts good information on OWB; always practical. Here are some notes for a simple installation of OWB from my PSOUG presentation. Perhaps you'll find them helpful. </code> http://www.bayontechnologies.com/bt/blog/archives/2005/02/psoug_owb_lectu.php <code>


Core OWB free :)

Duke Ganote, July 14, 2006 - 12:51 pm UTC

Someone recently told me that "Paris" -- at least, the 10gR2 OWB Core ETL -- is free as of May 2006 for those with Oracle licenses. That appears to be true:
</code> http://www.oracle.com/technology/products/warehouse/htdocs/owb_10gr2_faq.htm#IsWarehouseBuilder10gRelease2nowDB <code>
I've downloaded it, installed it on my laptop, leafed through Mark Rittman's blog, etc., and started through the "by example" webpages.

I am curious how OWB sets up error trapping on 9i database, though.. the few examples I've read show the resulting code is set-based (SQL oriented) PL/SQL, but don't show any exception clauses.

Tom Kyte
July 14, 2006 - 1:15 pm UTC

exceptions propagate up to the client - as all good code should let them do.

It is very bad to catch exceptions you are NOT expecting! You let unhandled (un-expected) exceptions cause the process to fail. Errors go to top level - where they can be reported on.

Catching them at the low level would only cause them to be "hidden" from the client - the most important thing to not withhold them from.

exception tripping

Duke Ganote, July 14, 2006 - 2:36 pm UTC

All-or-nothing, that's how I always wrote ETL before my present position. I just didn't know if there was an OWB option for "DML error logging" like 10gR2 SQL, SQL*LOADER, and row-by-row/slow-by-slow.

row-by-row fail over

Duke Ganote, July 20, 2006 - 3:17 pm UTC

I found this in a description of OWB performance:

"Needless to say, the set based operating mode is faster in all cases, while the row based operating mode gives more control to the user, both in terms of logging row-by-row processing information as well as in terms of control over the management of incorrect data.

"The default operating mode is the set based, fail over to row based mode in which Warehouse builder will attempt to use the better-performing set-based operating mode, but will fall back to the more granular rowbased mode if data errors are encountered. This mode allows the user to get the speed of set based processing but when an unexpected error occurs it allows you to log these errors."

</code> http://www.oracle.com/technology/products/warehouse/pdf/OWBPerformanceWP.pdf <code>

Tom Kyte
July 22, 2006 - 5:22 pm UTC

that paper predates the release of 10gR2 by a long long long time.

OWB

Dave Thompson, November 16, 2006 - 6:21 am UTC

I've just had it confirmed by the support guys at Metalink that OWB 10gR2 does not support DML Error logging as in the 10gR2 database.

This makes it difficult to do large set based etl work with OWB 10gR2.

I find this amazing.

How we need to deploy the mapping

Saravanan Ramalingam, December 15, 2006 - 7:29 am UTC

Hi Tom,

How the deployment will be in OWB? do we need to deploy the mapping or do we need to deploy the generated code??. Normally in other ETL tools, we will deploy the mapping directly in their own application server. How about in OWB.


Thanks

Tom Kyte
December 15, 2006 - 11:39 am UTC

OWB predominantly creates plsql - which is in the database.

Alberto Dell'Era, December 15, 2006 - 12:20 pm UTC

A mapping is a pl/sql package (or maybe the "module" is, don't remember exactly) - when you hit "deploy" the package is automatically installed for you in the target database.

You can also inspect the generated package by hitting "generate", and you can see the actual code - excellent for learning or debugging purposes.

I've played with OWB this weekend to build a demo, and even if I had very limited previous experience with it (so got a bit lost into the graphical objects initially) - once I understood how to map (sorry for the pun) the graphical objects to the generated pl/sql code, everything became crystal clear - since I was able to map (pun, again) my pl/sql knowledge with the new environment. To me, this has been a real winning point - low learning curve, reuse of knowledge.

How we need to deploy the mapping

Saravanan Ramalingam, December 19, 2006 - 10:54 am UTC

Thanks Tom,

Still i have a question here.

My understanding is that OWB generates PL/SQL code and that code will get deployed in the server.

The same thing can be achieved by writing our own PL/SQL code.

In this case OWB seems to be a code generator. Is there any other major advantage in using OWB rather than writing hand-code.


Regards,

Saravanan Ramalingam



Tom Kyte
December 19, 2006 - 10:57 am UTC

the main use of a tool to do ETL is in the documentation...

it is all about maintainability, understanding, getting up to speed quickly.




Is this an offense

Saravanan Ramalingam, December 20, 2006 - 2:28 am UTC

Tom,

Will it be an offense by any way, if I modify the code which was generated by OWB.


Regards,

Saravanan Ramalingam

Tom Kyte
December 20, 2006 - 8:16 am UTC

you can do whatever you like, but OWB will consistently output the code it does - without your modifications.

(so it would not be a really good idea to modify it...)

Rollback; commit; together in Exception block of OWB

Dheeraj, November 14, 2007 - 7:59 am UTC

Hi Tom,

I was going thru. one of the OWB generated PL-SQL codes for an OWB mapping and found following snippet:
-----------------------------------------------------------
EXCEPTION WHEN OTHERS THEN
ROLLBACK;
COMMIT;
RETURN FALSE;
END;
-----------------------------------------------------------

Just wanted to highlight if its a bug in OWB as rollback and commit together doesn't make any sense. I am using Oracle 9i Developer Suite - ORA_OWB903_HOME.

Thanks,

Dheeraj

It was very useful

Manu Sharma, September 12, 2008 - 7:19 am UTC

Hi Tom,

It was very useful for me knowing the OWB's working. Now problem is that we have to choose between Sybase IQ and OWB to be installed in our systems. Can you please tell me in the light of the achievement of Sybase IQ with the Sun SPARC server, how can we compare it with OWB if we choose to go with OWB.
Tom Kyte
September 16, 2008 - 1:19 pm UTC

... Now problem is that we
have to choose between Sybase IQ and OWB to be installed in our systems. ...


Interesting.

OWB is an 'extract transform load' (ETL) tool.
OWB is not a query tool, it is not a database.

Sybase IQ is not an ETL tool.
Sybase IQ is a query tool.


So, I'm utterly at a loss to compare the two. They are not even remotely comparable.


and what "achievement" are you talking of?

reader

A reader, October 01, 2008 - 1:47 pm UTC

Tom,

Is there a good newsgroup for Oracle Business Intelligence Enterprise Edition (OBIEE) questions formarly seibel

Thanks
Tom Kyte
October 01, 2008 - 2:19 pm UTC

Forum: Oracle BI Suite Enterprise Edition

http://forums.oracle.com/forums/forum.jspa?forumID=378

reader

A reader, October 01, 2008 - 5:09 pm UTC

Thanks

OWB is small yet powerful!

Ankur, November 29, 2009 - 8:48 am UTC

I agree with what Nicholas says. ETL tools are inherently slow. The prime disadvantage of Inf/DS is the heavy data flow. As said earlier, Informatica performs all data on its server and then pushes the resultant to the destination server. Trust me for an ETL scenario where you are dealing with 80 million records everyday and a daily mapping referrring to these records, the Informatica's or any other tool's lame strategy to perform calculations at their end is a real pain in ass!
When you can perform all the normal DML on the database end itself, performing it on the ETL server side seems silly to me.



owb and sql injunction problem

A Reader, November 30, 2009 - 11:17 am UTC

Hi Tom,

I have worked with owb. I would like to inform you have there is a huge bug in oracle warehouse builder from 10 to 11g.

I was working with one of our clients on a telco project where i find consistently that shared pool gets invalidated after some time. I searched the cause of the problem and find out that oracle warehouse builder 11g does not have bind variables in its sql. I open a support request on metalink and after monthes of arguments with oracle support they have submitted it as a bug.

Tom, why is this kind of things are happening does your people does not follow the teachings you preached.


owb sql injunction problem

A reader, December 05, 2009 - 2:27 pm UTC

Hi,

here is bug id

Bug 9114410: ORACLE 11G OWB SQL INJUNCTION PROBLEM


The bug is also in oracle 11g release 2

sql injuncation in owb

A reader, December 05, 2009 - 2:30 pm UTC

also this bug

Bug 8474786: PROCESS FLOW VARIABLE CODE GENERATES PL/SQL BLOCKS THAT HAVE TO BE HARD PARSED


both of these bugs seriously hit the performance owb so take care if you use owb

if I can add my 2 cents ...

Jiri, March 22, 2010 - 8:00 pm UTC

interesting reading ...

I used Informatica 8, OWB, MS DTS, even custom coding with pretty solid auditing mechanism....

I wish I could plot a graph here, X would be ETL speed and Y would be easy to understand code. There is a correlation.


Custom code is the fastest for sure and you can better utilize features for database you use, tools like Informatica are going to be slower simply because they usually need separate server between and network traffic is going to slow it down. Informatica is very robust and widely used in very large organization simply because you might have 50 ETL developers and they need to quickly look at the code and be able to troubleshoot. You cannot do this with custom code no matter how pretty your packages look.

If you have sources all over the place (from ascii flat files, through mainframe to HTTP, XML and PDF files) and large warehouse(s) on several servers then Informatica is going to be good choice. What I don't like on Informatica is that is a bit too too generic.

If you have straightforward ETL mostly from flat files to ODS and marts and they are all on one Oracle server or RAC, then OWB is probably going to work faster and you will be very happy with single easy to use platform managed by same DBAs with features tuned on Oracle database.

I had an interview few years ago with a company who used informatica, when I told them that we used custom PL/SQL packages instead of informatica because source and target was on the same RAC and thus moving 100+mm records with complex transformation from one schema to another was way faster than informatica (which we had in house) they said that I use custom pl/sql for "job security"... no I did not take that job and I lost respect to those guys.


Informatica vs OWB

Babjee, April 20, 2010 - 7:08 am UTC

Both OWB and Informatica have a similar common goal, and
as such would have some common characteristics.In many ways OWB should be relatively easier due to its support for a single
target - that is the Oracle database (in fact flat files too, but not other
dbs). This reduces ( and optimizes) the possible ways it would generate code,
which would simplify diagnostic and deployment activities.

But in the performance,ease of use prospective Informatica is far better than OWB.example usage of expression palette in OWB is bit hectic( you have to write the Output variable list).

Through OWB we cannot expect the same result always. Some times it gives different results for the same mapping after couple of runs.

I would prefer informatica than OWB.

Tom Kyte
April 20, 2010 - 8:15 am UTC

... Through OWB we cannot expect the same result always. Some times it gives
different results for the same mapping after couple of runs.
...

I'm sorry, but the software is deterministic.


I hate slow by slow processing as does anyone that has worked with a database of more than a few records. Informatica epitomizes slow by slow processing - every single time I encounter it - it is the cause of slowness (true, I tend these days to only encounter things when they are giving a problem - but still...)

OWB, Informatica, Custom ETL

Tom Clark, April 20, 2010 - 9:50 am UTC

These 2 products produce the same result but in very different manners. I believe that OWB is SQL/PL SQL based and Informatica type tools are file and C based. For example if you want to move data from one Oracle database to another (mapping), Informatica type of tools will unload the table in parallel to files. It will then perform any operations against the data in the files in parallel and then load each of the files utilizing multiple sql loaders to the new database. I believe that OWB would use straight SQL/PL SQL to accomplish the same task. I assume you could allow for parallel and parallel DML to assist with performance of the OWB. My point is that the 2 tools have very different implementations to accomplish the same task.
I have found that neither is as cheap as sales people want you to believe. They sell the products as if you will no longer need an ETL programmer. I have found not only do I still need an ETL programmer but I have to pay for an expensive tool also (Informatica type tools, I know OWB is free with DB). I have to pay for training, learn a new scripting language, consulting fees from the company… I believe that at one time the ETL tools had merit and that was before we had the ability to have queries run parallel, did not have NO LOGGING techniques … I have found that the cost benefit of the products do not hold when compared to good old custom ETL coding. Good Link for OWB: http://www.rittmanmead.com/articles/

Tom Kyte
April 20, 2010 - 10:02 am UTC

... I believe that OWB would use straight SQL/PL SQL to accomplish
the same task....


correct, in the ETL process flow, OWB tries to skip the unnecessary E and L steps altogether.


... They sell the products as if you will no longer need an ETL programmer. ...

I hope not, when I talk about them - it is from the perspective that this is for your large development team - it is a book-keeper plain and simple. If you have one guy that does the ETL in their spare time on wednesdays - you don't want this. If you have a team of more than a few people that will be doing long term work that has to be maintained over the course of years - then this is a tool you want to look at.


Writing code by hand - given a competent developer - you'll get something fast. It won't be flexible probably, it'll be harder to enhance/to change, it'll probably come with minimal (if any) documentation.

That is what ETL tools are good for - being the book-keeper, helping with understanding what has taken plan, being flexible.


See my first followup for a more in depth statement of this...

jiri, April 21, 2010 - 5:55 pm UTC

"I have found that the cost
benefit of the products do not hold when compared to good old custom ETL coding"

ouch to that, custom ETL is going to be probably faster but not easier to understand. If you have small company then you are going to be OK with custom tool, with many developers, outsourced production support, ... the cost benefit is going to change very quickly and you will simply have to use some sort of framework - informatica, owb, pentaho, ... make this framework and the troubleshooting, changes and enhancements easier.

OWB is not as robust as Informatica, but if your data warehouse sits on Oracle, then it can be faster and much cheaper and more intuitive due to PL/SQL backend.



If you need speed...

Vinay, August 21, 2014 - 3:41 pm UTC

If you need speed, use custom coded SQL/PLSQL. Simple. Informatica just cannot match the performance of direct SQL. I say this after having worked on data integration for years now. If you have massive data volumes and a tight "time window" within which your processing needs to finish, Informatica is certainly not the tool for the job.