A reader, December 05, 2002 - 1:15 pm UTC
For the benefit of a wider audience, can we have a more basic discussion here? Sorry if my questions are too naive.
What is the purpose of these ETL tools in the first place? For loading data into datawarehouse databases(whether oracle or any other d/b), each of the databases have provided tools that are most appropriate for that d/b (that is what they claim anyway. Tom has repeatedly said he prefers sqlldr to C + OCI calls for dataloads). Similarly extracting data from the database to flat files also is supported by all the databases. Whatmore, many databases even offer connectivity to other databases using interfaces like ODBC or transparent gateways. Mainframes and other Cobol type systems too can download data to Text files. And for transforming data, what better tool than SQL? Or if you want you can do it on a Unix platform using sed/awk etc most efficiently. And Oracle database has a vast array of features like MVs, dimensions, Transportable Tablespaces, sqlldr with direct and parallel etc which are great tools for extraction transformation and loading. I don't really know if an ETL tool can do anything more efficiently.
I have not used any of the ETL tools myself. But I have worked on a couple of datawarehouse projects where they just 'play cat and mouse' with the data using ETL tools. On many occasions it just adds up the cost and cause delays. I have seen data extracted into text files from and Oracle ODS database using an ETL tool and loaded into another Oracle warehouse database using sqlldr. I have also seen cases where ETL tools directly interfacing with the database using OCI. But I have not seen a situation where a ETL tool was really indespensable.
I heard that Oracle's ETL tool (warehouse builder) is now made part of Oracle 9i database as the 'ETL engine'. I have no clue what that means!. Can someone throw some light on this whole ETL thing. Even after working on 3 datawarehouse projects for nearly 3 years, I don't really understand the rationale of their existence! I must be really stupid!
December 06, 2002 - 6:54 am UTC
Here is my 2 cents -- yes, any tool can get in the way, any tool can be useful -- it is 100% a two way street there. Taken to illogical extremes, anything can be bad like that.
On of the important things in a data warehouse is documentation, "where did this bit of data come from", "what was done to this data over time", "why do you give me X as the answer when this other thing gives me Y as the answer". In order to answer those questions AND keep all of the books as to what comes from where and when - I think you'll find OWB pretty good at that. It is to the point where you can be sitting looking at a report and because our stuff knows about OWB -- you can click on a data element in the report and trace its lineage back through the system to see where it came from. So, I can see this data is actually merged data from two systems where system1 was added to system2 and system1 was my mainfram VSAM files and system2 was some relational data from my new web interface on the internet. This is more then just a "neat" function -- this will save the analyst hours when asked to justify their numbers. This will save days verifying the system is getting stuff from where it should be. In short -- one of the predominant "features" of OWB is the repository -- the bookkeeping it does. It automates many of the mundane tasks you would otherwise.
By "ETL" engine we mean much of what you actually said. We have the gateways to everything (part of iAS). That gives us great "E" capability. Inside the database with things like merge, multi-table inserts, external tables, pipelined functions, transportable tablespaces and the like -- we have the "TL" parts, the transform and load. Whereas most ETL tools take a pure "dump", "transform", "load" approach -- hitting the file system 2, 3, 4 times along the way -- we just extract, tranform and load in as few steps as we can - you even have the ability to do the ETL in one continous step -- no intermediate files, no intermediate steps.
And I asked Derrick Cameron, our resident BI/DW/Tools guru to respond as well and he says:
Firstly, I think it is important to recognize there are a great many
successful data warehouse implementation projects that did not use an
ETL tool. Not surprising considering ETL tools are still early in the
evolution of this technology. I totally agree about the use of
sqlloader (actually, external tables now), sql, plsql as the optimal way
to load and process data in an Oracle database, as well as your other
comments about database features. I too have worked on data warehousing
projects without the use of an ETL tool so your comments have merit.
So what does an ETL tool, and in particular OWB, bring to a data
warehousing project? Many good things.
1. Automated code generation. OK, we're coders. We can do that.
There's more to it than that. In a team environment you varying levels
of expertise, techniques, and approaches, etc. OWB generates all this
code (sql, plsql, tcl scripts, etc.) in a standardized way that so
implementation and maintenance is consistent, and much easier to
maintain, than trying to figure out how someone else coded the
transformations. Some projects may have only semi technical staff and
coding may be difficult for them. A non-issue with OWB. It is also
quite common for development staff to continue to do things the way they
know how rather than adopting the latest best practices, such as
leveraging the 9i ETL functions such as external tables, merge
processing, pipelined transformations, etc. To address one of the
questions about OWB and the 9i ETL engine - the point is OWB generates
code that takes advantage of database ETL features such as merge, multi
table insert, external tables, etc. The OWB development group work VERY
closely with the server technologies group to ensure optimal techniques
are used. Its a very collaborative development environment. They are
right up to date with the latest server features, leveraging all the
things you raise in your comments, but implemented consistently in a
standardized way.
2. Utilities. In our custom development project we built a whole
infrastructure around process management and control. I'm talking about
auditing the load jobs, monitoring, reporting, tracking, etc. jobs and
processes. This is part of what a tool brings to a project. It's one
thing to code a transformation and another to log these and know when
the jobs were run, how long they took, what the results were, how the
exceptions were handled, etc. In a large production warehouse these
become important.
3. Documentation. Ever used Excel spreadsheets to document flows from
source to target? What a pain. I have. Dreary work. Never
maintained. OWB provides extensive reporting of all the metadata in the
repository, including lineage tracking. Better yet, end users can drill
from a workbook in Discoverer (end user ad hoc query tool) back to the
source - target mapping to determine where the data came from. This is
not developed for each workbook but come as a result of building the
mappings in the tools and exploiting the tool integration.
4. This brings me to inter-tool integration. ETL is not the end goal.
It's getting information to end users. Without an ETL tool you start
from scratch in reporting. OWB allow you to build your reporting End
User metadata layer (EUL) as you develop ETL processes. In fact,
updates to the EUL are now (OK, maybe in a month) real time. The saves
a lot of effort building the business metadata reporting layer. This
also extends to OLAP. OWB generates multidimensional OLAP Cubes in
Oracle 9i, and provides the support for cube loading and OLAP metadata.
Again, you're building from scratch there without the ETL tool. Same
goes for Designer. We're really just scratching the surface here. The
bottom line - Datawarehousing is a lot more than just ETL, and tools
(those that are integrated that is), save a lot of development and
maintenance time and effort down stream.
Maybe two or three years ago a lot of this was not there. Its come a
long way and should now be a serious consideration for anyone building a
data warehouse.
How Oracle Warehouse Builder compares to Informatica
Joe Kaeding, December 06, 2002 - 6:24 pm UTC
Great information, but it doesn't answer the question posed. As a resident expert in ETL (Derrick Cameron), I would hope that he should have first hand experience with the major competing tools or at least be versed enough to provide a substantive comparison.
Very good explanation of why an ETL tool
Ramakrishna, January 02, 2003 - 11:32 pm UTC
Dear Tom/Derrick,
Thanks, I too was looking at the ETL tool as just a hindrance to loading data into the warehouse the fastest way. So, the use of an ETL tool like DataStage or Informatica will be justified PROVIDED we use their metadata management and reporting features, right? Unfortunately (at least in DataStage), it is possible to get away with doing just the plain ETL without really capturing all the metadata about data lineage, transformation steps, etc (in fact, there is a separate plugin called MetaBroker which needs to be purchased to do some of this metadata tracking).
It is good that OWB seems to be doing all of this automatically so the metadata is readily available. A great advantage (which the other tools lack) is linking to this metadata from the end-user reports. And, of course, OWB will be optimized for Oracle better than the other tools so it makes sense to use it when the db is Oracle.
regards
Ramakrishna
Fantastic discussion...
Kashif, April 30, 2003 - 11:42 pm UTC
Thanks to Tom and Derrick for the fantastic discussion on the merits of an ETL tool, as compared to homegrown ETL scripts. I myself have never really used an ETL tool (we work off only pl/sql and shell scripts, and it works fine for us), but I can see the value in using one. Especially as the market for BI matures, it will become even more important to have documentation and information about the integrity and source of your data, based on which you make multiple analyses and decisions, and an ETL tool can provide just that. Thanks again.
Kashif
I was exactly looking for this kind of information
Sami, May 13, 2003 - 8:22 am UTC
Thanks Tom & Derrick Cameron.
OWB
judy, July 15, 2003 - 10:38 pm UTC
I was convinced to use the OWB and I recommended my company to purchase the OWB. Now I have the OWB and I have hard time to use it. It has many components and the documentation is like hundreds of pages. Can you point to me a good demo or tutoring step by step how to use it?
I have created my Star schema using ERwin and I tried to import the meta data to the repository and failed.
Any recommendation for a good training sites will be appreciated.
July 17, 2003 - 11:18 am UTC
ETL To migrate..
vj, October 29, 2003 - 12:22 am UTC
I am new to Oracle Warehouse builder..
i wanted to know if Oracle Warehouse Builder can be used as an ETL tool in order to migrate data from DB2 to Oracle. i want some assistance in gathering more information on how to use Oracle Warehouse Builder and achieve this..
Any help highly appreciated
October 29, 2003 - 6:38 am UTC
Yes it can, you have gateways to pretty much all relational and many non-relational data stores. OWB works with them all. If you have OWB as part of iAS, you have the gateways as well
Q on OWB
Anil Pant, October 30, 2003 - 3:52 am UTC
Hi,
is OWB client 9.2 is supported in Win 98 ?
October 30, 2003 - 7:12 am UTC
good question for support -- i've never run win98, no experience with it.
Q on OWB
A reader, October 30, 2003 - 11:58 pm UTC
Hi,
Can anyone tell me does OWB 9.2 client runs on Win98 ?
October 31, 2003 - 7:37 am UTC
support says......
O/S Information:
Certification of this product on MS Windows 98 is not planned.
Licensing of OWB
Jasbir Kular, November 04, 2003 - 10:14 am UTC
Is OWB included with the Enterprise Oracle Database license?
November 04, 2003 - 10:23 am UTC
in 10g, it'll be.
in 9i timeframe, it is part of iAS licensing
Licensing of OWB
Jasbir Kular, November 04, 2003 - 11:00 am UTC
Do you mean iDS instead of iAS?
November 04, 2003 - 4:06 pm UTC
hmm, doh, yes -- ids, sorry about that.
in 10g, DB
in 9i and before ids
BI components?
A reader, March 20, 2004 - 9:43 am UTC
Tom,
When and in what order might one be using
Oracle Warehouse Builder, Oracle Discoverer, Oracle Data Mining, Oracle BI Beans
I have seen documentation which explains each of the above. I would like to all how all of the above would fit in or be used as an integrated tool.
Thank you
March 20, 2004 - 10:13 am UTC
OWB -> builds the warehouse that
end users query with Disco
power users use jdev with data mining/bi beans wizards to mine
developers build olap apps against with jdev/bi beans
How can i Install oracle in Windows XP
Haron, March 20, 2004 - 7:57 pm UTC
Hi tom,
Tom,please help me how to install oracle 8i and 8 and forms in windows XP thanks
March 21, 2004 - 9:52 am UTC
please contact support for installation and configuration issues (but good luck with really old software on really new os's -- if you want a bit of "old" you have to go "all old" like windows NT as both 8i and 8 predate XP by many many years)
Dave, March 20, 2004 - 9:27 pm UTC
Haron, help yourself by reading the documentation
How to install oracle 8i and 8 and forms in Windows XP
Haron Palala, March 21, 2004 - 8:36 pm UTC
hI TOM,
Please help me how can i install ORACLE 8 AND 8I AND FORMS IN WINDOWS XP THANKS A LOT
Lincencing OWB
Martin Guillen, January 03, 2005 - 9:45 am UTC
January 03, 2005 - 9:58 am UTC
it is saying "you need a EE database (not much of a data warehouse without EE)" and the developer suite, that looks correct.
Oracle 9i
Balaa, February 11, 2005 - 6:16 am UTC
Hi tom
It is posible to load oracle9i in windows98 ?
February 11, 2005 - 8:18 pm UTC
windows98 is a single user, desktop operating system. it is not a server platform.
You might have been able to get personal edition to load way back when (no server capabilities really), but given that MS itself doesn't support this, neither do we.
I believe it would have worked for personal edition, but I never used that OS so I cannot verify. As a server -- no, not at all.
A reader, February 11, 2005 - 6:19 am UTC
Further Information
John Gilmore, May 11, 2005 - 4:28 am UTC
I'm somewhat new to OWB and I was wondering if you or Derrick know of any sources for further information on this product. Something like Ask Tom but specific to OWB issues, perhaps a forum or such. I've read the OWB User's Guide but I find that it doesn't always go into sufficient depth.
For example, I'm unsure how to use advanced joins such as correlated subqueries, existential queries etc. Suppose I want to include a table of statements in my join but only want the most recent statement for each account to appear in the result set. In SQL I could use something like the following.
select ...
from accounts a, statements s
where a.acct_num = s.acct_num
and s.stmt_num = (
select max(stmt_num)
from statements inner_s
where inner_s.acct_num = s.acct_num)
How would you do something like this in OWB without resorting to, say, a view of the most recent statements for all accounts?
May 11, 2005 - 7:32 am UTC
otn.oracle.com -> discussion forums
Forums Home
Technologies
Data Warehousing and Business Intelligence
Datawarehouse Migration
Navaz, May 17, 2005 - 3:08 am UTC
My client oracle 8i database but they are using sybase IQ with DataStage 4.2(ETL) , Now they are going migrate to oracle 9i. Now my client is asking whether they can use the same DataStage 4.2 for build datawarehouse. Please explain me .
May 17, 2005 - 8:37 am UTC
I'd ask the makers of datastage what products they support?
OWB is quite misunderstood...
Nicholas Goodman, June 24, 2005 - 12:40 pm UTC
I can't tell you the number of times that I've seen very smart, capable, Oracle DBAs and Developers scratching their heads when trying to get going with OWB. Oracle needs to improve the ease of "getting going" and is making headway in their Paris release, but still smart people shouldn't be perplexed.
I've worked with OWB extensively (I'm a consultant specializing in OWB/Oracle DW/BI) and find it is one of the absolute best ETL tools in the market. Why is it best? Because it provides so much VALUE for it's price. Informatica/DataStage are all REALLY expensive; OWB is almost "free" at 5k/seat if you're already deploying your database on Oracle.
Informatica/Datastage are engines, as opposed to code generators. They retrieve data from servers X and Y, perform a transformation on their server, and then send the data back over to X or Y or Z, depending. This is the "PERL" equivalent to ETL; you are writing a specification that will be interepretted by their engines at runtime to process data. OWB is more like C++ in some respects. You've built your ETL and then it generates "PL/SQL" code that is "compiled" and run in the database (not precisely, but a close comparison).
The questions re: why use an ETL tool for moving data from X to Y. Sure, you get metadata but that's not the strong suit of the tool. If all you REALLY need to do is the Extract(E) and the Load(L) part of the data movement there will be quicker non-OWB methods (transportable tablespaces anyone?). However, when you get into the Transformation(T) business, then you start seeing immense value with the tool. Making 100 normalized tables into one cube and 4 dimensions is a real pain by hand. :)
If you're processing a few million records daily that "weigh" approximately 50GB, what circumstance would you prefer? Inf/DS pulling 50GB over the wire to their server, performing a value add transformation, and then returning the data over the wire. (100GB for one transformation).
-or-
Sending a small (20k) PL/SQL package to the database and having the transformation run right next to the data inside of Oracle.
OWB has a huge architectural advantage as long as your going to use Oracle for your database anyhow.
re: correlated subqueries. They are not supported in mapping building, but can be accomplished through views. Sorry bout that. :(
re: Quickstart. There is not, unfortunately, immense amount of resources for getting up and running quickly. Perhaps Oracle should start distributing a fully "cooked" OWB/DW/Disco stack as part of their desktop data centers. I'm working with two customers that have called me after their first attempts have been difficult. And these are smart smart DBAs and developers.
re: resources. Check out rittman.net. Mark Rittman regularly posts good information on OWB; always practical. Here are some notes for a simple installation of OWB from my PSOUG presentation. Perhaps you'll find them helpful. </code>
http://www.bayontechnologies.com/bt/blog/archives/2005/02/psoug_owb_lectu.php <code>
Core OWB free :)
Duke Ganote, July 14, 2006 - 12:51 pm UTC
Someone recently told me that "Paris" -- at least, the 10gR2 OWB Core ETL -- is free as of May 2006 for those with Oracle licenses. That appears to be true:
</code>
http://www.oracle.com/technology/products/warehouse/htdocs/owb_10gr2_faq.htm#IsWarehouseBuilder10gRelease2nowDB <code>
I've downloaded it, installed it on my laptop, leafed through Mark Rittman's blog, etc., and started through the "by example" webpages.
I am curious how OWB sets up error trapping on 9i database, though.. the few examples I've read show the resulting code is set-based (SQL oriented) PL/SQL, but don't show any exception clauses.
July 14, 2006 - 1:15 pm UTC
exceptions propagate up to the client - as all good code should let them do.
It is very bad to catch exceptions you are NOT expecting! You let unhandled (un-expected) exceptions cause the process to fail. Errors go to top level - where they can be reported on.
Catching them at the low level would only cause them to be "hidden" from the client - the most important thing to not withhold them from.
exception tripping
Duke Ganote, July 14, 2006 - 2:36 pm UTC
All-or-nothing, that's how I always wrote ETL before my present position. I just didn't know if there was an OWB option for "DML error logging" like 10gR2 SQL, SQL*LOADER, and row-by-row/slow-by-slow.
row-by-row fail over
Duke Ganote, July 20, 2006 - 3:17 pm UTC
I found this in a description of OWB performance:
"Needless to say, the set based operating mode is faster in all cases, while the row based operating mode gives more control to the user, both in terms of logging row-by-row processing information as well as in terms of control over the management of incorrect data.
"The default operating mode is the set based, fail over to row based mode in which Warehouse builder will attempt to use the better-performing set-based operating mode, but will fall back to the more granular rowbased mode if data errors are encountered. This mode allows the user to get the speed of set based processing but when an unexpected error occurs it allows you to log these errors."
</code>
http://www.oracle.com/technology/products/warehouse/pdf/OWBPerformanceWP.pdf <code>
July 22, 2006 - 5:22 pm UTC
that paper predates the release of 10gR2 by a long long long time.
OWB
Dave Thompson, November 16, 2006 - 6:21 am UTC
I've just had it confirmed by the support guys at Metalink that OWB 10gR2 does not support DML Error logging as in the 10gR2 database.
This makes it difficult to do large set based etl work with OWB 10gR2.
I find this amazing.
How we need to deploy the mapping
Saravanan Ramalingam, December 15, 2006 - 7:29 am UTC
Hi Tom,
How the deployment will be in OWB? do we need to deploy the mapping or do we need to deploy the generated code??. Normally in other ETL tools, we will deploy the mapping directly in their own application server. How about in OWB.
Thanks
December 15, 2006 - 11:39 am UTC
OWB predominantly creates plsql - which is in the database.
Alberto Dell'Era, December 15, 2006 - 12:20 pm UTC
A mapping is a pl/sql package (or maybe the "module" is, don't remember exactly) - when you hit "deploy" the package is automatically installed for you in the target database.
You can also inspect the generated package by hitting "generate", and you can see the actual code - excellent for learning or debugging purposes.
I've played with OWB this weekend to build a demo, and even if I had very limited previous experience with it (so got a bit lost into the graphical objects initially) - once I understood how to map (sorry for the pun) the graphical objects to the generated pl/sql code, everything became crystal clear - since I was able to map (pun, again) my pl/sql knowledge with the new environment. To me, this has been a real winning point - low learning curve, reuse of knowledge.
How we need to deploy the mapping
Saravanan Ramalingam, December 19, 2006 - 10:54 am UTC
Thanks Tom,
Still i have a question here.
My understanding is that OWB generates PL/SQL code and that code will get deployed in the server.
The same thing can be achieved by writing our own PL/SQL code.
In this case OWB seems to be a code generator. Is there any other major advantage in using OWB rather than writing hand-code.
Regards,
Saravanan Ramalingam
December 19, 2006 - 10:57 am UTC
the main use of a tool to do ETL is in the documentation...
it is all about maintainability, understanding, getting up to speed quickly.
Is this an offense
Saravanan Ramalingam, December 20, 2006 - 2:28 am UTC
Tom,
Will it be an offense by any way, if I modify the code which was generated by OWB.
Regards,
Saravanan Ramalingam
December 20, 2006 - 8:16 am UTC
you can do whatever you like, but OWB will consistently output the code it does - without your modifications.
(so it would not be a really good idea to modify it...)
Rollback; commit; together in Exception block of OWB
Dheeraj, November 14, 2007 - 7:59 am UTC
Hi Tom,
I was going thru. one of the OWB generated PL-SQL codes for an OWB mapping and found following snippet:
-----------------------------------------------------------
EXCEPTION WHEN OTHERS THEN
ROLLBACK;
COMMIT;
RETURN FALSE;
END;
-----------------------------------------------------------
Just wanted to highlight if its a bug in OWB as rollback and commit together doesn't make any sense. I am using Oracle 9i Developer Suite - ORA_OWB903_HOME.
Thanks,
Dheeraj
It was very useful
Manu Sharma, September 12, 2008 - 7:19 am UTC
Hi Tom,
It was very useful for me knowing the OWB's working. Now problem is that we have to choose between Sybase IQ and OWB to be installed in our systems. Can you please tell me in the light of the achievement of Sybase IQ with the Sun SPARC server, how can we compare it with OWB if we choose to go with OWB.
September 16, 2008 - 1:19 pm UTC
... Now problem is that we
have to choose between Sybase IQ and OWB to be installed in our systems. ...
Interesting.
OWB is an 'extract transform load' (ETL) tool.
OWB is not a query tool, it is not a database.
Sybase IQ is not an ETL tool.
Sybase IQ is a query tool.
So, I'm utterly at a loss to compare the two. They are not even remotely comparable.
and what "achievement" are you talking of?
reader
A reader, October 01, 2008 - 1:47 pm UTC
Tom,
Is there a good newsgroup for Oracle Business Intelligence Enterprise Edition (OBIEE) questions formarly seibel
Thanks
October 01, 2008 - 2:19 pm UTC
reader
A reader, October 01, 2008 - 5:09 pm UTC
Thanks
OWB is small yet powerful!
Ankur, November 29, 2009 - 8:48 am UTC
I agree with what Nicholas says. ETL tools are inherently slow. The prime disadvantage of Inf/DS is the heavy data flow. As said earlier, Informatica performs all data on its server and then pushes the resultant to the destination server. Trust me for an ETL scenario where you are dealing with 80 million records everyday and a daily mapping referrring to these records, the Informatica's or any other tool's lame strategy to perform calculations at their end is a real pain in ass!
When you can perform all the normal DML on the database end itself, performing it on the ETL server side seems silly to me.
owb and sql injunction problem
A Reader, November 30, 2009 - 11:17 am UTC
Hi Tom,
I have worked with owb. I would like to inform you have there is a huge bug in oracle warehouse builder from 10 to 11g.
I was working with one of our clients on a telco project where i find consistently that shared pool gets invalidated after some time. I searched the cause of the problem and find out that oracle warehouse builder 11g does not have bind variables in its sql. I open a support request on metalink and after monthes of arguments with oracle support they have submitted it as a bug.
Tom, why is this kind of things are happening does your people does not follow the teachings you preached.
owb sql injunction problem
A reader, December 05, 2009 - 2:27 pm UTC
Hi,
here is bug id
Bug 9114410: ORACLE 11G OWB SQL INJUNCTION PROBLEM
The bug is also in oracle 11g release 2
sql injuncation in owb
A reader, December 05, 2009 - 2:30 pm UTC
also this bug
Bug 8474786: PROCESS FLOW VARIABLE CODE GENERATES PL/SQL BLOCKS THAT HAVE TO BE HARD PARSED
both of these bugs seriously hit the performance owb so take care if you use owb
if I can add my 2 cents ...
Jiri, March 22, 2010 - 8:00 pm UTC
interesting reading ...
I used Informatica 8, OWB, MS DTS, even custom coding with pretty solid auditing mechanism....
I wish I could plot a graph here, X would be ETL speed and Y would be easy to understand code. There is a correlation.
Custom code is the fastest for sure and you can better utilize features for database you use, tools like Informatica are going to be slower simply because they usually need separate server between and network traffic is going to slow it down. Informatica is very robust and widely used in very large organization simply because you might have 50 ETL developers and they need to quickly look at the code and be able to troubleshoot. You cannot do this with custom code no matter how pretty your packages look.
If you have sources all over the place (from ascii flat files, through mainframe to HTTP, XML and PDF files) and large warehouse(s) on several servers then Informatica is going to be good choice. What I don't like on Informatica is that is a bit too too generic.
If you have straightforward ETL mostly from flat files to ODS and marts and they are all on one Oracle server or RAC, then OWB is probably going to work faster and you will be very happy with single easy to use platform managed by same DBAs with features tuned on Oracle database.
I had an interview few years ago with a company who used informatica, when I told them that we used custom PL/SQL packages instead of informatica because source and target was on the same RAC and thus moving 100+mm records with complex transformation from one schema to another was way faster than informatica (which we had in house) they said that I use custom pl/sql for "job security"... no I did not take that job and I lost respect to those guys.
Informatica vs OWB
Babjee, April 20, 2010 - 7:08 am UTC
Both OWB and Informatica have a similar common goal, and
as such would have some common characteristics.In many ways OWB should be relatively easier due to its support for a single
target - that is the Oracle database (in fact flat files too, but not other
dbs). This reduces ( and optimizes) the possible ways it would generate code,
which would simplify diagnostic and deployment activities.
But in the performance,ease of use prospective Informatica is far better than OWB.example usage of expression palette in OWB is bit hectic( you have to write the Output variable list).
Through OWB we cannot expect the same result always. Some times it gives different results for the same mapping after couple of runs.
I would prefer informatica than OWB.
April 20, 2010 - 8:15 am UTC
... Through OWB we cannot expect the same result always. Some times it gives
different results for the same mapping after couple of runs.
...
I'm sorry, but the software is deterministic.
I hate slow by slow processing as does anyone that has worked with a database of more than a few records. Informatica epitomizes slow by slow processing - every single time I encounter it - it is the cause of slowness (true, I tend these days to only encounter things when they are giving a problem - but still...)
OWB, Informatica, Custom ETL
Tom Clark, April 20, 2010 - 9:50 am UTC
These 2 products produce the same result but in very different manners. I believe that OWB is SQL/PL SQL based and Informatica type tools are file and C based. For example if you want to move data from one Oracle database to another (mapping), Informatica type of tools will unload the table in parallel to files. It will then perform any operations against the data in the files in parallel and then load each of the files utilizing multiple sql loaders to the new database. I believe that OWB would use straight SQL/PL SQL to accomplish the same task. I assume you could allow for parallel and parallel DML to assist with performance of the OWB. My point is that the 2 tools have very different implementations to accomplish the same task.
I have found that neither is as cheap as sales people want you to believe. They sell the products as if you will no longer need an ETL programmer. I have found not only do I still need an ETL programmer but I have to pay for an expensive tool also (Informatica type tools, I know OWB is free with DB). I have to pay for training, learn a new scripting language, consulting fees from the company… I believe that at one time the ETL tools had merit and that was before we had the ability to have queries run parallel, did not have NO LOGGING techniques … I have found that the cost benefit of the products do not hold when compared to good old custom ETL coding. Good Link for OWB:
http://www.rittmanmead.com/articles/
April 20, 2010 - 10:02 am UTC
... I believe that OWB would use straight SQL/PL SQL to accomplish
the same task....
correct, in the ETL process flow, OWB tries to skip the unnecessary E and L steps altogether.
... They sell the products as if you will no longer need an ETL programmer. ...
I hope not, when I talk about them - it is from the perspective that this is for your large development team - it is a book-keeper plain and simple. If you have one guy that does the ETL in their spare time on wednesdays - you don't want this. If you have a team of more than a few people that will be doing long term work that has to be maintained over the course of years - then this is a tool you want to look at.
Writing code by hand - given a competent developer - you'll get something fast. It won't be flexible probably, it'll be harder to enhance/to change, it'll probably come with minimal (if any) documentation.
That is what ETL tools are good for - being the book-keeper, helping with understanding what has taken plan, being flexible.
See my first followup for a more in depth statement of this...
jiri, April 21, 2010 - 5:55 pm UTC
"I have found that the cost
benefit of the products do not hold when compared to good old custom ETL coding"
ouch to that, custom ETL is going to be probably faster but not easier to understand. If you have small company then you are going to be OK with custom tool, with many developers, outsourced production support, ... the cost benefit is going to change very quickly and you will simply have to use some sort of framework - informatica, owb, pentaho, ... make this framework and the troubleshooting, changes and enhancements easier.
OWB is not as robust as Informatica, but if your data warehouse sits on Oracle, then it can be faster and much cheaper and more intuitive due to PL/SQL backend.
If you need speed...
Vinay, August 21, 2014 - 3:41 pm UTC
If you need speed, use custom coded SQL/PLSQL. Simple. Informatica just cannot match the performance of direct SQL. I say this after having worked on data integration for years now. If you have massive data volumes and a tight "time window" within which your processing needs to finish, Informatica is certainly not the tool for the job.