Skip to Main Content
  • Questions
  • Using indexes to order data (and avoiding order by)

Breadcrumb

Question and Answer

Tom Kyte

Thanks for the question, Prashant.

Asked: December 15, 2000 - 12:15 pm UTC

Last updated: February 18, 2005 - 2:19 pm UTC

Version: 8.0.4

Viewed 10K+ times! This question is

You Asked

I want to avoid an order by in a pro*c cursor. I know I can do this with an appropriate index on a single table select if the right clauses are included in the where clause and the right fields are pulled, but I want to do something more complicated.

The select is a join on two tables. Both tables have a composite index, in which the leading column is the join column, and I want the data ordered by this column. The select list includes only index columns from one table and index cloumns plus other columns from the other table. I can make Oracle use the two indexs by using index hints, so I have an index_ffs pull on one table and an index access followed by table lookup on the other table (as indicated by explain plan).

The data returns ordered (by the join column) even if I do not put an order by clause on the select. This is true of any join method (nl or hash join) as long as I do not use parallel. If however, I put the order by clause in, a sort is indicated on the explain plan and the performance is much slower. Oracle does not seem able to avoid an explict sort, even though it seems to me that the data *must* be ordered already.

The Question:
Given that the tables are joining on a column which is the first column in both indexes, and that the hints force Oracle to use these indexes, can I make the assumption that Oracle will always return me the data ordered by the joining column even if I do not use the order by clause, and will this be true even if I use hash join instead of nested loop?



and Tom said...

Hints are just that -- Hints. They may (and will be) ignored at the whim of the server. Not only that -- but a fast full scan does not return data sorted (proof by simple example below).

Your ONLY option for a correctly coded application, if you need the data sorted, is to use an ORDER BY. A subsequent upgrade, someone analyzing statistics (or analyzing them differently then they do today), data skew over time, any number of things -- can and will change the plan (even if it works). Never rely on a "plan" to have the data ordered -- you can ONLY rely on order by to order the data.

Fast full scans on the index are defined in this fashion (designing and tuning reference):

<quote>
This is an alternative to a full table scan when the index contains all the columns that are needed for the query, and at least one column in the index key has the NOT NULL constraint. Fast full scan accesses the data in the index itself, without accessing the table. It cannot be used to eliminate a sort operation. It reads the entire index using multiblock reads (unlike a full index scan) and can be parallelized.
</quote>

<quote>
Using Fast Full Index Scans

The fast full index scan is an alternative to a full table scan when there is an index that contains all the keys that are needed for the query. A fast full scan is faster than
a normal full index scan in that it can use multiblock I/O and can be parallelized just like a table scan. Unlike regular index scans, however, you cannot use keys and the rows will not necessarily come back in sorted order. The following query and
plan illustrate this feature.
</quote>

Your data is currently perhaps being returned in sorted order TODAY but it does not have to be and will not be over time!!!

Here is the conclusive proof of that:

ops$tkyte@DEV816> create table t
2 ( x int not null,
3 a char(2000) default rpad('*',2000,'*') not null,
4 b char(2000) default rpad('*',2000,'*')
5 )
6 /

Table created.

ops$tkyte@DEV816> insert into t(x) select rownum from all_objects where rownum < 10
2 /

9 rows created.

ops$tkyte@DEV816> create index t_idx on t(x,a)
2 /

Index created.

ops$tkyte@DEV816> begin
2 for i in 1 .. 5
3 loop
4 delete from t where x = i;
5 commit;
6 end loop;
7 end;
8 /

PL/SQL procedure successfully completed.

ops$tkyte@DEV816> begin
2 for i in 1 .. 5
3 loop
4 insert into t (x) values (i+100);
5 commit;
6 end loop;
7 end;
8 /

PL/SQL procedure successfully completed.

ops$tkyte@DEV816> set autotrace on explain
ops$tkyte@DEV816> select /*+ INDEX_FFS(t t_idx) */ x from t
2 /

X
----------
101
102
103
6
7
8
9
104
105

9 rows selected.


Execution Plan
----------------------------------------------------------
0 SELECT STATEMENT Optimizer=CHOOSE (Cost=1 Card=82
1 0 INDEX (FAST FULL SCAN) OF 'T_IDX' (NON-UNIQUE)

This is why we do the explicit order by at the end -- an index fast full scan does not return the data sorted in general!!!!

There is many dozens of ways to get the results above and it will happen naturally as the index grows and splits over time -- an index fast full scan is most definitely NOT sorted.

Rating

  (3 ratings)

Is this answer out of date? If it is, please let us know via a Comment

Comments

A reader, April 10, 2002 - 10:11 am UTC

hi Tom,

Can we use index to sort data?

We are using siebel CRM applications(6.2). and we are doing lot's of sort in application. siebel application only use RULE base optimizer.
In one of the table(1 million rows) we sort by
create_date desc and activity_start_Date desc.

it is taking long time.If i remove sorts then result are very fast.

So can i created composite index in descending order like.

(create_date desc,activity_start_Date desc).

Will it improve performance on query?

Thanks in advance.





Tom Kyte
April 10, 2002 - 10:28 am UTC

You need to ask a different tom -- tom siebel.

Do they support you adding indexes.

Adding indexes may affect other components of Siebel (ouch, that hurts to type)...

Yes, you can create such an index (an index with DESC in it will only be used by the CBO, not the RBO).


I cannot imagine what you would DO with 1 million rows though!!! that seems like alot (about 999,900 more then i would want to see sorted on a screen myself)

A reader, April 10, 2002 - 10:44 am UTC

Tom,

Why you guy's hate each other(oracle and siebel).

If i ask anything to siebel people they tell us USE
IBM DB2 which is better then oracle.

You are telling me to go to tom siebel.

Here is my response.

Personally i feel oracle is THE BEST database(i work with DB2 around a year).

about CRM applications, SIEBEL is much better then ORACLE CRM .it use oracle forms which i don't like (I worked with forms about 2 year(version 3.0(Character) and 4.5(GUI) ), and i found VISUAL BASIC is the BEST for front end applications.

any comments?


Tom Kyte
April 10, 2002 - 11:24 am UTC

Well, they are the competition -- It pains me to write "sqlserver" as well. It's supposed to be a joke.

I have to tell you to go to siebel for this one. I cannot answer the question. I was not brushing you off saying "goto siebel if you want to use that stuff". If I could answer the question, I would.

I can say "yes, you can definitely put that index on that table, it will work from the perspective of Oracle".

I cannot say "and your siebel software will be unaffected, their support will be 100% ok with this, and no undesirable side effects will occur". That I cannot say. I quite simply do not know what siebels policy on mucking about with their database objects are.

Siebel tells you to go IBM because IBM doesn't sell CRM software, IBM sells services that will implement Siebel CRM.


Its been a very long time since you've seen Oracle's tools then. Forms 4.5 was ending about 1995 (about when the web came along).

I find the WEB to be the best for front end applications personally. If you want to install and patch and reinstall and maintain an application on 40,000 desktops -- thats up to you. Me, nope, won't do it. I haven't used client server since about 1995 myself. VB, never ever used that for anything at anytime (C, wrote lots and lots of C)



order by containing few tables - how can i avoid a sort ?

Tommy, February 18, 2005 - 10:35 am UTC

I have a query which joins a few tables.
this query has order by 4 field (from 2 tables).

Is there a way to avoid a real sort and use an index ?

Thanks,
Tommy.


Tom Kyte
February 18, 2005 - 2:19 pm UTC

when you figure out how to create a b*tree index across two tables :)

think about it.... if you join EMP to DEPT and order by "ename, dname"... how could an index be used to avoid a sort?



More to Explore

PL/SQL demos

Check out more PL/SQL tutorials on our LiveSQL tool.

PL/SQL docs

PL/SQL reference manual from the Oracle documentation library