Subject Index Stats
Author Don Gollahon
I need a little more info than the IB docs are giving me on the meaning of
the Index Statistics.

Consider the following example:

Index TOLL_INDEX (3)
Depth: 3, leaf buckets: 16823, nodes: 7096542
Average data length: 5.00, total dup: 293392, max dup: 26
Fill distribution:
0 - 19% = 76
20 - 39% = 6
40 - 59% = 8865
60 - 79% = 6861
80 - 99% = 1015

This says there are a lot of total dups but since max dups is low then this
index is probably okay, Right?

Now consider the following:

Index CABSEXTRACT (4)
Depth: 3, leaf buckets: 10348, nodes: 7096545
Average data length: 0.00, total dup: 7096513, max dup: 809128
Fill distribution:
0 - 19% = 4
20 - 39% = 4
40 - 59% = 10324
60 - 79% = 11
80 - 99% = 5

This index has a large max dup. I should probably add a field to the index
reduce that? I assume there is a point at which it would be better not to
have an index at all if it is going to have a large amount of dups, right?
Is there a rule of thumb to determine this?


It would be nice if there was a FB conference in Chicago sometime so I could
go get some training.

Thanks for all of your help.


Don Gollahon
gollahon@...
"The Original GenSoft Prodigal"