Pigeonhole sort

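Pigeonhole sort runs in time proportional to the number of objects plus the size of the range of their keys. When that range is at most a constant multiple of the number of objects, this is linear time, which is the best possible performance for a sorting algorithm, since every object to be sorted has to be examined at least once. However, it requires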
  • that no two objects in the array be identical;
  • an invertible function mapping the objects being sorted to integers within a small range (say, 0 to 1000). To achieve the linear running time, the size of the range must be limited to some constant times the number of elements being sorted. Pigeonhole sort orders the elements by the result of this mapping.
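
As an illustration of such a mapping, the sketch below maps lowercase letters to the integers 0 to 25 and back. The names letter_to_index, index_to_letter and LETTER_RANGE are hypothetical, and the code assumes an ASCII-compatible character set in which 'a' to 'z' are contiguous.

```c
#include <assert.h>

/* Hypothetical example mapping, assuming ASCII: 'a'..'z' <-> 0..25. */
enum { LETTER_RANGE = 26 };

/* Maps a lowercase letter to its pigeonhole index. */
int letter_to_index(char c)
{
    assert(c >= 'a' && c <= 'z');
    return c - 'a';
}

/* Inverse mapping: recovers the letter from its pigeonhole index. */
char index_to_letter(int i)
{
    assert(i >= 0 && i < LETTER_RANGE);
    return (char)('a' + i);
}
```

Because the mapping is invertible, a pigeonhole only needs to record whether it is occupied; the object itself can be rebuilt from the index of the hole.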

The algorithm works as follows.

  1. Set up an array of initially empty "pigeonholes" the size of the range.
  2. Go over the original array, putting each object in its pigeonhole.
  3. Iterate over the pigeonhole array in order, and put elements from non-empty holes back into the original array.
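
The three steps can be sketched for records that carry an integer key. This is a minimal sketch, not a definitive implementation: struct item and pigeonhole_sort_items are hypothetical names, and the code assumes that every key is unique and lies in 0 to range-1.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdlib.h>

/* Hypothetical record type; `key` plays the role of the integer mapping. */
struct item {
    int key;            /* assumed unique and within 0..range-1 */
    const char *name;
};

/* Sorts items[0..n-1] by key. Returns false if memory allocation fails. */
bool pigeonhole_sort_items(struct item *items, size_t n, int range)
{
    assert(range > 0);

    /* Step 1: one initially empty pigeonhole per possible key. */
    struct item *holes = malloc((size_t)range * sizeof *holes);
    bool *used = calloc((size_t)range, sizeof *used);
    if (holes == NULL || used == NULL) {
        free(holes);
        free(used);
        return false;
    }

    /* Step 2: put each object into the pigeonhole given by its key. */
    for (size_t i = 0; i < n; i++) {
        int k = items[i].key;
        assert(k >= 0 && k < range && !used[k]);
        holes[k] = items[i];
        used[k] = true;
    }

    /* Step 3: walk the pigeonholes in order, copying occupied ones back. */
    size_t out = 0;
    for (int k = 0; k < range; k++)
        if (used[k])
            items[out++] = holes[k];

    free(holes);
    free(used);
    return true;
}
```

Storing a copy of each record in its hole avoids the need for an inverse mapping, at the cost of extra memory proportional to the range.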

The hidden constant for this algorithm depends critically on the density of the elements in the pigeonhole array. If there are many more pigeonholes than items to be sorted, steps 1 and 3 will be relatively slow, because both visit every pigeonhole whether or not it is occupied.
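
One way to gauge the density before sorting is to compare the number of pigeonholes with the number of items. The helper below is a rough sketch under that assumption; holes_per_item is a hypothetical name, not part of any library.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper: average number of pigeonholes per item to be sorted.
   A value near 1 means a dense pigeonhole array; a large value means that
   steps 1 and 3 spend most of their time on empty holes. */
double holes_per_item(int minvalue, int maxvalue, size_t n)
{
    assert(n > 0 && minvalue <= maxvalue);
    long long size = (long long)maxvalue - minvalue + 1;
    return (double)size / (double)n;
}
```

For example, ten items with values between 0 and 999 give 100 pigeonholes per item, so most of the work in steps 1 and 3 is spent on empty holes.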

Pigeonhole sort is rarely used as the requirements are rarely met and other, more flexible, sorting algorithms are easier to use.

Sample C code for this algorithm:

 #include <stdbool.h>

 /* Sorts the distinct ints in low..high (both inclusive), all within minvalue..maxvalue */
 void pigeonhole_sort ( int *low , int *high , int minvalue , int maxvalue )
 {
    /* minvalue and maxvalue can also easily be determined within this function */
    int count , size = maxvalue - minvalue + 1 , *current ;
    bool holes[size] ; /* C99 variable-length array, one flag per possible value */
    for ( count = 0 ; count < size ; count++ ) /*Initializing*/
       holes[count] = false ;
    for ( current = low ; current <= high ; current++ ) /*Sorting*/
       holes[(*current)-minvalue] = true ;
    for ( current = low , count = 0 ; count < size ; count++ ) /*Writing back in order*/
       if ( holes[count] )
       {
          *current = count + minvalue ;
          current++ ;
       }
 }
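
The comment in the sample notes that minvalue and maxvalue can be determined from the data. A minimal sketch of such a scan follows; find_range is a hypothetical helper name, and it assumes the same convention as the sample, in which high points at the last element rather than one past it.

```c
#include <assert.h>

/* Hypothetical helper: scans low..high (both inclusive) once and stores
   the smallest and largest values found. */
void find_range(const int *low, const int *high, int *minvalue, int *maxvalue)
{
    assert(low <= high);            /* at least one element is required */
    *minvalue = *maxvalue = *low;
    for (const int *p = low + 1; p <= high; p++) {
        if (*p < *minvalue)
            *minvalue = *p;
        if (*p > *maxvalue)
            *maxvalue = *p;
    }
}
```

The two results can then be passed as the last two arguments of pigeonhole_sort. The extra pass costs time linear in the number of elements, so it does not change the overall running time.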

All Wikipedia text is available under the terms of the GNU Free Documentation License
