Untitled Document

The general selection algorithm with a O(n(logn)²)-time complexity

Introduction

We write pi(a) for the permutation that orders the intercepts of l₁,…, l_n, in descending order at x=a. So y_pi1(a)>…> ypi_n(a). We renumber l₁,…, l_n so that for a< t₁, (a) is the identity. The permutation pi(t₁) has one inversion (exactly one paire of lines crossed at t₁), pi (t₂) has two, etc…

In fact the function I( (x)) = number of inversions in pi(x), is a monotone step function in x with unit jumps at the t_i's.

I(pi (x)) = j if and only if t_j = max(t_i:t_i<=x).

For a given k, the problem of finding t_kmay be viewed as an usual sorting problem to which we apply Megiddo's technique of building algorithms from parallel ones. An implicit binary search over the t_i's is performed, each step taking O(nlogn) time. This will an O(n(logn)²)-time algorithm.

Presentation of the algorithm

In seeking t_k, we will attempt to sort y₁(a^*),…, y_n(a^*) at a^* = t_k + epsilon. We know that this sort may be achieved in O(nlogn) comparisons (cf Binary Search Tree), each answering a question Q_ij of the form "y_i(a^*)<=y_j(a^*)?". The O(nlogn) answers yield the permutation pi^*that sorts these intercepts : ypi^*₁(a^*)>…> ypi^*_n(a^*). Once pi^* has beed found, t_k= max[upi_i^*pi_i+1^*:pi_i^*>pi_i+1^*]; the kth inversion must have just reversed a pair of adjacent intercepts in the permutation pi(t_k-1).

The control structure for the sort will come from the O(logn)-depth sorting network of Ajtai, kmlos and Szemeredi (AKS-Network). At each level, n/2 questions are answered. The network is just a guide for blocking these comparisions into groups of size n/2. The sort is complete once the O(nlogn) answers are obtained and we have determined pi^*.

Even though we do not know a^*, we can answer the question Q_ij in time O(nlogn) as follows. We find u_ijthe x-coordinate of , l_i inter l_j, i<j, in constant time and then obtain its rank among the t_i's.To find its rank, we sort the n intercepts at u_ij in decreasing order to get pi(u_ij). The rank u_ij is the number of inversions in pi(u_ij), I(pi (u_ij)).

If I(pi (u_ij))>k, we know that u_ij> t_k so the answer is "no"; lines i and j have not yet crossed at t_k.

If I(pi(u_ij))<k, we know that u_ij< t_k so the answer is "yes".

If I(pi (u_ij))=k, u_ij=t_k.

I may be computed in time O(nlogn) (merge sort of Knuth) : if pi(u_ij) = (r₁,…,r_n), we use merge-sort to sort the slopes mr₁,…,mr_n and count the number of inversions that were performed.

If we actually answered all n/2 questions Q_i1j1,…, Q_injn on a level of the network by counting inversions, the complexity would be O(n²logn) for that level (n/2 questions*2nlogn for the sort of the n intercepts and the merge sort). And as there are O(logn) levels, we have a complexity of O((nlogn)²) overall.

Improvements

The trick is to resolve the n/2 questions on a level by actually counting inversions only O(log) times.

As mentionned, each questions determines an intersection of a distinct pair of lines, and the answer is obtained by comparing the rank of that intersection point with k.

On a given level of the sequentialzed version of the sorting network, denote the x-coordinates of these points by z_i1,…,z_in/2. We can compute the median of these intersections points, z_med, in time O(n). Its cost time O(nlogn) to rank it, and answer half the questions. For example, if z_med< t_k, then z_ik< t_k for all the z_ik≤ z_med. Continuing with the n/4 unresolved questions on this level, we again find the median z and rank it in time O(nlogn), etc…After O(logn) inversions counts, all n/2 questions on this level are resolved. Since each inversion count takes O(nlogn) steps and there are O(logn) levels, the algorithm has time complexity O(n(logn)³).

But, Cole shows how the result may be improved by a factor of logn, by considering the fact that in the network each question has two inputs. (see R.COLE, Slowing down sorting networks to obtain faster sorting algorithms, Journal of the.ACM, No34 (1987), pp200-208)

So we obtain an algorithm of O(n(logn)²)-time complexity

The general selection algorithm with a O(n(logn)2)-time complexity

Introduction

Presentation of the algorithm

Improvements

The general selection algorithm with a O(n(logn)²)-time complexity