It does; radix sort is in fact O(N) if the size of the universe of values counts as a constant. It’s just slow in practice.
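To make the constant explicit, here's a minimal LSD radix sort sketch (Python; the 32-bit key width and base 256 are my choices, not anything from upthread). Four stable byte passes give O(4 * (N + 256)) = O(N) total work, with the universe size living entirely in the constant:

    # Minimal LSD radix sort over 32-bit unsigned keys, base 256.
    # With the key width fixed, it's four stable counting-sort passes,
    # i.e. O(N) with the "universe size" hidden in the constant.
    def radix_sort_u32(keys):
        for shift in range(0, 32, 8):
            buckets = [[] for _ in range(256)]
            for k in keys:
                buckets[(k >> shift) & 0xFF].append(k)   # stable per pass
            keys = [k for b in buckets for k in b]
        return keys

    print(radix_sort_u32([3_000_000_000, 42, 7, 42, 0]))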
The definition of the machine for which the O(N log N) bound is proved is very delicate: you have to allow O(1) operations on an arbitrarily large set of values, but disallow encoding tricks that pack multiple values into one and then manipulate them unrealistically cheaply with those operations. In particular, the machine must not be able to do arbitrary arithmetic.
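To make the forbidden trick concrete, here's a sketch (mine, not from any paper; the 32-bit field width is arbitrary) of what goes wrong if arithmetic on arbitrarily wide values is unit cost: the entire counting-sort histogram packs into one integer, each key costs a single shift-and-add, and the "sort" does no comparisons at all.

    # If wide-integer arithmetic were O(1), one big integer holds the whole
    # counting-sort histogram, 32 bits per counter (assumes counts < 2**32).
    def packed_counting_sort(keys, universe):
        FIELD = 32
        acc = 0
        for k in keys:
            acc += 1 << (k * FIELD)   # one "O(1)" add per key -- the cheat
        out = []
        for v in range(universe):
            count = (acc >> (v * FIELD)) & ((1 << FIELD) - 1)
            out.extend([v] * count)
        return out

    print(packed_counting_sort([3, 1, 4, 1, 5, 9, 2, 6], universe=10))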
Kiiinda. Two-element swaps are already a stretch for merge sort, especially the O(log N)-space linked-list version (sketch below), let alone search trees and so on. At some point you also need to make sure you can’t sneak arbitrary computation into the (necessarily unlimited-magnitude) array index.
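Here's roughly what I mean (a toy Python sketch; the Node class and names are mine): linked-list merge sort never swaps two elements, it only relinks nodes, and its only extra space is the O(log N) recursion stack, so a cost model that counts swaps says almost nothing about it.

    class Node:
        def __init__(self, val, nxt=None):
            self.val, self.nxt = val, nxt

    def merge_sort(head):
        if head is None or head.nxt is None:
            return head
        # Split in half with slow/fast pointers.
        slow, fast = head, head.nxt
        while fast and fast.nxt:
            slow, fast = slow.nxt, fast.nxt.nxt
        mid, slow.nxt = slow.nxt, None
        left, right = merge_sort(head), merge_sort(mid)
        # Merge by relinking -- no element is ever moved or swapped.
        dummy = tail = Node(None)
        while left and right:
            if left.val <= right.val:
                tail.nxt, left = left, left.nxt
            else:
                tail.nxt, right = right, right.nxt
            tail = tail.nxt
        tail.nxt = left or right
        return dummy.nxt

    # Usage: build 4 -> 2 -> 3 -> 1, sort, print 1 2 3 4.
    node = merge_sort(Node(4, Node(2, Node(3, Node(1)))))
    while node:
        print(node.val)
        node = node.nxt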
I mean, it depends. In Unicode normalization you have to do a stable sort of an arbitrary number of values (code points) that can only ever map to a small finite set (< 256) of sort keys (combining classes). Insertion sort is the best choice for ordinary inputs, but for adversarial ones a counting sort is probably your best bet (sketch below).
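For illustration, a sketch of that counting-sort fallback (the function name is mine, and real canonical ordering only reorders maximal runs of nonzero-class marks rather than the whole string, but the bucketing idea is the same): one stable pass over 256 buckets, O(N + 256) no matter how adversarial the input is.

    import unicodedata

    # Stable counting sort of code points by canonical combining class.
    def sort_by_ccc(codepoints, ccc):
        buckets = [[] for _ in range(256)]
        for cp in codepoints:          # appending in input order keeps it stable
            buckets[ccc(cp)].append(cp)
        return [cp for b in buckets for cp in b]

    # q + dot below (ccc 220) + dot above (ccc 230)
    text = "q\u0323\u0307"
    print(sort_by_ccc(list(text), unicodedata.combining))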
It looks like it's rather sensitive to the input distribution. The claim in the abstract is that it's O(1) "for the priority increment distributions recently considered by Jones in his review article." The conclusion gives a bit more detail.