开发者

Is radix sort used for suffix sorting?

I'm trying to implement block sorting. This is from the Burrows Wheeler paper.

(Before this step, you create a V suffix array of S)

Q4. [radix sort]

Sort the elements of V , using the first two characters of each suffix as the sort key. This can be done efficiently using radix sort.

So I understand you are sort开发者_开发问答ing the suffixes with radix sort.

How is this supposed to update the array V? Only after the radix sort finished I can know the sorted position of a suffix. Suppose that the 4th suffix end up being the first after sorted. So V[0] = i. In this case, we know (because I told you) that i = 4. But how does the algorithm know that since we are not keeping track of their position. Should I make a Class that contains both the suffix and its suffix number?


After a quick read; I think Burrows-Wheeler have an error and meant to say to sort the elements of W using the array V to track and map the final locations of the elements of W. ie. Such that W is unchanged and V contains a sorted list of indices.

The paper appears to treat V as an array of pointers to elements in W from that point forward.

Check out http://michael.dipperstein.com/bwt/ There is a great description as well as source code for the algorithm at the bottom of the page.

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜