For those only having a quick look, the charts do not have the y axis start at zero, the differences are much smaller than it looks.
Also, they seemingly already had a BWT lying around (which is seemingly not counted in “from scratch in a couple of days”), and there is no mention of speed.
It seems like author means his implementation is competitive in terms of _compression ratio_ only, and this is achieved in the final Huffman encoding step.
From that perspective the BWT implementation isn't very interesting because the BWT is deterministic, so the effect on compression ratio doesn't depend on which implementation was used.
He didn't specify that the demo of BWT was in Ada, or his own.