This is not my area (as you know!), but broadly speaking I would say you have two paths forward: either a mathematical proof that your method performs better or, alternatively, formal statistical methods showing that the testing you have done is enough for the results to be statistically significant.

I have gone for a mixture of what you wrote. The outlandish claims are back! The paper has undergone a major rewrite.
