I wonder if SFT is the problem here as opposed to the coordinate discretization; what happens with continuous action space?