dippogriff 2 days ago
Great work showing how brittle these GUI benchmarks can be! Love the visuals.

I wonder if SFT is the problem here rather than the coordinate discretization; what happens with a continuous action space?