I think an end-to-end RL approach will eventually work- but _eventually_ could be in a really long time. It’s also a question of scale: even if Comma’s approach is fundamentally better, how much better is it? If Tesla has 1000x more cars, and their approach is 10x worse, they’ll still improve 100x faster than Comma.