And no, a neural net and two cameras are not "just fine". The day cameras are as good as your eyes and your neural net is at the level of human intelligence (AGI), then maybe it will be possible. But until then you will need to rely on extra hardware to get there.
Go check on YouTube how FSD behaves in a city with 1/10th the complexity of SF/Waymo. And remember, the difficulty is in the long tail of unexpected events.