As a data scientist, I’m happy that most data continues to be so terribly formatted and inconsistent as to break and confuse AI. But for how long that’s true, who knows!
Unfortunately, there are still many ways to “fix” things that have a lot of trade-offs or downstream consequences for analysis. For most basic cleaning tasks, LLMs are also still way too slow.