Well, it just means we trained the model on instructions written that way. Since the results work out, the model must have learned to deal with them.
There isn't much research on what's actually going on here, mainly because nobody has access to the weights of the really good models.