The actor pattern exists for this reason - it's not like AI is a singular thing checking itself - you could easily have 1000 instances with supervisors automatically rotating out unhealthy instances so that the system is self-repairing even if instances can fail.