Hiya
My guess is that the brain when exposed to very much information, filters away more than when it has time to process it. If there are gaps in the information, then it can make conclusions that it is loud.
When there is a stream of continues loudness, then there is no way it can make any conclusion about the loudness. It has to wait and compare it to the nextcoming mix.
Just me guessing.