Bradley K addresses this:
"In some situations, calls to malloc_usable_size() were taking significant CPU cycles. (Vadim measured 8% on a workload with a fanout of 128.) We can remember the usable size of the message buffers, saving those CPU cycles."