I would imagine that during the first pass of two pass encoding it's collecting statistics on where the action is so it can choose where to spend its bits, given you probably specified a final file size.
Pretty much. Multipass means a more highly optimised file. CBR you have no need for that because you have a set bitrate. VBR tries to minimise the bitrate as much as possible per your settings.