Try it yourselves - the first person writes the speech bubbles. The second person draws the pictures for those and writes the speech bubbles in the next panel. Then the first person (or third person, depending how many friends you have) draws the pictures for THOSE, and then writes the speech bubbles for the NEXT panel... and so on..
simple right?