The FPU register saving upon vfork entry was missing. Also added macro that tells the actual size of an FPU reg, instead of just having a coefficient for qfpu/no-qfpu.