Instruction handlers that take advantage of ARM VFP. These work with VFP
v2 and v3 (VFPLite).
The ARM code driving the floating-point calculations will run on ARMv5TE
and later. It assumes that word alignment is sufficient for double-word
accesses (which is true for some ARMv5 and all ARMv6/v7), to avoid having
to transfer double-precision values in two steps.