Use ldm and stm instruction to optimize performance when both src and dst are 32-bit aligned. Signed-off-by: zhangyuan21 <zhangyuan21@xiaomi.com>
Use ldm and stm instruction to optimize performance when both src and dst are 32-bit aligned. Signed-off-by: zhangyuan21 <zhangyuan21@xiaomi.com>