move task group into task_tcb_s to avoid access allocator to improve performance for Task Termination, the time consumption will be reduced ~2us (Tricore TC397 300MHZ): 15.97(us) -> 13.55(us) Signed-off-by: chao an <anchao@lixiang.com>