The devil is in temporal token: High quality video reasoning segmentation