CATR: Combinatorial-Dependence Audio-Queried Transformer for Audio-Visual Video SegmentationShare on Twitter Facebook LinkedIn Previous Next