(1) For **Grounded Caption Generation** (interleaved segmentation), use a prompt like: *"Could you provide me with a detailed analysis of this photo? Please output with interleaved segmentation masks for the corresponding parts of the answer."* <br>
 (2) For **Segmentation Output**, use a prompt like: *"Can you please segment xxx in the given image?"* <br>
 (3) For **Image Captioning** (VQA), use a prompt like: *"Could you please give me a detailed description of the image?"* <br>
 (4) For **Image Conversation**, enter any text instruction. <br>
"""
ONE_THIRD = 1.0 / 3.0
ONE_SIXTH = 1.0 / 6.0
TWO_THIRD = 2.0 / 3.0
def desaturate(rgb, factor=0.65):
    """
    Desaturate an RGB color by a given factor.
    :param rgb: A tuple of (r, g, b) where each value is in [0, 255].
    :param factor: The factor by which to reduce the saturation.
        0 means completely desaturated, 1 means original color.
    :return: A tuple of desaturated (r, g, b) values in [0, 255].