# 更多示例 本文档展示了 [README](../../README_CN.md) 之外的更多示例。若需一次性运行全部示例,可使用 [examples/examples.jsonl](../../examples/examples.jsonl) 并配合 `--jsonl_path` 参数(参见 README 中[「一次测试多个问题」](../../README_CN.md#一次测试多个问题)小节)。 --- #### 示例8 该例题源自 [MindCube](https://github.com/mll-lab-nu/MindCube): ```bash python example.py \ --image_paths examples/Q8_1.jpg examples/Q8_2.jpg examples/Q8_3.jpg examples/Q8_4.jpg \ --question "Based on these four images (image 1, 2, 3, and 4) showing the pink bottle from different viewpoints (front, left, back, and right), with each camera aligned with room walls and partially capturing the surroundings: From the viewpoint presented in image 4, what is to the left of the pink bottle?\nOptions: A. Pink plush toy and headboard B. Window and blue curtain C. Closet and door D. White wall\nAnswer with the option's letter from the given choices directly." \ --model_path sensenova/SenseNova-SI-1.3-InternVL3-8B ```
示例8详情

Q: Based on these four images (image 1, 2, 3, and 4) showing the pink bottle from different viewpoints (front, left, back, and right), with each camera aligned with room walls and partially capturing the surroundings: From the viewpoint presented in image 4, what is to the left of the pink bottle?\nOptions: A. Pink plush toy and headboard B. Window and blue curtain C. Closet and door D. White wall\nAnswer with the option's letter from the given choices directly.

Image 1 Image 2 Image 3 Image 4

正确答案: C

--- #### 示例9 该例题源自 [SITE-Bench](https://github.com/wenqi-wang20/SITE-Bench): ```bash python example.py \ --image_paths examples/Q9.jpg \ --question "Question: Consider the real-world 3D locations and orientations of the objects. Which side of the bus in the center is facing the bus stop?\nOptions: \nA. front\nB. left\nC. back\nD. right\nGive me the answer letter directly. The best answer is:" \ --model_path sensenova/SenseNova-SI-1.3-InternVL3-8B ```
示例9详情

Q: Question: Consider the real-world 3D locations and orientations of the objects. Which side of the bus in the center is facing the bus stop?\nOptions: \nA. front\nB. left\nC. back\nD. right\nGive me the answer letter directly. The best answer is:

Image

正确答案: D

--- #### 示例10 该例题源自 [SITE-Bench](https://github.com/wenqi-wang20/SITE-Bench): ```bash python example.py \ --image_paths examples/Q10.jpg \ --question "Question: Consider the real-world 3D orientations of the objects. Are the arrow on street sign and the taxi facing same or similar directions, or very different directions?\nOptions: \nA. same or similar directions\nB. very different directions\nGive me the answer letter directly. The best answer is:" \ --model_path sensenova/SenseNova-SI-1.3-InternVL3-8B ```
示例10详情

Q: Question: Consider the real-world 3D orientations of the objects. Are the arrow on street sign and the taxi facing same or similar directions, or very different directions? Options: A. same or similar directions, B. very different directions. Give me the answer letter directly. The best answer is:

Image

正确答案: A

--- #### 示例11 该例题源自 [SITE-Bench](https://github.com/wenqi-wang20/SITE-Bench): ```bash python example.py \ --image_paths examples/Q11.jpg \ --question "Question: What shape are all the men standing in?\nOptions: A. circle B. rectangle C. triangle D. square\nGive me the answer letter directly. The best answer is:" \ --model_path sensenova/SenseNova-SI-1.3-InternVL3-8B ```
示例11详情

Q: Question: What shape are all the men standing in?\nOptions: A. circle B. rectangle C. triangle D. square\nGive me the answer letter directly. The best answer is:

Image

正确答案: A

--- #### 示例12 该例题源自 [ViewSpatial-Bench](https://github.com/ZJU-REAL/ViewSpatial-Bench): ```bash python example.py \ --image_paths examples/Q12.jpg \ --question "From the perspective of this man who doesn't wear glasses, where is the man wearing glasses located beside him?\nOptions: A. left B. back-right C. front D. right\nAnswer with the option's letter from the given choices directly." \ --model_path sensenova/SenseNova-SI-1.3-InternVL3-8B ```
示例12详情

Q: From the perspective of this man who doesn't wear glasses, where is the man wearing glasses located beside him? Options: A. left, B. back-right, C. front, D. right. Answer with the option's letter from the given choices directly.

Image

正确答案: A

--- #### 示例13 该例题源自 [MMSI-Bench](https://github.com/InternRobotics/MMSI-Bench),测试模型在开放式简答题上的能力: ```bash python example.py \ --image_paths examples/Q13_1.png examples/Q13_2.png \ --question "The iMac is in the northern part of the room. In which direction is the area where students do their homework?" \ --model_path sensenova/SenseNova-SI-1.3-InternVL3-8B ```
示例13详情

Q: The iMac is in the northern part of the room. In which direction is the area where students do their homework?

First image Second image

正确答案: Northwest corner

--- #### 示例14 该例题源自 [MMSI-Bench](https://github.com/InternRobotics/MMSI-Bench),测试模型在开放式简答题上的能力: ```bash python example.py \ --image_paths examples/Q14_1.png examples/Q14_2.png \ --question "How many building models are captured in total in these two pictures?" \ --model_path sensenova/SenseNova-SI-1.3-InternVL3-8B ```
示例14详情

Q: How many building models are captured in total in these two pictures?

First image Second image

正确答案: 4

--- #### 示例 15 此示例展示模型的 **立体几何(三视图)** 能力: ```bash python example.py \ --image_paths examples/Q15.png \ --question "请将你的思考过程放在标签内,并将你的最终答案放在标签内。" \ --model_path sensenova/SenseNova-SI-1.5-InternVL3-8B ```
示例 15 详情

问题:请将你的思考过程放在<think> </think>标签内,并将你的最终答案放在<answer> </answer>标签内。

第一张图片

正确答案:B

--- #### 示例 16 此示例展示模型的 **立体几何(三视图)** 能力: ```bash python example.py \ --image_paths examples/Q16.png \ --question "请将你的思考过程放在标签内,并将你的最终答案放在标签内。" \ --model_path sensenova/SenseNova-SI-1.5-InternVL3-8B ```
示例 16 详情

问题:请将你的思考过程放在<think> </think>标签内,并将你的最终答案放在<answer> </answer>标签内。

第一张图片

正确答案:C

--- #### 示例 17 此示例展示模型的 **立体几何(3D图形推理)** 能力: ```bash python example.py \ --image_paths examples/Q17.png \ --question "请将你的思考过程放在标签内,并将你的最终答案放在标签内。" \ --model_path sensenova/SenseNova-SI-1.5-InternVL3-8B ```
示例 17 详情

问题:请将你的思考过程放在<think> </think>标签内,并将你的最终答案放在<answer> </answer>标签内。

第一张图片

正确答案:C

--- #### 示例 18 此示例展示模型的 **立体几何(三视图)** 能力: ```bash python example.py \ --image_paths examples/Q18.png \ --question "请将你的思考过程放在标签内,并将你的最终答案放在标签内。" \ --model_path sensenova/SenseNova-SI-1.5-InternVL3-8B ```
示例 18 详情

问题:请将你的思考过程放在<think> </think>标签内,并将你的最终答案放在<answer> </answer>标签内。

第一张图片

正确答案:A