Datasets:
split stringclasses 1
value | reasoning_type stringclasses 3
values | question_index int64 0 7.85k | question stringlengths 35 122 | answer stringclasses 34
values | image_filename stringclasses 1k
values | image_index int64 0 999 | question_family_index int64 0 5 | template_filename stringclasses 6
values | frame_length int64 20 20 | program stringlengths 307 1.82k |
|---|---|---|---|---|---|---|---|---|---|---|
train | factual | 0 | Is the red mountain moving fast at the beginning? | False | super_clevr_0 | 0 | 0 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1, 2, 3], "value_inputs": ["red"]}, {"type": "filter_shape", "inputs": [1], "_output": [1], "value_inputs": ["mountain"]}, {"type": "unique", "inputs": [2], "_output": 1, "value_inputs": []... |
train | factual | 1 | Is the mountain moving faster than a red chopper at the start of the frames? | False | super_clevr_0 | 0 | 1 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [1], "value_inputs": ["mountain"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type"... |
train | factual | 2 | Which direction is the red cruiser moving towards at the beginning? | back | super_clevr_0 | 0 | 2 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1, 2, 3], "value_inputs": ["red"]}, {"type": "filter_shape", "inputs": [1], "_output": [2], "value_inputs": ["cruiser"]}, {"type": "unique", "inputs": [2], "_output": 2, "value_inputs": []}... |
train | factual | 3 | Is the green fighter is accelerating at the beginning? | False | super_clevr_0 | 0 | 0 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [0], "value_inputs": ["green"]}, {"type": "filter_shape", "inputs": [1], "_output": [0], "value_inputs": ["fighter"]}, {"type": "unique", "inputs": [2], "_output": 0, "value_inputs": []}, {"... |
train | factual | 4 | Is the green fighter floating when it first appeared? | False | super_clevr_0 | 0 | 1 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [0], "value_inputs": ["green"]}, {"type": "filter_shape", "inputs": [1], "_output": [0], "value_inputs": ["fighter"]}, {"type": "unique", "inputs": [2], "_output": 0, "value_inputs": []}, {"... |
train | factual | 5 | Dose the chopper collide with the green fighter in the given frames? | False | super_clevr_0 | 0 | 0 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [3], "value_inputs": ["chopper"]}, {"type": "unique", "inputs": [1], "_output": 3, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [], "value_inputs": []}, {"type": ... |
train | factual | 6 | Is the green thing moving fast when it collides with the purple minivan? | False | super_clevr_1 | 1 | 0 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [3], "value_inputs": ["green"]}, {"type": "unique", "inputs": [1], "_output": 3, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"ty... |
train | factual | 7 | Is the suv moving fast when the purple minivan collides with it? | False | super_clevr_1 | 1 | 1 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [0], "value_inputs": ["suv"]}, {"type": "unique", "inputs": [1], "_output": 0, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type... |
train | factual | 8 | Is the airliner moving fast when the green object collides with the purple minivan? | True | super_clevr_1 | 1 | 2 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [2], "value_inputs": ["airliner"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {... |
train | factual | 9 | Is the purple minivan moving faster than a airliner after the collides with suv? | False | super_clevr_1 | 1 | 4 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [0, 1], "value_inputs": ["purple"]}, {"type": "filter_shape", "inputs": [1], "_output": [1], "value_inputs": ["minivan"]}, {"type": "unique", "inputs": [2], "_output": 1, "value_inputs": ... |
train | factual | 10 | Is the airliner moving faster than a brown chopper when the green object collides with a purple minivan? | True | super_clevr_1 | 1 | 5 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [2], "value_inputs": ["airliner"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {... |
train | factual | 11 | What is the color of the thing which collides with the green thing in the given frames? | purple | super_clevr_1 | 1 | 1 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [3], "value_inputs": ["green"]}, {"type": "unique", "inputs": [1], "_output": 3, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0, 1], "value_inputs": []}, {"ty... |
train | factual | 12 | What is the shape of the purple object which collides with the purple suv in the given frames? | minivan | super_clevr_1 | 1 | 2 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [0, 1], "value_inputs": ["purple"]}, {"type": "filter_shape", "inputs": [1], "_output": [0], "value_inputs": ["suv"]}, {"type": "unique", "inputs": [2], "_output": 0, "value_inputs": []},... |
train | factual | 13 | Is the red object moving fast at the beginning? | True | super_clevr_1 | 1 | 0 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["red"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "come_in_frame", "inputs": [2], "_output": 0, "value_inputs": []}, {"type": "q... |
train | factual | 14 | Is the red object moving faster than a chopper at the start of the frames? | True | super_clevr_1 | 1 | 1 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["red"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type... |
train | factual | 15 | Which direction is the green minivan moving towards at the beginning? | front | super_clevr_1 | 1 | 2 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [3], "value_inputs": ["green"]}, {"type": "filter_shape", "inputs": [1], "_output": [3], "value_inputs": ["minivan"]}, {"type": "unique", "inputs": [2], "_output": 3, "value_inputs": []},... |
train | factual | 16 | Is the cyan wagon moving fast after the collision of fighter and the double? | False | super_clevr_2 | 2 | 2 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1, 2], "value_inputs": ["cyan"]}, {"type": "filter_shape", "inputs": [1], "_output": [1], "value_inputs": ["wagon"]}, {"type": "unique", "inputs": [2], "_output": 1, "value_inputs": []},... |
train | factual | 17 | At the start, is the double's engine running? | True | super_clevr_2 | 2 | 0 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [0], "value_inputs": ["double"]}, {"type": "unique", "inputs": [1], "_output": 0, "value_inputs": []}, {"type": "query_engine", "inputs": [2], "_output": true, "value_inputs": []}] |
train | factual | 18 | Is the airliner floating when it first appeared? | True | super_clevr_2 | 2 | 1 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [2], "value_inputs": ["airliner"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "query_floating", "inputs": [2], "_output": true, "value_inputs": []}] |
train | factual | 19 | Dose the blue mountain collide with the brown object in the given frames? | True | super_clevr_2 | 2 | 0 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [4], "value_inputs": ["blue"]}, {"type": "filter_shape", "inputs": [1], "_output": [4], "value_inputs": ["mountain"]}, {"type": "unique", "inputs": [2], "_output": 4, "value_inputs": []},... |
train | factual | 20 | What is the color of the double which collides with the green object in the given frames? | brown | super_clevr_2 | 2 | 1 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [3], "value_inputs": ["green"]}, {"type": "unique", "inputs": [1], "_output": 3, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10... |
train | factual | 21 | What is the shape of the green thing which collides with the brown object in the given frames? | fighter | super_clevr_2 | 2 | 2 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [0], "value_inputs": ["brown"]}, {"type": "unique", "inputs": [1], "_output": 0, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10... |
train | factual | 22 | Is the green thing moving fast at the beginning? | False | super_clevr_2 | 2 | 0 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [3], "value_inputs": ["green"]}, {"type": "unique", "inputs": [1], "_output": 3, "value_inputs": []}, {"type": "come_in_frame", "inputs": [2], "_output": 0, "value_inputs": []}, {"type": ... |
train | factual | 23 | Is the blue object moving faster than a green object at the start of the frames? | True | super_clevr_2 | 2 | 1 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [4], "value_inputs": ["blue"]}, {"type": "unique", "inputs": [1], "_output": 4, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"typ... |
train | factual | 24 | Which direction is the blue object moving towards at the beginning? | left | super_clevr_2 | 2 | 2 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [4], "value_inputs": ["blue"]}, {"type": "unique", "inputs": [1], "_output": 4, "value_inputs": []}, {"type": "come_in_frame", "inputs": [2], "_output": 0, "value_inputs": []}, {"type": "... |
train | factual | 25 | Is the brown thing moving fast when the yellow object collides with the cyan thing? | True | super_clevr_3 | 3 | 2 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [0], "value_inputs": ["brown"]}, {"type": "unique", "inputs": [1], "_output": 0, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"ty... |
train | factual | 26 | Is there an increase in velocity for the cyan sedan at the beginning? | False | super_clevr_3 | 3 | 0 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["cyan"]}, {"type": "filter_shape", "inputs": [1], "_output": [2], "value_inputs": ["sedan"]}, {"type": "unique", "inputs": [2], "_output": 2, "value_inputs": []}, {"... |
train | factual | 27 | Dose the cyan object collide with the yellow sedan in the given frames? | True | super_clevr_3 | 3 | 0 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["cyan"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0, 1, 2], "value_inputs": []}, {"... |
train | factual | 28 | What is the color of the sedan which collides with the cyan sedan in the given frames? | yellow | super_clevr_3 | 3 | 1 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["cyan"]}, {"type": "filter_shape", "inputs": [1], "_output": [2], "value_inputs": ["sedan"]}, {"type": "unique", "inputs": [2], "_output": 2, "value_inputs": []}, {"... |
train | factual | 29 | What is the shape of the object which collides with the cyan object in the given frames? | sedan | super_clevr_3 | 3 | 2 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["cyan"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0, 1, 2], "value_inputs": []}, {"... |
train | factual | 30 | Is the double moving fast at the beginning? | True | super_clevr_3 | 3 | 0 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [0], "value_inputs": ["double"]}, {"type": "unique", "inputs": [1], "_output": 0, "value_inputs": []}, {"type": "come_in_frame", "inputs": [2], "_output": 0, "value_inputs": []}, {"type":... |
train | factual | 31 | Is the yellow sedan moving faster than a brown double at the start of the frames? | False | super_clevr_3 | 3 | 1 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1], "value_inputs": ["yellow"]}, {"type": "filter_shape", "inputs": [1], "_output": [1], "value_inputs": ["sedan"]}, {"type": "unique", "inputs": [2], "_output": 1, "value_inputs": []}, ... |
train | factual | 32 | Which direction is the double moving towards at the beginning? | left | super_clevr_3 | 3 | 2 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [0], "value_inputs": ["double"]}, {"type": "unique", "inputs": [1], "_output": 0, "value_inputs": []}, {"type": "come_in_frame", "inputs": [2], "_output": 0, "value_inputs": []}, {"type":... |
train | factual | 33 | Is the airliner floating when it first appeared? | False | super_clevr_4 | 4 | 1 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [1], "value_inputs": ["airliner"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "query_floating", "inputs": [2], "_output": false, "value_inputs": []}] |
train | factual | 34 | Is the gray mountain is accelerating at the beginning? | True | super_clevr_4 | 4 | 0 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1, 3], "value_inputs": ["gray"]}, {"type": "filter_shape", "inputs": [1], "_output": [3], "value_inputs": ["mountain"]}, {"type": "unique", "inputs": [2], "_output": 3, "value_inputs": []},... |
train | factual | 35 | Dose the gray airliner collide with the school in the given frames? | False | super_clevr_4 | 4 | 0 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1, 3], "value_inputs": ["gray"]}, {"type": "filter_shape", "inputs": [1], "_output": [1], "value_inputs": ["airliner"]}, {"type": "unique", "inputs": [2], "_output": 1, "value_inputs": []},... |
train | factual | 36 | Is the airliner moving fast at the beginning? | True | super_clevr_4 | 4 | 0 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [1], "value_inputs": ["airliner"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "come_in_frame", "inputs": [2], "_output": 0, "value_inputs": []}, {"type": ... |
train | factual | 37 | Is the yellow object moving faster than a airliner at the start of the frames? | False | super_clevr_4 | 4 | 1 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [0], "value_inputs": ["yellow"]}, {"type": "unique", "inputs": [1], "_output": 0, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": ... |
train | factual | 38 | Which direction is the green mountain moving towards at the beginning? | front | super_clevr_4 | 4 | 2 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["green"]}, {"type": "filter_shape", "inputs": [1], "_output": [2], "value_inputs": ["mountain"]}, {"type": "unique", "inputs": [2], "_output": 2, "value_inputs": []}, {... |
train | factual | 39 | Is the blue jet floating at the beginning? | True | super_clevr_5 | 5 | 1 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1, 2], "value_inputs": ["blue"]}, {"type": "filter_shape", "inputs": [1], "_output": [2], "value_inputs": ["jet"]}, {"type": "unique", "inputs": [2], "_output": 2, "value_inputs": []}, {"ty... |
train | factual | 40 | Is the blue wagon is accelerating at the beginning? | False | super_clevr_5 | 5 | 0 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1, 2], "value_inputs": ["blue"]}, {"type": "filter_shape", "inputs": [1], "_output": [1], "value_inputs": ["wagon"]}, {"type": "unique", "inputs": [2], "_output": 1, "value_inputs": []}, {"... |
train | factual | 41 | Dose the wagon collide with the green cruiser in the given frames? | False | super_clevr_5 | 5 | 0 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [1], "value_inputs": ["wagon"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0, 1], "value_inputs": []}, {"type"... |
train | factual | 42 | Is the wagon moving fast at the beginning? | False | super_clevr_5 | 5 | 0 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [1], "value_inputs": ["wagon"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "come_in_frame", "inputs": [2], "_output": 0, "value_inputs": []}, {"type": "qu... |
train | factual | 43 | Is the blue jet moving faster than a cruiser at the start of the frames? | True | super_clevr_5 | 5 | 1 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1, 2], "value_inputs": ["blue"]}, {"type": "filter_shape", "inputs": [1], "_output": [2], "value_inputs": ["jet"]}, {"type": "unique", "inputs": [2], "_output": 2, "value_inputs": []}, {"ty... |
train | factual | 44 | Which direction is the blue wagon moving towards at the beginning? | front | super_clevr_5 | 5 | 2 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1, 2], "value_inputs": ["blue"]}, {"type": "filter_shape", "inputs": [1], "_output": [1], "value_inputs": ["wagon"]}, {"type": "unique", "inputs": [2], "_output": 1, "value_inputs": []}, {"... |
train | factual | 45 | Is the purple truck moving faster than a purple double after it collides with gray jet? | True | super_clevr_6 | 6 | 4 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [0, 2], "value_inputs": ["purple"]}, {"type": "filter_shape", "inputs": [1], "_output": [2], "value_inputs": ["truck"]}, {"type": "unique", "inputs": [2], "_output": 2, "value_inputs": []}, ... |
train | factual | 46 | Is the yellow thing moving fast when the jet collides with the truck? | False | super_clevr_6 | 6 | 2 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1], "value_inputs": ["yellow"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": ... |
train | factual | 47 | What is the color of the truck which collides with the gray thing in the given frames? | purple | super_clevr_6 | 6 | 1 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [3], "value_inputs": ["gray"]}, {"type": "unique", "inputs": [1], "_output": 3, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0, 1, 2, 3, 4, 5, 6, 7, 8], "value_i... |
train | factual | 48 | What is the shape of the gray object which collides with the purple truck in the given frames? | jet | super_clevr_6 | 6 | 2 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [0, 2], "value_inputs": ["purple"]}, {"type": "filter_shape", "inputs": [1], "_output": [2], "value_inputs": ["truck"]}, {"type": "unique", "inputs": [2], "_output": 2, "value_inputs": []}, ... |
train | factual | 49 | Is the gray jet floating in the air from the start? | False | super_clevr_6 | 6 | 1 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [3], "value_inputs": ["gray"]}, {"type": "filter_shape", "inputs": [1], "_output": [3], "value_inputs": ["jet"]}, {"type": "unique", "inputs": [2], "_output": 3, "value_inputs": []}, {"type"... |
train | factual | 50 | Does the gray thing begin with its engine active? | True | super_clevr_6 | 6 | 0 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [3], "value_inputs": ["gray"]}, {"type": "unique", "inputs": [1], "_output": 3, "value_inputs": []}, {"type": "query_engine", "inputs": [2], "_output": true, "value_inputs": []}] |
train | factual | 51 | Dose the gray jet collide with the yellow thing in the given frames? | True | super_clevr_6 | 6 | 0 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [3], "value_inputs": ["gray"]}, {"type": "filter_shape", "inputs": [1], "_output": [3], "value_inputs": ["jet"]}, {"type": "unique", "inputs": [2], "_output": 3, "value_inputs": []}, {"type"... |
train | factual | 52 | Is the truck moving fast at the beginning? | True | super_clevr_6 | 6 | 0 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [2], "value_inputs": ["truck"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "come_in_frame", "inputs": [2], "_output": 0, "value_inputs": []}, {"type": "qu... |
train | factual | 53 | Is the purple truck moving faster than a purple double at the start of the frames? | True | super_clevr_6 | 6 | 1 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [0, 2], "value_inputs": ["purple"]}, {"type": "filter_shape", "inputs": [1], "_output": [2], "value_inputs": ["truck"]}, {"type": "unique", "inputs": [2], "_output": 2, "value_inputs": []}, ... |
train | factual | 54 | Which direction is the yellow object moving towards at the beginning? | left | super_clevr_6 | 6 | 2 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1], "value_inputs": ["yellow"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "come_in_frame", "inputs": [2], "_output": 0, "value_inputs": []}, {"type": "q... |
train | factual | 55 | Is the cyan thing moving faster than a purple thing when the green object collides with a scooter? | False | super_clevr_7 | 7 | 5 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [0], "value_inputs": ["cyan"]}, {"type": "unique", "inputs": [1], "_output": 0, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"typ... |
train | factual | 56 | Is the cyan object moving faster than a purple object when it collides with brown thing? | False | super_clevr_7 | 7 | 4 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [0], "value_inputs": ["cyan"]}, {"type": "unique", "inputs": [1], "_output": 0, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"typ... |
train | factual | 57 | Is the blue minivan moving fast when the cyan object collides with the scooter? | False | super_clevr_7 | 7 | 2 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1], "value_inputs": ["blue"]}, {"type": "filter_shape", "inputs": [1], "_output": [1], "value_inputs": ["minivan"]}, {"type": "unique", "inputs": [2], "_output": 1, "value_inputs": []}, ... |
train | factual | 58 | What is the color of the scooter which collides with the green thing in the given frames? | brown | super_clevr_7 | 7 | 1 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["green"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10... |
train | factual | 59 | What is the shape of the green thing which collides with the brown scooter in the given frames? | fighter | super_clevr_7 | 7 | 2 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [4], "value_inputs": ["brown"]}, {"type": "filter_shape", "inputs": [1], "_output": [4], "value_inputs": ["scooter"]}, {"type": "unique", "inputs": [2], "_output": 4, "value_inputs": []},... |
train | factual | 60 | Is the purple jet floating at the beginning? | True | super_clevr_7 | 7 | 1 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [3], "value_inputs": ["purple"]}, {"type": "filter_shape", "inputs": [1], "_output": [3], "value_inputs": ["jet"]}, {"type": "unique", "inputs": [2], "_output": 3, "value_inputs": []}, {"... |
train | factual | 61 | Is there an increase in velocity for the minivan at the beginning? | False | super_clevr_7 | 7 | 0 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [1], "value_inputs": ["minivan"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "query_engine", "inputs": [2], "_output": false, "value_inputs": []}] |
train | factual | 62 | Dose the blue thing collide with the fighter in the given frames? | False | super_clevr_7 | 7 | 0 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1], "value_inputs": ["blue"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10]... |
train | factual | 63 | Is the green fighter moving fast at the beginning? | False | super_clevr_7 | 7 | 0 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["green"]}, {"type": "filter_shape", "inputs": [1], "_output": [2], "value_inputs": ["fighter"]}, {"type": "unique", "inputs": [2], "_output": 2, "value_inputs": []},... |
train | factual | 64 | Is the blue minivan moving faster than a purple jet at the start of the frames? | False | super_clevr_7 | 7 | 1 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1], "value_inputs": ["blue"]}, {"type": "filter_shape", "inputs": [1], "_output": [1], "value_inputs": ["minivan"]}, {"type": "unique", "inputs": [2], "_output": 1, "value_inputs": []}, ... |
train | factual | 65 | What is the color of the minivan which collides with the yellow thing in the given frames? | blue | super_clevr_8 | 8 | 1 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [4], "value_inputs": ["yellow"]}, {"type": "unique", "inputs": [1], "_output": 4, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs":... |
train | factual | 66 | What is the shape of the blue thing which collides with the mountain in the given frames? | minivan | super_clevr_8 | 8 | 2 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [4], "value_inputs": ["mountain"]}, {"type": "unique", "inputs": [1], "_output": 4, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs... |
train | factual | 67 | Is the cyan object floating at the beginning? | False | super_clevr_8 | 8 | 1 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1], "value_inputs": ["cyan"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "query_floating", "inputs": [2], "_output": false, "value_inputs": []}] |
train | factual | 68 | Which direction is the minivan moving towards at the beginning? | front | super_clevr_8 | 8 | 2 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [0], "value_inputs": ["minivan"]}, {"type": "unique", "inputs": [1], "_output": 0, "value_inputs": []}, {"type": "come_in_frame", "inputs": [2], "_output": 0, "value_inputs": []}, {"type"... |
train | factual | 69 | Is there an increase in velocity for the red airliner at the beginning? | True | super_clevr_8 | 8 | 0 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["red"]}, {"type": "filter_shape", "inputs": [1], "_output": [2], "value_inputs": ["airliner"]}, {"type": "unique", "inputs": [2], "_output": 2, "value_inputs": []}, ... |
train | factual | 70 | Dose the fighter collide with the minivan in the given frames? | True | super_clevr_8 | 8 | 0 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [1], "value_inputs": ["fighter"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs"... |
train | factual | 71 | Is the fighter moving fast at the beginning? | False | super_clevr_8 | 8 | 0 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [1], "value_inputs": ["fighter"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "come_in_frame", "inputs": [2], "_output": 76, "value_inputs": []}, {"type... |
train | factual | 72 | What is the color of the tandem which collides with the fighter in the given frames? | brown | super_clevr_9 | 9 | 1 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [0], "value_inputs": ["fighter"]}, {"type": "unique", "inputs": [1], "_output": 0, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0], "value_inputs": []}, {"typ... |
train | factual | 73 | What is the shape of the brown object which collides with the green fighter in the given frames? | tandem | super_clevr_9 | 9 | 2 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [0], "value_inputs": ["green"]}, {"type": "filter_shape", "inputs": [1], "_output": [0], "value_inputs": ["fighter"]}, {"type": "unique", "inputs": [2], "_output": 0, "value_inputs": []},... |
train | factual | 74 | Which direction is the mountain moving towards at the beginning? | up | super_clevr_9 | 9 | 2 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [3], "value_inputs": ["mountain"]}, {"type": "unique", "inputs": [1], "_output": 3, "value_inputs": []}, {"type": "come_in_frame", "inputs": [2], "_output": 0, "value_inputs": []}, {"type... |
train | factual | 75 | Is the cyan object is accelerating at the beginning? | False | super_clevr_9 | 9 | 0 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [4], "value_inputs": ["cyan"]}, {"type": "unique", "inputs": [1], "_output": 4, "value_inputs": []}, {"type": "query_engine", "inputs": [2], "_output": false, "value_inputs": []}] |
train | factual | 76 | Dose the mountain collide with the green thing in the given frames? | False | super_clevr_9 | 9 | 0 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [3], "value_inputs": ["mountain"]}, {"type": "unique", "inputs": [1], "_output": 3, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0], "value_inputs": []}, {"ty... |
train | factual | 77 | Is the cyan mountain moving fast when it collides with the gray object? | True | super_clevr_10 | 10 | 0 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["cyan"]}, {"type": "filter_shape", "inputs": [1], "_output": [2], "value_inputs": ["mountain"]}, {"type": "unique", "inputs": [2], "_output": 2, "value_inputs": []}, {"... |
train | factual | 78 | Is the cyan object moving fast when the dirtbike collides with it? | True | super_clevr_10 | 10 | 1 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["cyan"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "f... |
train | factual | 79 | Is the cyan object moving faster than a purple thing when it collides with gray dirtbike? | True | super_clevr_10 | 10 | 4 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["cyan"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "f... |
train | factual | 80 | Is the brown object floating when it first appeared? | True | super_clevr_10 | 10 | 1 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [0], "value_inputs": ["brown"]}, {"type": "unique", "inputs": [1], "_output": 0, "value_inputs": []}, {"type": "query_floating", "inputs": [2], "_output": true, "value_inputs": []}] |
train | factual | 81 | What is the color of the thing which collides with the mountain in the given frames? | gray | super_clevr_10 | 10 | 1 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [2], "value_inputs": ["mountain"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0], "value_inputs": []}, {"type"... |
train | factual | 82 | What is the shape of the thing which collides with the cyan object in the given frames? | dirtbike | super_clevr_10 | 10 | 2 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["cyan"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0], "value_inputs": []}, {"type": "f... |
train | factual | 83 | Is the cruiser moving faster than a mountain at the start of the frames? | False | super_clevr_10 | 10 | 1 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [1], "value_inputs": ["cruiser"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type":... |
train | factual | 84 | Which direction is the purple object moving towards at the beginning? | left | super_clevr_10 | 10 | 2 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1], "value_inputs": ["purple"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "come_in_frame", "inputs": [2], "_output": 18, "value_inputs": []}, {"type": "... |
train | factual | 85 | Is the gray dirtbike is accelerating at the beginning? | True | super_clevr_10 | 10 | 0 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [3], "value_inputs": ["gray"]}, {"type": "filter_shape", "inputs": [1], "_output": [3], "value_inputs": ["dirtbike"]}, {"type": "unique", "inputs": [2], "_output": 3, "value_inputs": []}, {"... |
train | factual | 86 | Dose the cyan object collide with the gray thing in the given frames? | True | super_clevr_10 | 10 | 0 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["cyan"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "collisionEvent", "inputs": [], "_output": [0], "value_inputs": []}, {"type": "f... |
train | factual | 87 | Is the blue object floating when it first appeared? | True | super_clevr_11 | 11 | 1 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [1], "value_inputs": ["blue"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "query_floating", "inputs": [2], "_output": true, "value_inputs": []}] |
train | factual | 88 | Is the scooter moving fast at the beginning? | True | super_clevr_11 | 11 | 0 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [2], "value_inputs": ["scooter"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "come_in_frame", "inputs": [2], "_output": 0, "value_inputs": []}, {"type": "... |
train | factual | 89 | Is the scooter moving faster than a tandem at the start of the frames? | True | super_clevr_11 | 11 | 1 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [2], "value_inputs": ["scooter"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type":... |
train | factual | 90 | Which direction is the brown object moving towards at the beginning? | back | super_clevr_11 | 11 | 2 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["brown"]}, {"type": "unique", "inputs": [1], "_output": 2, "value_inputs": []}, {"type": "come_in_frame", "inputs": [2], "_output": 0, "value_inputs": []}, {"type": "qu... |
train | factual | 91 | Is the tandem's engine is on at the beginning? | False | super_clevr_11 | 11 | 0 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [3], "value_inputs": ["tandem"]}, {"type": "unique", "inputs": [1], "_output": 3, "value_inputs": []}, {"type": "query_engine", "inputs": [2], "_output": false, "value_inputs": []}] |
train | factual | 92 | Dose the brown scooter collide with the green object in the given frames? | False | super_clevr_11 | 11 | 0 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [2], "value_inputs": ["brown"]}, {"type": "filter_shape", "inputs": [1], "_output": [2], "value_inputs": ["scooter"]}, {"type": "unique", "inputs": [2], "_output": 2, "value_inputs": []}, {"... |
train | factual | 93 | Is the purple airliner moving fast at the beginning? | True | super_clevr_12 | 12 | 0 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [3], "value_inputs": ["purple"]}, {"type": "filter_shape", "inputs": [1], "_output": [3], "value_inputs": ["airliner"]}, {"type": "unique", "inputs": [2], "_output": 3, "value_inputs": []... |
train | factual | 94 | Is the purple object moving faster than a green dirtbike at the start of the frames? | True | super_clevr_12 | 12 | 1 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [3], "value_inputs": ["purple"]}, {"type": "unique", "inputs": [1], "_output": 3, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"t... |
train | factual | 95 | Which direction is the purple airliner moving towards at the beginning? | back | super_clevr_12 | 12 | 2 | linear_velocity.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [3], "value_inputs": ["purple"]}, {"type": "filter_shape", "inputs": [1], "_output": [3], "value_inputs": ["airliner"]}, {"type": "unique", "inputs": [2], "_output": 3, "value_inputs": []... |
train | factual | 96 | Is the dirtbike is accelerating at the beginning? | True | super_clevr_12 | 12 | 0 | accelaration.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [1], "value_inputs": ["dirtbike"]}, {"type": "unique", "inputs": [1], "_output": 1, "value_inputs": []}, {"type": "query_engine", "inputs": [2], "_output": true, "value_inputs": []}] |
train | factual | 97 | Dose the red utility collide with the green dirtbike in the given frames? | True | super_clevr_12 | 12 | 0 | collison.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [4], "value_inputs": ["red"]}, {"type": "filter_shape", "inputs": [1], "_output": [4], "value_inputs": ["utility"]}, {"type": "unique", "inputs": [2], "_output": 4, "value_inputs": []}, {... |
train | factual | 98 | Is the articulated moving faster than a red double when they collide? | False | super_clevr_13 | 13 | 3 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_shape", "inputs": [0], "_output": [3], "value_inputs": ["articulated"]}, {"type": "unique", "inputs": [1], "_output": 3, "value_inputs": []}, {"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}... |
train | factual | 99 | Is the red jet moving fast when it collides with the red double? | True | super_clevr_13 | 13 | 0 | linear_velocity_collision.json | 20 | [{"type": "scene", "inputs": [], "_output": [0, 1, 2, 3, 4], "value_inputs": []}, {"type": "filter_color", "inputs": [0], "_output": [0, 4], "value_inputs": ["red"]}, {"type": "filter_shape", "inputs": [1], "_output": [4], "value_inputs": ["jet"]}, {"type": "unique", "inputs": [2], "_output": 4, "value_inputs": []}, {"... |
DynSuperCLEVR
Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering ICLR 2025
DynSuperCLEVR is a video question answering (VQA) benchmark for understanding 4D (3D + temporal) object dynamics and physical properties in dynamic scenes. Unlike most video QA datasets that test appearance and 2D motion, DynSuperCLEVR targets the physical concepts that require an explicit 4D scene understanding: velocity, acceleration, and collisions. It asks models not only to describe what happened, but to predict what will happen next and to reason about counterfactual "what-if" scenarios under altered physical conditions.
| π Project page | https://xingruiwang.github.io/projects/DynSuperCLEVR/ |
| π Paper (arXiv) | https://arxiv.org/abs/2406.00622 |
| π Paper (OpenReview) | https://openreview.net/pdf?id=6Vx28LSR7f |
| π§ Model code (NS-4DPhysics) | https://github.com/XingruiWang/NS-4DPhysics |
| π οΈ Dataset generation code | https://github.com/XingruiWang/DynSuperCLEVR |
| π Poster | https://iclr.cc/virtual/2025/poster/30879 |
Highlights
- Physics-grounded VQA. Questions are built around three physical concepts β velocity, acceleration, and collision β that cannot be answered from a single frame and require reasoning over object dynamics through time.
- Three reasoning types. Factual (what is happening), Predictive (what will happen next), and Counterfactual (what would happen under a different initial condition, e.g. "if the engine were turned on" / "if it were static").
- Full 4D annotations. Every scene ships with complete physical ground truth: per-frame 3D positions, linear & angular velocities, camera parameters (intrinsics
Kand extrinsicsR), instance segmentation, and collision events (contact normal, force, frame, image position). - Rendered with a physics engine. Scenes are generated by extending the Kubric pipeline (built on PyBullet) over the SuperCLEVR asset set, with added support for acceleration dynamics.
Dataset statistics
- 1,100 videos (rigid-body simulations), 120 rendered frames each (RGB, instance masks, segmentation).
- 12,752 questions across three reasoning types and two splits.
| Reasoning type | Train | Val | Total |
|---|---|---|---|
| Factual | 7,850 | 791 | 8,641 |
| Predictive | 2,750 | 274 | 3,024 |
| Counterfactual | 989 | 98 | 1,087 |
| Total | 11,589 | 1,163 | 12,752 |
Answers are a mix of binary (True/False), direction (left, right, front, back, up, down), and attribute (color / object) labels, following the CLEVR-style compositional program format.
Repository structure
DynSuperCLEVR/
βββ questions/
β βββ train/
β β βββ questions_physics_factual.json
β β βββ questions_physics_predictive.json
β β βββ questions_physics_counterfactual.json
β βββ val/
β βββ questions_physics_factual.json
β βββ questions_physics_predictive.json
β βββ questions_physics_counterfactual.json
βββ frames_upload_chunks/
β βββ frames_chunk_000.tar ... frames_chunk_023.tar # all video frames (~325 GB), tarred
βββ videos/frames/<scene>/ # sample uncompressed scenes
βββ rgba_00000.png ... rgba_00119.png # RGB frames
βββ segmentation_00000.png ... # semantic segmentation
βββ mask_00000.png ... # instance masks
βββ metadata.json # camera + per-instance 4D state
βββ events.json # collision events
Question schema
Each question file is a dict with a questions list; every entry contains:
| Field | Description |
|---|---|
question |
Natural-language question |
answer |
Ground-truth answer |
program |
Compositional functional program (CLEVR-style) used to derive the answer |
image_filename / image |
The scene this question refers to (e.g. super_clevr_0) |
image_index, question_index, question_family_index |
Indices |
template_filename |
Question template used to generate the question |
frame_length |
Number of frames the question is grounded over |
split |
train / val |
The Dataset Viewer (and the
questionsconfig loadable viadatasets) is served from flattened Parquet files underviewer/(train.parquet,val.parquet), which merge the three reasoning types and add areasoning_typecolumn. There, theprogramfield is stored as a JSON string for a stable schema; the canonical nested JSON lives underquestions/.
Per-scene physical ground truth (metadata.json)
cameraβ intrinsicsK, extrinsicsR, projection of 3D β image.instancesβ for each object: per-frame positions,velocities,angular_velocities, plus shape / color / material attributes.metadataβ scene-level config:gravity,frame_rate,num_frames,num_instances, background HDRI, resolution.argsβ full generation arguments (floor friction/restitution, camera mode, etc.).
Collision events (events.json)
collisionsβ list of contacts withcontact_normal,force,frame(sub-frame precise), andimage_position.
Usage
The frame data is distributed as tar chunks. Download and extract:
# Download the whole dataset
huggingface-cli download RyanWW/DynSuperCLEVR --repo-type dataset --local-dir ./DynSuperCLEVR
# Or just the questions (small) to get started
huggingface-cli download RyanWW/DynSuperCLEVR --repo-type dataset \
--include "questions/**" --local-dir ./DynSuperCLEVR
# Reassemble the frames
cd DynSuperCLEVR
for f in frames_upload_chunks/frames_chunk_*.tar; do tar -xf "$f"; done
import json
qs = json.load(open("DynSuperCLEVR/questions/val/questions_physics_factual.json"))["questions"]
print(len(qs), "questions")
print(qs[0]["question"], "->", qs[0]["answer"])
# e.g. "Which direction is the red cruiser moving towards at the beginning?" -> "back"
Example questions
| Type | Question | Answer |
|---|---|---|
| Factual | Is the red mountain moving fast at the beginning? | False |
| Factual | Which direction is the red cruiser moving towards at the beginning? | back |
| Predictive | Will the purple minivan collide with the suv next? | True |
| Counterfactual | Will the green fighter collide with the mountain if it turns the engine on? | True |
Method: NS-4DPhysics
The accompanying model, NS-4DPhysics, is a neural-symbolic approach that builds an explicit 4D scene representation and integrates physics priors:
- 3D scene parser β a neural mesh model that recovers object pose and state by render-and-compare.
- Question parser β translates language into a symbolic program.
- Program executor β runs the program over the 4D representation, using a PyBullet physics engine to roll out dynamics for predictive / counterfactual reasoning.
On DynSuperCLEVR, NS-4DPhysics reaches 82.64% average accuracy, substantially outperforming PO3D-VQA (62.93%), PLLaVA (59.24%), and GPT-4o with reasoning (56.06%), highlighting the gap between current video-language models and explicit physical scene understanding.
Citation
@inproceedings{wang2025compositional,
title = {Compositional 4D Dynamic Scenes Understanding with Physics Priors for Video Question Answering},
author = {Wang, Xingrui and Ma, Wufei and Wang, Angtian and Chen, Shuo and Kortylewski, Adam and Yuille, Alan},
booktitle = {International Conference on Learning Representations (ICLR)},
year = {2025},
eprint = {2406.00622},
archivePrefix = {arXiv},
primaryClass = {cs.CV},
url = {https://arxiv.org/abs/2406.00622}
}
License
Released under the Apache 2.0 license. DynSuperCLEVR builds on SuperCLEVR and the Kubric generation pipeline; please also respect their respective licenses.
Authors
Xingrui Wang, Wufei Ma, Angtian Wang (Johns Hopkins University); Shuo Chen (Tsinghua University); Adam Kortylewski (Max Planck Institute for Informatics / University of Freiburg); Alan Yuille (Johns Hopkins University).
- Downloads last month
- 222