PhysGraph: A Physics-aware 3D Scene Graph
for Perception and Reasoning

Anonymous Submission
PhysGraph Teaser Image

Abstract

To perform a wide range of daily tasks, robots need to construct a 3D representation that is semantically rich, physically grounded, and structured enough to support task planning and affordance prediction. However, existing approaches primarily focus on semantic retrieval that often overlooks the physical and kinematic factors. Methods that attempt to model physical properties typically rely on narrow training sets or single-object modeling, limiting scalability and generalization across diverse object types.

To address these challenges, we present PhysGraph, a framework that unifies symbolic reasoning with structured 3D geometry to model kinematic and physical properties in cluttered scenes. Given RGB-D observations, PhysGraph reconstructs object-centric 3D geometry and associates object instances across views. It then decomposes objects into functional parts and infers materials and articulations through visual reasoning.

Evaluated on both synthetic and real-world datasets, PhysGraph achieves state-of-the-art results in semantic segmentation, multi-object mass estimation, and articulation prediction. With its simple yet effective design, PhysGraph produces physically consistent and semantically structured scene graphs, serving as a structured 3D representation for downstream tasks such as constraint-aware 3D affordance prediction and real-to-sim transfer, both of which are demonstrated in our experiments.

Method

PhysGraph Method Pipeline

Given RGB-D observations and camera poses, PhysGraph first performs object-centric perception to reconstruct and segment objects, extract object-level features, and associate instances across views. Selected key frames are then used for visual reasoning to estimate part-level physical properties, including articulation and material attributes. The resulting outputs are integrated into a hierarchical 3D scene graph that encodes object relationships together with part-level physical properties.

BibTeX

Coming soon.