Scene graphs have proven to be highly effective for various scene understanding tasks due to their compact and explicit representation of relational information. However, current methods often overlook the critical importance of preserving symmetry when generating scene graphs from 3D point clouds, which can lead to reduced accuracy and robustness, particularly when dealing with noisy, multi-view data. This work, to the best of our knowledge, presents the first implementation of an Equivariant Scene Graph Neural Network (ESGNN) to generate semantic scene graphs from 3D point clouds, specifically for enhanced scene understanding. Furthermore, a significant limitation of prior methods is the absence of temporal modeling to capture time-dependent relationships among dynamically evolving entities within a scene. To address this gap, we introduce a novel temporal layer that leverages the symmetry-preserving properties of ESGNN to fuse scene graphs across multiple sequences into a unified global representation by an approximate graph-matching algorithm. Our combined architecture, termed the Temporal Equivariant Scene Graph Neural Network (TESGNN), not only surpasses existing state-of-the-art methods in scene estimation accuracy but also achieves faster convergence. Importantly, TESGNN is computationally efficient and straightforward to implement using existing frameworks, making it well-suited for real-time applications in robotics and computer vision. This approach paves the way for more robust and scalable solutions to complex multi-view scene understanding challenges.
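To make the symmetry-preserving idea concrete, below is a minimal, hypothetical sketch of an E(n)-equivariant message-passing layer in the style of EGNN (Satorras et al., 2021), the family of architectures this line of work builds on. It is not the authors' exact ESGNN layer; the class and parameter names (`EquivariantLayer`, `hidden_dim`, `edge_index`) are purely illustrative.

```python
import torch
import torch.nn as nn

class EquivariantLayer(nn.Module):
    """Minimal E(n)-equivariant message-passing layer (EGNN-style sketch)."""

    def __init__(self, hidden_dim: int):
        super().__init__()
        # Messages are built only from invariant inputs (node features, squared
        # distances), so feature updates are invariant to rigid transformations.
        self.edge_mlp = nn.Sequential(
            nn.Linear(2 * hidden_dim + 1, hidden_dim), nn.SiLU(),
            nn.Linear(hidden_dim, hidden_dim),
        )
        self.node_mlp = nn.Sequential(
            nn.Linear(2 * hidden_dim, hidden_dim), nn.SiLU(),
            nn.Linear(hidden_dim, hidden_dim),
        )
        # One scalar weight per edge for the coordinate update.
        self.coord_mlp = nn.Linear(hidden_dim, 1)

    def forward(self, h, x, edge_index):
        # h: [N, hidden_dim] node features, x: [N, 3] coordinates, edge_index: [2, E]
        src, dst = edge_index
        rel = x[dst] - x[src]                          # relative vectors, [E, 3]
        dist2 = (rel ** 2).sum(dim=-1, keepdim=True)   # invariant distances, [E, 1]
        m = self.edge_mlp(torch.cat([h[src], h[dst], dist2], dim=-1))
        # Coordinate update: weighted sum of relative vectors -> equivariant.
        x = x.index_add(0, dst, rel * self.coord_mlp(m) / max(h.size(0) - 1, 1))
        # Feature update: aggregate invariant messages at the destination node.
        agg = torch.zeros_like(h).index_add(0, dst, m)
        h = h + self.node_mlp(torch.cat([h, agg], dim=-1))
        return h, x
```

The key design choice illustrated here is that node features are updated only from rotation- and translation-invariant quantities, while coordinates are updated along relative vectors; this is what keeps the resulting scene-graph features stable under rigid transformations of the multi-view point-cloud input.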
Our codebase is adapted from 3D Semantic Scene Graph Estimations, a framework for developing 3D semantic scene graph estimation methods. The original repository includes five methods: IMP, VGFM, 3DSSG, SGFN, and MonoSSG. We built on this framework and added our new method, TESGNN, so you can compare it directly with the existing ones.
Please refer to the Preparation section of their README.md to install and set up the environment and to download the necessary datasets and libraries.
Train config: The framework provides several training configs, one for each method.
Monitoring: Before your first run, update the wandb account settings in configs/config_default.yaml by changing wandb.entity and wandb.project to your own. Alternatively, disable logging by passing --dry_run.
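For reference, the relevant part of configs/config_default.yaml should look roughly like the sketch below. Only the wandb.entity and wandb.project keys come from the instructions above; the surrounding structure and the values are placeholders, so check the actual file.

```yaml
# Hypothetical excerpt of configs/config_default.yaml -- only entity/project
# are the keys referenced above; the values below are placeholders.
wandb:
  entity: your-wandb-username   # replace with your wandb account or team
  project: your-project-name    # replace with your wandb project
```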
source Init.sh
### Train a single model
python main_esgnn.py --mode train --config /path/to/your/config/file
### Evaluate a single model
python main_esgnn.py --mode eval --config /path/to/your/config/file
For example, to train with the ESGNN config:
python main_esgnn.py --mode train --config ./configs/config_ESGNN_full_l20.yaml
You can then track the results on WandB. To compare against multiple results, also run training for the other methods.
This section is still being updated. In the meantime, please refer to the notebook for details of our model architecture and training process.
@article{pham2024tesgnntemporalequivariantscene,
  title={TESGNN: Temporal Equivariant Scene Graph Neural Networks for Efficient and Robust Multi-View 3D Scene Understanding},
  author={Quang P. M. Pham and Khoi T. N. Nguyen and Lan C. Ngo and Dezhen Song and Truong Do and Truong Son Hy},
  year={2024},
  eprint={2411.10509},
  archivePrefix={arXiv},
  primaryClass={cs.CV},
  url={https://arxiv.org/abs/2411.10509}
}

@article{pham2024esgnn,
  title={ESGNN: Towards Equivariant Scene Graph Neural Network for 3D Scene Understanding},
  author={Pham, Quang P. M. and Nguyen, Khoi T. N. and Ngo, Lan C. and Do, Truong and Hy, Truong Son},
  journal={arXiv preprint arXiv:2407.00609},
  year={2024}
}