Conversation
@semihcanturk (Contributor) commented Mar 14, 2025

Following #9018, this PR provides a comprehensive example in `examples/gpse.py` that computes GPSE encodings and uses them for a graph regression task on the ZINC dataset. Two methods of computing GPSE encodings are demonstrated:

- Through `precompute_GPSE`: Given a PyG dataset, computes GPSE encodings in-place _once_ before training, without saving them to storage. Ideal if you want to compute the encodings only once per run (unlike a dataset transform) but do not want to save the pre-transformed dataset to storage (unlike a PyG pre-transform).

  To run with the default pretrained weights (`molpcba`):
  ```
  python examples/gpse.py --gpse
  ```
  To run with pretrained weights from any other dataset, provide the pretraining dataset name from the available options as an argument:
  ```
  python examples/gpse.py --gpse geom
  ```

- Through the `AddGPSE` transform: A PyG transform analogous to [AddLaplacianEigenvectorPE](https://pytorch-geometric.readthedocs.io/en/2.6.0/generated/torch_geometric.transforms.AddLaplacianEigenvectorPE.html#torch_geometric.transforms.AddLaplacianEigenvectorPE) and [AddRandomWalkPE](https://pytorch-geometric.readthedocs.io/en/2.6.0/generated/torch_geometric.transforms.AddRandomWalkPE.html#torch_geometric.transforms.AddRandomWalkPE), which can be used as a transform or pre-transform on a PyG dataset:
  ```
  python examples/gpse.py --gpse --as_transform
  ```

Using `AddGPSE` as a transform is not recommended, since recomputing the encodings for every batch in every epoch is quite inefficient; using it as a pre-transform or through `precompute_GPSE` is suggested instead. In either case, `torch_geometric.nn.GPSENodeEncoder` is then used to map the GPSE encodings to the desired dimension and append them to `batch.x`, preparing them as inputs to a GNN.
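To make that data flow concrete, here is a dependency-free sketch of what `GPSENodeEncoder` conceptually does: linearly map the precomputed encodings (stored as `batch.pestat_GPSE`) to a target dimension and concatenate them onto the node features. The real module is a `torch.nn.Module` operating on tensors; the plain-list arithmetic, dimensions, and weight values below are illustrative assumptions, not the actual PyG implementation.

```python
# Hedged, stdlib-only sketch of the GPSENodeEncoder idea: map precomputed
# GPSE encodings to a target dimension with a linear layer, then append the
# result to the existing node features. Plain lists stand in for tensors so
# the sketch runs without torch; all dimensions are illustrative.

def linear_map(rows, weight):
    """Multiply each row (list of floats) by a weight matrix (in_dim x out_dim)."""
    out_dim = len(weight[0])
    return [[sum(r[i] * weight[i][j] for i in range(len(r)))
             for j in range(out_dim)] for r in rows]

def append_encodings(x, pestat, weight):
    """Concatenate linearly mapped encodings onto node features, row by row."""
    mapped = linear_map(pestat, weight)
    return [xi + mi for xi, mi in zip(x, mapped)]

# Toy data: 2 nodes, 3-dim node features, 4-dim precomputed GPSE encodings,
# mapped down to 2 dims before concatenation.
x = [[1.0, 0.0, 2.0], [0.5, 1.0, 0.0]]
pestat_GPSE = [[1.0, 2.0, 0.0, 1.0], [0.0, 1.0, 1.0, 0.0]]
weight = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]  # 4 x 2

x_out = append_encodings(x, pestat_GPSE, weight)
# Each output row now has 3 + 2 = 5 entries.
```

In the actual example, the concatenated features then serve as the GNN input in place of the raw `batch.x`.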

This PR has been tested with the latest (25.01-py3) NVIDIA stack and works without any issues.

puririshi98 added a commit that referenced this pull request Apr 4, 2025
[Graph Positional and Structural
Encoder](https://arxiv.org/abs/2307.07107) implementation as per #8310.
Adapted from the original repository:
https://github.com/G-Taxonomy-Workgroup/GPSE. This version is a
standalone implementation that is decoupled from GraphGym, and thus aims
for better accessibility and a smoother integration into PyG. While the
priority of this PR is to enable loading and using pre-trained models in
plug-and-play fashion, it also includes the custom loss function used to
train the model. Nevertheless, it might be easier to use the original
repository for pre-training and fine-tuning new GPSE models for the time
being.

This PR includes the following:

- `GPSE`: The main GPSE module, which generates learned encodings for
input graphs.
- Several helper classes (`FeatureEncoder`, `GNNStackStage`,
`IdentityHead`, `GNNInductiveHybridMultiHead`,
`ResGatedGCNConvGraphGymLayer`, `Linear`, `MLP`, `GeneralMultiLayer`,
`GeneralLayer`, `BatchNorm1dNode`, `BatchNorm1dEdge`,
`VirtualNodePatchSingleton`) and wrapper functions (`GNNPreMP`,
`GNNLayer`), all adapted from their GraphGym versions for compatibility
and enabling the loading of weights pre-trained using the
GraphGym/original version.
- The class method `GPSE.from_pretrained()` that returns a model with
pre-trained weights from the original repository/Zenodo files.
- `GPSENodeEncoder`, a helper linear/MLP encoder that takes the GPSE
encodings precomputed as `batch.pestat_GPSE` in the input graphs, maps
them to a desired dimension, and appends them to node features.
- `precompute_GPSE`, a function that takes in a GPSE model and a
dataset, and precomputes GPSE encodings in-place for the given dataset
using the helper function `gpse_process_batch`.
- The transform `AddGPSE`, which, in similar fashion to
`AddLaplacianEigenvectorPE` and `AddRandomWalkPE`, adds the GPSE
encodings to a given graph using the helper function `gpse_process`.
- The testing modules `test/test_gpse.py` and `test/test_add_gpse.py`.
- The loss function `gpse_loss` and helper functions `cosim_col_sep` and
`process_batch_idx` used in GPSE training.
- A comprehensive example in `examples/gpse.py` is provided as a
separate PR in #10118.
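Since the PR ships `gpse_loss` together with the helper `cosim_col_sep`, a brief sketch of the "column-separated cosine similarity" idea that the helper's name suggests may be useful: compare predicted and target encoding matrices one encoding dimension (column) at a time. This is a hedged, stdlib-only illustration; the actual loss in PyG may differ in signature, masking, and reduction.

```python
import math

# Hedged illustration of a column-separated cosine similarity: score a
# predicted encoding matrix against a target matrix per column, i.e. per
# encoding dimension. Not the actual PyG/GPSE implementation.

def column(m, j):
    return [row[j] for row in m]

def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def cosim_per_column(pred, target):
    """Cosine similarity of each column of `pred` against `target`."""
    n_cols = len(pred[0])
    return [cosine(column(pred, j), column(target, j)) for j in range(n_cols)]

pred = [[1.0, 0.0], [0.0, 2.0]]
target = [[2.0, 0.0], [0.0, 1.0]]
sims = cosim_per_column(pred, target)
# Columns that point in the same direction score 1.0 regardless of scale.
```

A loss would then be built from these per-column similarities (e.g. by averaging `1 - sim` over columns), which rewards matching the direction of each encoding dimension rather than its magnitude.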

This PR has been tested with the latest (25.01-py3) NVIDIA stack and
works without any issues.

---------

Co-authored-by: Semih Cantürk <=>
Co-authored-by: rusty1s <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Rishi Puri <[email protected]>
Co-authored-by: Rishi Puri <[email protected]>
@puririshi98 (Contributor) left a comment:
LGTM, thanks!

@puririshi98 puririshi98 enabled auto-merge (squash) April 22, 2025 19:45
auto-merge was automatically disabled April 22, 2025 19:49 (invalid email address)

@puririshi98 puririshi98 merged commit 7e078e6 into pyg-team:master Apr 22, 2025
17 checks passed
chrisn-pik pushed a commit to chrisn-pik/pytorch_geometric that referenced this pull request Jun 30, 2025
chrisn-pik pushed a commit to chrisn-pik/pytorch_geometric that referenced this pull request Jun 30, 2025