01 Dec 07:31

rusty1s

ca4e5f8

PyG 2.2.0: Accelerations and Scalability

We are excited to announce the release of PyG 2.2 🎉🎉🎉

Highlights
Breaking Changes
Deprecations
Features
Bugfixes
Full Changelog

PyG 2.2 is the culmination of work from 78 contributors who have worked on features and bug-fixes for a total of over 320 commits since torch-geometric==2.1.0.

Highlights

`pyg-lib` Integration

We are proud to release and integrate pyg-lib==0.1.0 into PyG, the first stable version of our new low-level Graph Neural Network library to drive all CPU and GPU acceleration needs of PyG (#5330, #5347, #5384, #5388).

You can install pyg-lib as described in our README.md:

pip install pyg-lib -f https://data.pyg.org/whl/torch-${TORCH}+${CUDA}.html

import pyg_lib

Once pyg-lib is installed, it will get automatically picked up by PyG, e.g., to accelerate neighborhood sampling routines or to accelerate heterogeneous GNN execution:

pyg-lib provides fast and optimized CPU routines to iteratively sample neighbors in homogeneous and heterogeneous graphs, and heavily improves upon the previously used neighborhood sampling techniques utilized in PyG.

pyg-lib provides efficient GPU-based routines to parallelize workloads in heterogeneous graphs across different node types and edge types. We achieve this by leveraging type-dependent transformations via NVIDIA CUTLASS integration, which is flexible to implement most heterogeneous GNNs with, and efficient, even for sparse edge types or a large number of different node types.

`GraphStore` and `FeatureStore` Abstractions

PyG 2.2 includes numerous primitives to easily integrate with simple paradigms for scalable graph machine learning, enabling users to train GNNs on graphs far larger than the size of their machine's available memory. It does so by introducing simple, easy-to-use, and extensible abstractions of a FeatureStore and a GraphStore that plug directly into existing familiar PyG interfaces (see here for the accompanying tutorial).

feature_store = CustomFeatureStore()
feature_store['paper', 'x', None] = ...  # Add paper features
feature_store['author', 'x', None] = ...  # Add author features

graph_store = CustomGraphStore()
graph_store['edge', 'coo'] = ...  # Add edges in "COO" format

# `CustomGraphSampler` knows how to sample on `CustomGraphStore`:
graph_sampler = CustomGraphSampler(
    graph_store=graph_store,
    num_neighbors=[10, 20],
    ...
)

from torch_geometric.loader import NodeLoader
loader = NodeLoader(
    data=(feature_store, graph_store),
    node_sampler=graph_sampler,
    batch_size=20,
    input_nodes='paper',
)

for batch in loader:
    pass

Data loading and sampling routines are refactored and decomposed into torch_geometric.loader and torch_geometric.sampler modules, respectively (#5563, #5820, #5456, #5457, #5312, #5365, #5402, #5404, #5418).

Optimized and Fused Aggregations

PyG 2.2 further accelerates scatter aggregations based on CPU/GPU and with/without backward computation paths (requires torch>=1.12.0 and torch-scatter>=2.1.0) (#5232, #5241, #5353, #5386, #5399, #6051, #6052).

We also optimized the usage of nn.aggr.MultiAggregation by fusing the computation of multiple aggregations together (see here for more details) (#6036, #6040).

Here are some benchmarking results on PyTorch 1.12 (summed over 1000 runs):

Aggregators	Vanilla	Fusion
`[sum, mean]`	0.3325s	0.1996s
`[sum, mean, min, max]`	0.7139s	0.5037s
`[sum, mean, var]`	0.6849s	0.3871s
`[sum, mean, var, std]`	1.0955s	0.3973s

Lastly, we have incorporated "fused" GNN operators via the dgNN package, starting with a FusedGATConv implementation (#5140).

Community Sprint: Type Hints and TorchScript Support

We are running regular community sprints to get our community more involved in building PyG. Whether you are just beginning to use graph learning or have been leveraging GNNs in research or production, the community sprints welcome members of all levels with different types of projects.

We had our first community sprint on 10/12 to fully-incorporate type hints and TorchScript support over the entire code base. The goal was to improve usability and cleanliness of our codebase. We had 20 contributors participating, contributing to 120 type hints within 2 weeks, adding around 2400 lines of code (#5842, #5603, #5659, #5664, #5665, #5666, #5667, #5668, #5669, #5673, #5675, #5673, #5678, #5682, #5683, #5684, #5685, #5687, #5688, #5695, #5699, #5701, #5702, #5703, #5706, #5707, #5710, #5714, #5715, #5716, #5722, #5724, #5725, #5726, #5729, #5730, #5731, #5732, [#5733](https://github.com/pyg-team/pyt...

Assets 2

17 Aug 10:32

rusty1s

2.1.0

07bf02f

PyG 2.1.0: Principled aggregations, link-level and temporal samplers, data pipe support, ...

We are excited to announce the release of PyG 2.1.0 🎉🎉🎉

Highlights
Breaking Changes
Deprecations
Features
Bugfixes
Full Changelog

PyG 2.1.0 is the culmination of work from over 60 contributors who have worked on features and bug-fixes for a total of over 320 commits since torch-geometric==2.0.4.

Highlights

Principled Aggregations

See here for the accompanying tutorial.

Aggregation functions play an important role in the message passing framework and the readout functions of Graph Neural Networks. Specifically, many works in the literature (Hamilton et al. (2017), Xu et al. (2018), Corso et al. (2020), Li et al. (2020), Tailor et al. (2021), Bartunov et al. (2022)) demonstrate that the choice of aggregation functions contributes significantly to the representational power and performance of the model.

To facilitate further experimentation and unify the concepts of aggregation within GNNs across both MessagePassing and global readouts, we have made the concept of Aggregation a first-class principle in PyG (#4379, #4522, #4687, #4721, #4731, #4762, #4749, #4779, #4863, #4864, #4865, #4866, #4872, #4927, #4934, #4935, #4957, #4973, #4973, #4986, #4995, #5000, #5021, #5034, #5036, #5039, #4522, #5033, #5085, #5097, #5099, #5104, #5113, #5130, #5098, #5191). As of now, PyG provides support for various aggregations — from simple ones (e.g., mean, max, sum), to advanced ones (e.g., median, var, std), learnable ones (e.g., SoftmaxAggregation, PowerMeanAggregation), and exotic ones (e.g., LSTMAggregation, SortAggregation, EquilibriumAggregation). Furthermore, multiple aggregations can be combined and stacked together:

from torch_geometric.nn import MessagePassing, SoftmaxAggregation

class MyConv(MessagePassing):
    def __init__(self, ...):
        # Combines a set of aggregations and concatenates their results.
        # The interface also supports automatic resolution.
        super().__init__(aggr=['mean', 'std', SoftmaxAggregation(learn=True)])

Link-level Neighbor Loader

We added a new LinkNeighborLoader class for training scalable GNNs that perform edge-level predictions on giant graphs (#4396, #4439, #4441, #4446, #4508, #4509, #4868). LinkNeighborLoader comes with automatic support for both homogeneous and heterogenous data, and supports link prediction via automatic negative sampling as well as edge-level classification and regression models:

from torch_geometric.loader import LinkNeighborLoader

loader = LinkNeighborLoader(
    data,
    num_neighbors=[30] * 2,  # Sample 30 neighbors for each node for 2 iterations
    batch_size=128,  # Use a batch size of 128 for sampling training links
    edge_label_index=data.edge_index,  # Use the entire graph for supervision
    negative_sampling_ratio=1.0,  # Sample negative edges
)

sampled_data = next(iter(loader))
print(sampled_data)
>>> Data(x=[1368, 1433], edge_index=[2, 3103], edge_label_index=[2, 256], edge_label=[256])

Neighborhood Sampling based on Temporal Constraints

Both NeighborLoader and LinkNeighborLoader now support temporal sampling via the time_attr argument (#4025, #4877, #4908, #5137, #5173). If set, temporal sampling will be used such that neighbors are guaranteed to fulfill temporal constraints, i.e. neighbors have an earlier timestamp than the center node:

from torch_geometric.loader import NeighborLoader

data['paper'].time = torch.arange(data['paper'].num_nodes)

loader = NeighborLoader(
    data,
    input_nodes='paper',
    time_attr='time',  # Only sample papers that appeared before the seed paper
    num_neighbors=[30] * 2,
    batch_size=128,
)

Note that this feature requires torch-sparse>=0.6.14.

Functional `DataPipes`

See here for the accompanying example.

PyG now fully supports data loading using the newly introduced concept of DataPipes in PyTorch for easily constructing flexible and performant data pipelines (#4302, #4345, #4349). PyG provides DataPipe support for batching multiple PyG data objects together and for applying any PyG transform:

datapipe = FileOpener(['SMILES_HIV.csv'])
datapipe = datapipe.parse_csv_as_dict()
datapipe = datapipe.parse_smiles(target_key='HIV_active')
datapipe = datapipe.in_memory_cache()  # Cache graph instances in-memory.
datapipe = datapipe.shuffle()
datapipe = datapipe.batch_graphs(batch_size=32)

datapipe = FileLister([root_dir], masks='*.off', recursive=True)
datapipe = datapipe.read_mesh()
datapipe = datapipe.in_memory_cache()  # Cache graph instances in-memory.
datapipe = datapipe.sample_points(1024)  # Use PyG transforms from here.
datapipe = datapipe.knn_graph(k=8)
datapipe = datapipe.shuffle()
datapipe = datapipe.batch_graphs(batch_size=32)

Breaking Changes

The torch_geometric.utils.metric package has been removed. We now recommend to use the torchmetrics packa...

Assets 2

12 Mar 16:43

rusty1s

2.0.4

97d5557

2.0.4

PyG 2.0.4 🎉

A new minor PyG version release, bringing PyTorch 1.11 support to PyG. It further includes a variety of new features and bugfixes:

Features

Added Quiver examples for multi-GU training using GraphSAGE (#4103), thanks to @eedalong and @luomai
nn.model.to_captum: Full integration of explainability methods provided by the Captum library (#3990, #4076), thanks to @RBendias
nn.conv.RGATConv: The relational graph attentional operator (#4031, #4110), thanks to @fork123aniket
nn.pool.DMoNPooling: The spectral modularity pooling operator (#4166, #4242), thanks to @fork123aniket
nn.*: Support for shape information in the documentation (#3739, #3889, #3893, #3946, #3981, #4009, #4120, #4158), thanks to @saiden89 and @arunppsg and @konstantinosKokos
loader.TemporalDataLoader: A dataloader to load a TemporalData object in mini-batches (#3985, #3988), thanks to @otaviocx
loader.ImbalancedSampler: A weighted random sampler that randomly samples elements according to class distribution (#4198)
transforms.VirtualNode: A transform that adds a virtual node to a graph (#4163)
transforms.LargestConnectedComponents: Selects the subgraph that corresponds to the largest connected components in the graph (#3949), thanks to @abojchevski
utils.homophily: Support for class-insensitive edge homophily (#3977, #4152), thanks to @hash-ir and @jinjh0123
utils.get_mesh_laplacian: Mesh Laplacian computation (#4187), thanks to @daniel-unyi-42

Datasets

Added a dataset cheatsheet to the documentation that collects import graph statistics across a variety of datasets supported in PyG (#3807, #3817) (please consider helping us filling its remaining content)
datasets.EllipticBitcoinDataset: A dataset of Bitcoin transactions (#3815), thanks to @shravankumar147

Minor Changes

nn.models.MLP: MLPs can now either be initialized via a list of channels or by specifying hidden_channels and num_layers (#3957)
nn.models.BasicGNN: Final Linear transformations are now always applied (except for jk=None) (#4042)
nn.conv.MessagePassing: Message passing modules that make use of edge_updater are now jittable (#3765), thanks to @Padarn
nn.conv.MessagePassing: (Official) support for min and mul aggregations (#4219)
nn.LightGCN: Initialize embeddings via xavier_uniform for better model performance (#4083), thanks to @nishithshowri006
nn.conv.ChebConv: Automatic eigenvalue approximation (#4106), thanks to @daniel-unyi-42
nn.conv.APPNP: Added support for optional edge_weight, (690a01d), thanks to @YueeXiang
nn.conv.GravNetConv: Support for torch.jit.script (#3885), thanks to @RobMcH
nn.pool.global_*_pool: The batch vector is now optional (#4161)
nn.to_hetero: Added a warning in case to_hetero is used on HeteroData metadata with unused destination node types (#3775)
nn.to_hetero: Support for nested modules (ea135bf)
nn.Sequential: Support for indexing (#3790)
nn.Sequential: Support for OrderedDict as input (#4075)
datasets.ZINC: Added an in-depth description of the task (#3832), thanks to @gasteigerjo
datasets.FakeDataset: Support for different feature distributions across different labels (#4065), thanks to @arunppsg
datasets.FakeDataset: Support for custom global attributes (#4074), thanks to @arunppsg
transforms.NormalizeFeatures: Features will no longer be transformed in-place (ada5b9a)
transforms.NormalizeFeatures: Support for negative feature values (6008e30)
utils.is_undirected: Improved efficiency (#3789)
utils.dropout_adj: Improved efficiency (#4059)
utils.contains_isolated_nodes: Improved efficiency (970de13)
utils.to_networkx: Support for to_undirected options (upper triangle vs. lower triangle) (#3901, #3948), thanks to @RemyLau
graphgym: Support for custom metrics and loggers (#3494), thanks to @RemyLau
graphgym.register: Register operations can now be used as class decorators (#3779, #3782)
Documentation: Added a few exercises at the end of documentation tutorials (#3780), thanks to @PabloAMC
Documentation: Added better installation instructions to CONTRIBUTUNG.md (#3803, #3991, #3995), thanks to @Cho-Geonwoo and @RBendias and @RodrigoVillatoro
Refactor: Clean-up dependencies (#3908, #4133, #4172), thanks to @adelizer
CI: Improved test runtimes (#4241)
CI: Additional linting check via yamllint (#3886)
CI: Additional linting check via isort (66b1780), thanks to @mananshah99
torch.package: Model packaging via torch.package (#3997)

Bugfixes

data.HeteroData: Fixed a bug in data.{attr_name}_dict in case data.{attr_name} does not exist (#3897)
data.Data: Fixed data.is_edge_attr in case data.num_edges == 1 (#3880)
data.Batch: Fixed a device mismatch bug in case a batch object was indexed that was created from GPU tensors (e6aa4c9, c549b3b)
*...

Contributors

Padarn, luomai, and 29 other contributors

Assets 2

22 Dec 06:49

rusty1s

2.0.3

d47d9cd

2.0.3

PyG 2.0.3 🎉

A new minor PyG version release, including a variety of new features and bugfixes:

Features

GLNN: Graph-less Neural Networks [Example] (#3572)
LINKX: Large Scale Learning on Non-Homophilous Graphs [Example] (#3654)
Added an example for heterogeneous link classification (#3350) - thanks to @anniekmyatt
HANConv: The Heterogenous Graph Attention operator [Example] (#3444, #3577, #3581) - thanks to @rishubhkhurana and @wsad1
LGConv and LightGCN: Simplifying and Powering Graph Convolution Network for Recommendation (#3685) - thanks to @LukasHaas and @KathyFeiyang
PyTorch Lightning DataModule wrappers for PyG+PL multi-GPU training/inference without replicating datasets across processes :
- torch_geometric.data.LightningDataset for multi-GPU training via PL on graph-level tasks [Example] (#3596, #3634)
- torch_geometric.data.LightningNodeData for multi-GPU training via PL on node-level tasks [Example] (#3613, #3634)
NeighborLoader: Added CUDA support leading to major runtime improvements [Example] (#3736)
MessagePassing: Added the edge_updater/edge_update interface for updating edge features (#3450) - thanks to @Padarn
GNNExplainer: Added an example that reproduces the official BA-Shapes experiment (#3386) - thanks to @RBendias
torch_geometric.graphgym: Support for heterogeneous graphs and lazy initialization (#3460) - thanks to @JiaxuanYou
MLP: Added a basic MLP implementation (#3553)
PointTransformer: Classification and segmentation examples (#3344) - thanks to @QuanticDisaster and @wsad1
ShaDowKHopSampler: Added an example (#3411) - thanks to @SubhajitDuttaChowdhury
Data.subgraph(...) implementation (#3521)

Datasets

HGBDataset benchmark suite (#3454)
MalNetTiny dataset (#3472) - thanks to @rampasek
OMDB: Organic Materials Database (#3506)
BAShapes: The BA-Shapes dataset (#3386) - thanks to @RBendias
PolBlogs and EmailEUCore datasets (#3534) - thanks to @AlexDuvalinho
StochasticBlockModel and RandomPartition graph datasets (#3586) - thanks to @dongkwan-kim
LINKXDataset: A subset of the non-homophilous benchmark datasets from LINKX
FakeDataset and FakeHeteroDataset for testing purposes (#3741) - thanks to @levulinh

Minor Changes

torch_geometric.nn.norm: Improved the runtimes of normalization layers - thanks to @johnpeterflynn
DataLoader and NeighborLoader: Output tensors are now written to shared memory to avoid an extra copy in case num_workers > 0 (#3401 and #3734) - thanks to @johnpeterflynn
GATv2Conv: Support for edge features (#3421) - thanks to @Kenneth-Schroeder
Batch.from_data_list: Runtime improvements
TransformerConv: Runtime and memory consumption improvements (#3392) - thanks to @wsad1
mean_iou: Added IoU computation via omitting NaNs (#3464) - thanks to @GericoVi
DataLoader: follow_batch and exclude_keys are now optional arguments
Improvements to the package metadata (#3445) - thanks to @cthoyt
Updated the quick start widget to support PyTorch 1.10 (#3474) - thanks to @kathyfan
NeighborLoader and HGTLoader: Removed the persistent_workers=True default
voxel_grid: The batch argument is now optional (#3533) - thanks to @QuanticDisaster
TransformerConv: JIT support (#3538) - thanks to @RobMcH
Lazy modules can now correctly be saved and loaded via state_dict() and load_state_dict() (#3651) - thanks to @shubham-gupta-iitr
from_networkx: Support for nx.MultiDiGraph (#3646) - thanks to @max-zipfl-fzi
GATv2Conv: Support for lazy initialization (#3678) - thanks to @richcmwang
torch_geometric.graphgym: register_* functions can now be used as decorators (#3684)
AddSelfLoops: Now supports the full argument set of torch_geometric.utils.add_self_loops (#3702) - thanks to @dongkwan-kim
Documentation: Added shape information to ...

Contributors

panisson, Padarn, and 36 other contributors

Assets 2

26 Oct 12:41

rusty1s

2.0.2

101ed2c

2.0.2

A new minor version release, including further bugfixes, official PyTorch 1.10 support, as well as additional features and operators:

Features

Added video tutorials and Colabs from the PyTorch Geometric Tutorial project (thanks to @AntonioLonga)
Added the GraphMultisetTransformer operator (thanks to @JinheonBaek)
Added the PointTransformerConv operator (thanks to @QuanticDisaster)
Added the HEATConv operator (thanks to @Xiaoyu006)
Added the PNA GNN model (thanks to @RBendias)
Added the AddMetaPaths transform, which will add additional edge types to a HeteroData object based on a list of metapaths (thanks to @wsad1)
Added the Data.to_heterogeneous method to allow for the conversion from Data to HeteroData objects
Added the AttributedGraphDataset, containing a variety of attributes graphs
Added the Airports datasets
Added the structured_negative_sampling_feasible method, which checks if structured_negative_sampling is feasible (thanks to @WuliangHuang)
GATConv can now make use of multi-dimensional edge features to compute attention scores (thanks to @dongkwan-kim)
RandomNodeSplit and RandomLinkSplit now support HeteroData as input
MessagePassing inference can now be sped up via the decomposed_layers argument (thanks to @ZhouAo-ZA)
negative_sampling and batched_negative_sampling now support negative sampling in bipartite graphs
HeteroConv now supports the inclusion of arbitrary node-level or edge-level information for the underlying MessagePassing operators
GNNExplainer now supports multiple node-level masks and explaining regression problems (thanks to @gregorkrz)

Minor Changes

Data.to_homogeneous will now add node_type information to the homogeneous Data object
GINEConv now allows to transform edge features automatically in case their dimensionalities do not match (thanks to @CaypoH)
OGB_MAG will now add node_year information to paper nodes
Entities datasets do now allow the processing of HeteroData objects via the hetero=True option
Batch objects can now be batched together to form super batches
Added heterogeneous graph support for Center, Constant and LinearTransformation transformations
HeteroConv now allows to return "stacked" embeddings
The batch vector of a Batch object will now be initialized on the GPU in case other attributes are held in GPU memory

Bugfixes

Fixed the num_neighbors argument of NeighborLoader in order to specify an edge-type specific number of neighbors
Fixed the collate policy of lists of integers/strings to return nested lists
Fixed the Delaunay transformation in case the face attribute is not present in the data
Fixed the TGNMemory module to only read from the latest update (thanks to @cwh104504)
Fixed the pickle.PicklingError when Batch objects are used in a torch.multiprocessing.manager.Queue() (thanks to @RasmusOrsoe)
Fixed an issue with _parent state changing after pickling of Data objects (thanks to @zepx)
Fixed the ToUndirected transformation in case the number of edges and nodes are equal (thanks to @lmkmkrcc)
Fixed the from_networkx routine in case node-level and edge-level features share the same names
Removed the num_nodes warning when creating PairData objects
Fixed the initialization of the GeneralMultiLayer module in GraphGym (thanks to @fjulian)
Fixed custom model registration in GraphGym
Fixed a clash in the run_dir naming of GraphGym (thanks to @fjulian)
Includes a fix to prevent a GraphGym crash in case ROC-score is undefined (thanks to @fjulian)
Fixed the Batch.from_data_list routine on dataset slices (thanks to @dtortorella)
Fixed the MetaPath2Vec model in case there exists isolated nodes
Fixed torch_geometric.utils.coalesce with CUDA tensors

Contributors

zepx, dtortorella, and 15 other contributors

Assets 2

16 Sep 07:22

rusty1s

2.0.1

82a3bab

2.0.1

PyG 2.0.1

This is a minor release, bringing some emergency fixes to PyG 2.0.

Bugfixes

Fixed a bug in loader.DataLoader that raised a PicklingError for num_workers > 0 (thanks to @r-echeveste, @arglog and @RishabhPandit-00)
Fixed a bug in the creation of data.Batch objects in case customized data.Data objects expect non-default arguments (thanks to @Emiyalzn)
Fixed a bug in which SparseTensor attributes could not be batched along single dimensions (thanks to @rubenwiersma)

Contributors

rubenwiersma, r-echeveste, and 3 other contributors

Assets 2

13 Sep 07:48

rusty1s

2.0.0

c5152ea

2.0.0

PyG 2.0 🎉 🎉 🎉

PyG (PyTorch Geometric) has been moved from my own personal account rusty1s to its own organization account pyg-team to emphasize the ongoing collaboration between TU Dortmund University, Stanford University and many great external contributors. With this, we are releasing PyG 2.0, a new major release that brings sophisticated heterogeneous graph support, GraphGym integration and many other exciting features to PyG.

If you encounter any bugs in this new release, please do not hesitate to create an issue.

Heterogeneous Graph Support

We finally provide full heterogeneous graph support in PyG 2.0. See here for the accompanying tutorial.

Highlights

Heterogeneous Graph Storage: Heterogeneous graphs can now be stored in their own dedicated data.HeteroData class (thanks to @yaoyaowd):

from torch_geometric.data import HeteroData

data = HeteroData()

# Create two node types "paper" and "author" holding a single feature matrix:
data['paper'].x = torch.randn(num_papers, num_paper_features)
data['author'].x = torch.randn(num_authors, num_authors_features)

# Create an edge type ("paper", "written_by", "author") holding its graph connectivity:
data['paper', 'written_by', 'author'].edge_index = ...  # [2, num_edges]

data.HeteroData behaves similar to a regular homgeneous data.Data object:

print(data['paper'].num_nodes)
print(data['paper', 'written_by', 'author'].num_edges)
data = data.to('cuda')

Heterogeneous Mini-Batch Loading: Heterogeneous graphs can be converted to mini-batches for many small and single giant graphs via the loader.DataLoader and loader.NeighborLoader loaders, respectively. These loaders can now handle both homogeneous and heterogeneous graphs:

from torch_geometric.loader import DataLoader

loader = DataLoader(heterogeneous_graph_dataset, batch_size=32, shuffle=True)

from torch_geometric.loader import NeighborLoader

loader = NeighborLoader(heterogeneous_graph, num_neighbors=[30, 30], batch_size=128,
                        input_nodes=('paper', data['paper'].train_mask), shuffle=True)

Heterogeneous Graph Neural Networks: Heterogeneous GNNs can now easily be created from homogeneous ones via nn.to_hetero and nn.to_hetero_with_bases. These processes take an existing GNN model and duplicate their message functions to account for different node and edge types:

from torch_geometric.nn import SAGEConv, to_hetero

class GNN(torch.nn.Module):
    def __init__(hidden_channels, out_channels):
        super().__init__()
        self.conv1 = SAGEConv((-1, -1), hidden_channels)
        self.conv2 = SAGEConv((-1, -1), out_channels)

    def forward(self, x, edge_index):
        x = self.conv1(x, edge_index).relu()
        x = self.conv2(x, edge_index)
        return x

model = GNN(hidden_channels=64, out_channels=dataset.num_classes)
model = to_hetero(model, data.metadata(), aggr='sum')

Additional Features

A heterogeneous graph tutorial describing all newly released features (thanks to @mrjel)
A variety of heterogeneous GNN examples
Support for lazy initialization of GNN operators by passing -1 to the in_channels argument (implemented via nn.dense.Linear).
This allows to avoid calculating and keeping track of input tensor sizes, simplyfing the creation of heterogeneous graph models with varying feature dimensionalities across different node and edge types. Lazy initialization is supported for all existing PyG operators (thanks to @yaoyaowd):
```
from torch_geometric.nn import GATConv

conv = GATConv(-1, 64)

# We can initialize the model’s parameters by calling it once:
conv(x, edge_index)
```
nn.conv.HeteroConv: A generic wrapper for computing graph convolution on heterogeneous graphs (thanks to @RexYing)
nn.conv.HGTConv: The heterogeneous graph transformer operator from the "Heterogeneous Graph Transformer" paper
loader.HGTLoader: The heterogeneous graph sampler from the "Heterogeneous Graph Transformer" paper for learning on large-scale heterogeneous graphs (thanks to @chantat)
Support for heterogeneous graph transformations in transforms.AddSelfLoops, transforms.ToSparseTensor, transforms.NormalizeFeatures and transforms.ToUndirected
New heterogeneous graph datasets: datasets.OGB_MAG, datasets.IMDB, datasets.DBLP and datasets.LastFM
Support for converting heterogeneous graphs to "typed" homogeneous ones via data.HeteroData.to_homogeneous (thanks to @yzhao062)
A tutorial on creating a data.HeteroData object from raw *.csv files (thanks to @yaoyaowd and @mrjel)
An example to scale heterogeneous graph models via PyTorch Lightning

Managing Experiments with GraphGym

GraphGym is now officially supported in PyG 2.0 via torch_geometric.graphgym. See here for the accompanying tutorial. Overall, GraphGym is a platform for designing and evaluating Graph Neural Networks from configuration files via a highly modularized pipeline (thanks to @JiaxuanYou):

GraphGym is the perfect place to start learning about standardized GNN implementation and evaluation
GraphGym provides a simple interface to try out thousands of GNN architectures in parallel to find the best design for your specific task
GraphGym lets you easily do hyper-parameter search and visualize what design choices are better

Breaking Changes

The datasets.AMiner dataset now returns a data.HeteroData object. See here for our updated MetaPath2Vec example on AMiner.
transforms.AddTrainValTestMask has been replaced in favour of [transforms.RandomNodeSplit](https://pytorch-geometric.readthedocs.io/en/latest/modules/transforms.html#torch...

Contributors

yaoyaowd, beneisner, and 31 other contributors

Assets 2

26 Jun 08:50

rusty1s

1.7.2

5455b52

1.7.2

Datasets

The GitHub Web and ML developer dataset (thanks to @benedekrozemberczki)
The FacebookPagePage dataset (thanks to @benedekrozemberczki)
The Twitch gamer datasets (thanks to @benedekrozemberczki)
The DeezerEurope dataset (thanks to @benedekrozemberczki)
The GemsecDeezer dataset (thanks to @benedekrozemberczki)
The LastFMAsia dataset (thanks to @benedekrozemberczki)
The WikipediaNetwork datasets does now allow usage of the raw dataset as introduced in Multi-scale Attributed Node Embedding (thanks to @benedekrozemberczki)

Bugfixes

Fixed an error in DeepGCNLayer in case no normalization layer is provided (thanks to @lukasfolle)
Fixed a bug in GNNExplainer which mixed the loss computation for graph-level and node-level predictions (thanks to @panisson and @wsad1)

Assets 2

17 Jun 08:19

rusty1s

1.7.1

d86de00

1.7.1

A minor release that brings PyTorch 1.9.0 and Python 3.9 support to PyTorch Geometric. In case you are in the process of updating to PyTorch 1.9.0, please re-install the external dependencies for PyTorch 1.9.0 as well (torch-scatter and torch-sparse).

Features

EGConv (thanks to @shyam196)
GATv2Conv (thanks to @shakedbr)
GraphNorm normalization layer
GNNExplainer now supports explaining graph-level predictions (thanks to @wsad1)
bro and gini regularization (thanks to @rhsimplex)
train_test_split_edges() and to_undirected() can now edge features (thanks to @saiden89 and @SherylHYX)
Datasets can now be accessed with np.ndarray as well (thanks to @josephenguehard)
dense_to_sparse can now handle batched adjacency matrices
numba is now an optional dependency

Datasets

The tree-structured fake news propagation UPFD dataset (thanks to @YingtongDou)
The large-scale AmazonProducts graph from the GraphSAINT paper
Added support for two more datasets in the SNAPDataset benchmark suite (thanks to @SherylHYX)

Issues

Fixed an issue in which SuperGATConv used all positive edges for computing the auxiliary loss (thanks to @anniekmyatt)
Fixed a bug in which MemPooling produced NaN gradients (thanks to @wsad1)
Fixed an issue in which the schnetpack package was required for training SchNet (thanks to @mshuaibii)
Modfied XConv to sample without replacement in case dilation > 1 (thanks to @mayur-ag)
GraphSAINTSampler can now be used in combination with PyTorch Lightning
Fixed a bug in HypergraphConv in case num_nodes > num_edges (thanks to @THinnerichs)

Assets 2

09 Apr 08:44

rusty1s

1.7.0

e6b8d64

1.7.0

Major Features

Temporal Graph Network and an example utilizing graph attention, (thanks to @emalgorithm)
CorrectAndSmooth and an example on ogbn-products
PyTorch Lightning support, see here for the accompanying examples (thanks to @tchaton)
Sequential API, see here for the accompanying example
FiLMConv and an example on PPI (thanks to @ldv1)
SuperGAT and an example on Cora (thanks to @dongkwan-kim)
MemPooling (thanks to @wsad1)
PANConv (thanks to @YuGuangWang)
DiffGroupNorm (thanks to @wsad1)
ResGatedGraphConv (thanks to @ldv1)
FAConv (thanks to @wsad1)
AttentiveFP model for molecular graph learning and an example on ESOL (thanks to @thegodone)
Shadow k-hop Sampler (currently requires torch-sparse from master)

Additional Features

Inductive Deep Graph Infomax example (thanks to @harrygcoppock)
WLConv and an example of the Weisfeiler-Lehman subtree kernel (thanks to @chrsmrrs)
LabelPropagation
AddTrainValTestMask transform for creating various splitting strategies (thanks to @dongkwan-kim)
homophily measurement (thanks to @ldv1)
to_cugraph conversion

Minor Changes

More memory-efficient implementation of GCN2Conv
Improved TransformerConv with the beta argument being input and message dependent (thanks to @ldv1)
NeighborSampler now works with SparseTensor and supports an additional transform argument
Batch.from_data_list now supports batching along a new dimension via returning None in Data.__cat_dim__, see here for the accompanying tutorial (thanks to @Linux-cpp-lisp)
MetaLayer is now "jittable"
Lazy loading of torch_geometric.nn and torch_geometric.datasets, leading to faster imports (thanks to @Linux-cpp-lisp)
GNNExplainer now supports various output formats of the underlying GNN model (thanks to @wsad1)

Datasets

JODIE datasets for temporal graph learning
WordNet18RR (thanks to @minhtriet)
Reddit2
MixHopSyntheticDataset (thanks to @ldv1)
NELL

Bugfixes

Fixed SparseAdam usage in examples/metapath2vec.py (thanks to @declanmillar)
Fixed from_networkx to support empty edge lists (thanks to @shakedbr)
Fixed a numerical issue in softmax
Fixed an issue in DenseGraphConv with aggr="max" (thanks to @quqixun)
Fixed the norm computation in GraphSAINTSampler (thanks to @austintwang)
Cartesian and LocalCartesian now compute Cartesian coordinates from target to source nodes (thanks to @ldv1)

Assets 2

Releases: pyg-team/pytorch_geometric

PyG 2.2.0: Accelerations and Scalability

Highlights

pyg-lib Integration

GraphStore and FeatureStore Abstractions

Optimized and Fused Aggregations

Community Sprint: Type Hints and TorchScript Support

Uh oh!

PyG 2.1.0: Principled aggregations, link-level and temporal samplers, data pipe support, ...

Highlights

Principled Aggregations

Link-level Neighbor Loader

Neighborhood Sampling based on Temporal Constraints

Functional DataPipes

Breaking Changes

Uh oh!

2.0.4

PyG 2.0.4 🎉

Features

Datasets

Minor Changes

Bugfixes

Contributors

Uh oh!

2.0.3

PyG 2.0.3 🎉

Features

Datasets

Minor Changes

Contributors

Uh oh!

2.0.2

Features

Minor Changes

Bugfixes

Contributors

Uh oh!

2.0.1

PyG 2.0.1

Bugfixes

Contributors

Uh oh!

2.0.0

PyG 2.0 🎉 🎉 🎉

Heterogeneous Graph Support

Highlights

Additional Features

Managing Experiments with GraphGym

Breaking Changes

Contributors

Uh oh!

1.7.2

Datasets

Bugfixes

Uh oh!

1.7.1

Features

Datasets

Issues

Uh oh!

1.7.0

Major Features

Additional Features

Minor Changes

Datasets

Bugfixes

Uh oh!

`pyg-lib` Integration

`GraphStore` and `FeatureStore` Abstractions

Functional `DataPipes`