-
Notifications
You must be signed in to change notification settings - Fork 3.9k
Add MoleculeGPT #9710
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add MoleculeGPT #9710
Conversation
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
…tric into moleculegpt/dataset
for more information, see https://pre-commit.ci
…tric into moleculegpt/dataset
…tric into moleculegpt/dataset
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
for more information, see https://pre-commit.ci
…tric into moleculegpt/dataset
|
@xnuohz Looks great. Can you do us a favor and split the PR into multiple? I would imagine we can merge dataset, model and example separately to ease reviewing. |
|
@xnuohz, notice CI fails because: |
|
@puririshi98 Fixed CI and test the unit test locally. |
|
LGTM |
|
will wait for @rusty1s and @akihironitta to review/merge |
merging since @rusty1s is busy until new year. cc @akihironitta |
puririshi98
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
### Issue - pyg-team#9694 - pyg-team#9698 ### Feature Summary - Add `MoleculeGPTDataset` - Add `MoleculeGPT` as GNN & LLM Co-training model to PyG - Add an example for training and testing - Split the PR into 3 sub-PRs (pyg-team#9723, pyg-team#9724, pyg-team#9725) - Limited hardware resources, can't load `lmsys/vicuna-7b-v1.5`, use `TinyLlama/TinyLlama-1.1B-Chat-v0.1` instead, and the full training pipeline was not tested --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Giovanni Gatti <[email protected]> Co-authored-by: Rishi Puri <[email protected]>

Issue
MoleculeGPT: Dataset+Model+Unit tests+Example #9698Feature Summary
MoleculeGPTDatasetMoleculeGPTas GNN & LLM Co-training model to PyGlmsys/vicuna-7b-v1.5, useTinyLlama/TinyLlama-1.1B-Chat-v0.1instead, and the full training pipeline was not tested