-
Notifications
You must be signed in to change notification settings - Fork 1
Description
Hello,
Niels here from the open-source team at Hugging Face. I discovered your work on Arxiv and was wondering whether you would like to submit it to hf.co/papers to improve its discoverability.If you are one of the authors, you can submit it at https://huggingface.co/papers/submit.
The paper page lets people discuss about your paper and lets them find artifacts about it (your models, datasets or demo for instance), you can also claim
the paper as yours which will show up on your public profile at HF, add Github and project page URLs.
I saw in your GitHub repository (https://github.com/BCV-Uniandes/Cardium) that you are planning to release the CARDIUM dataset and pre-trained models in October 2025. This is fantastic news!
It'd be great to make these checkpoints and the CARDIUM dataset available on the 🤗 hub once they are ready, to improve their discoverability/visibility.
We can add tags so that people find them when filtering https://huggingface.co/models and https://huggingface.co/datasets.
Uploading models
See here for a guide: https://huggingface.co/docs/hub/models-uploading.
In this case, we could leverage the PyTorchModelHubMixin class which adds from_pretrained and push_to_hub to any custom nn.Module. Alternatively, one can leverages the hf_hub_download one-liner to download a checkpoint from the hub.
We encourage researchers to push each model checkpoint (e.g., separate repos for the image-based, tabular-based, and full multimodal models) to a separate model repository, so that things like download stats also work. We can then also link the checkpoints to the paper page. Given the CARDIUM model takes both image and tabular data as input for CHD detection, a relevant pipeline tag could be image-text-to-text.
Uploading dataset
Would be awesome to make the CARDIUM dataset (fetal ultrasound and echocardiographic images along with maternal clinical records) available on 🤗 , so that people can do:
from datasets import load_dataset
dataset = load_dataset("your-hf-org-or-username/your-dataset")See here for a guide: https://huggingface.co/docs/datasets/loading. The dataset viewer (https://huggingface.co/docs/hub/en/datasets-viewer) would also allow quick exploration of the tabular and image data. Relevant task categories for the CARDIUM dataset could include image-classification and text-classification to reflect its multimodal nature for a diagnostic task.
Let me know if you're interested/need any help regarding this once your artifacts are ready for release closer to October 2025!
Cheers,
Niels
ML Engineer @ HF 🤗