-
Notifications
You must be signed in to change notification settings - Fork 7.1k
[Feature Request] Target Transforms for keypoints in Image #523
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Yes, this is useful will be included in torchvision in the next release. |
Hi
Function Padding Transforms Link |
Thanks! |
Great! Would have loved to contribute but have some major commitments right now! |
Hi, I would like to work on this if someone is not working. |
I think it might be better to hold on a bit on this for the moment. |
@fmassa I can help with building that representation as well. Please let me know if that is something I can contribute to |
@Ridhwanluthra thanks for the offer! |
* Update readme * Unet3d update (pytorch#1) * enable multi-gpu training * enable multi-gpu training * enable multi-gpu training * enable multi-gpu training * enable multi-gpu training * test grad acc * test grad acc * test grad acc * fix number of samples logging * test * Add divergence detection * Add divergence detection * Remove debug prints * Remove debug prints * Remove debug prints * Remove debug prints * Remove debug prints * Remove debug prints
Torchvision's functional API allows user to explicitly specify angles to rotate the image or points to crop the image. Thus one can apply sane augmentation on target masks or bounding boxes.
But in cases where we have the output as keypoints in the image, for example say human body joint coordinates in given image, if I rotate the image I'll have to find corresponding joint locations in image (via elementary geometry). I felt that this effort off applying corresponding keypoints in augmented image could be handled in pytorch backend and would be very helpful.
Do you think it's worth it to add in torchvision package?
This could come handy in object detection, human joint annotations and maybe even more place
The text was updated successfully, but these errors were encountered: