OpenCV camera to PyTorch3D PerspectiveCameras #522

Closed
pengsongyou opened this issue Jan 18, 2021 · 15 comments
Labels
enhancement New feature or request

Comments

@pengsongyou

pengsongyou commented Jan 18, 2021

Dear PyTorch3D team,

First of all, thanks so much for releasing this amazing library!

I have some camera intrinsic and extrinsic parameters from OpenCV, and I am trying to convert them to PyTorch3D PerspectiveCameras. I have been carefully following this amazing page. However, the pixels computed in PyTorch3D's screen coordinate system are always incorrect. I provide my code snippet below:

# Given a projection matrix, obtain K, R, t
K, R, t = cv2.decomposeProjectionMatrix(P)[:3]
K = K / K[2, 2]
t = t[:3] / t[3]

# NOTE: I have verified that p_camera = K @ (R @ p_world - R @ t)
# gives p_world in the camera coordinate system, and that
# p_pix = p_camera[:2] / p_camera[2] gives the pixels in the screen coordinate system, in [0, W-1] x [0, H-1]

pose = np.eye(4, dtype=np.float32)
pose[:3, :3] = R
pose[:3, 3] = (-R @ t).squeeze()  # t from decomposeProjectionMatrix has shape (3, 1)

T1 = torch.tensor([[-1, 0, 0, 0], [0, -1, 0, 0], [0, 0, -1, 0], [0, 0, 0, 1]],
           dtype=torch.float32) # assume OpenCV is X-right, Y-down, Z-in
T2 = torch.tensor([[-1, 0, 0, 0], [0, -1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]],
           dtype=torch.float32) # assume OpenCV is X-right, Y-down, Z-out

# transform the pose from OpenCV to PyTorch3D (X-left, Y-up, Z-out)
T = T1 # or T2
pose = (T @ torch.tensor(pose, dtype=torch.float32) @ T)
R = pose[:3, :3].unsqueeze(0)
t = pose[:3, 3].unsqueeze(0)

# build the focal length and principal point from K
focal = torch.tensor((K[0, 0], K[1, 1]), dtype=torch.float32).unsqueeze(0)
principal = torch.tensor((K[0, 2], K[1, 2]), dtype=torch.float32).unsqueeze(0)

img_size = (rgb.shape[1], rgb.shape[0]) # (Width, Height)

camera = PerspectiveCameras(R=R, T=t, focal_length=focal, principal_point=principal, image_size=(img_size,))
p_pix_p3d = camera.transform_points_screen(p_world.float(), (img_size,))
p_pix_p3d = p_pix_p3d[..., :2]

In my case, p_pix_p3d is always different from the ground-truth pixels p_pix, no matter whether I use T1 or T2 as the transformation matrix. I am wondering if someone could kindly guide me on this? Thanks so much in advance for the help!

Best,
Songyou

@pengsongyou
Author

pengsongyou commented Jan 19, 2021

Hi,

I figured out the solution myself after being stuck here for quite some time :) I post my answer below.

First, the OpenCV coordinate system is X-right, Y-down, Z-out, while PyTorch3D is X-left, Y-up, Z-out, so we need to flip the X and Y axes. However, instead of what I was doing above (which I still did not get to work), one can simply pass a negative focal length to PerspectiveCameras:

Here I provide an example to help you understand:

# Assume we have the following parameters from OpenCV
fx fy # focal length in x and y axes
px py # principal point in x and y axes
R t # rotation matrix and translation vector

# First, (X, Y, Z) = R @ p_world + t, where p_world is a 3D coordinate in the world system.
# To go from a coordinate (X, Y, Z) in the view system to screen space, the perspective camera model
# applies the following transformation, giving screen-space coordinates in the ranges [0, W-1] and [0, H-1]
x_screen = fx * X / Z + px
y_screen = fy * Y / Z + py

# In PyTorch3D, we first need to build the inputs that define the camera. Note that we use batch size N = 1
RR = torch.from_numpy(R).permute(1, 0).unsqueeze(0) # dim = (1, 3, 3)
tt = torch.from_numpy(t).permute(1, 0) # dim = (1, 3)
f = torch.tensor((fx, fy), dtype=torch.float32).unsqueeze(0) # dim = (1, 2)
p = torch.tensor((px, py), dtype=torch.float32).unsqueeze(0) # dim = (1, 2)
img_size = (W, H) # (width, height) of the image

# Now we can define the perspective camera model.
# NOTE: you should pass the NEGATIVE focal length as input!
camera = PerspectiveCameras(R=RR, T=tt, focal_length=-f, principal_point=p, image_size=(img_size,))

p_world = torch.tensor([X, Y, Z], dtype=torch.float32)[None, None] # dim = (1, 1, 3)
out_screen = camera.transform_points_screen(p_world, (img_size,))

out_screen[..., :2] should now correspond to (x_screen, y_screen). This verifies that we obtain a 1:1 mapping from OpenCV to PyTorch3D.
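To sanity-check this numerically, here is a minimal, self-contained sketch with identity extrinsics and illustrative values (the calling convention matches the PyTorch3D version used in this thread; the two printed results agree up to the (W - 1) / W factor derived below):

import torch
from pytorch3d.renderer import PerspectiveCameras

fx, fy, px, py = 500.0, 500.0, 320.0, 240.0
W, H = 640, 480
X, Y, Z = 0.1, -0.2, 2.0

camera = PerspectiveCameras(
    R=torch.eye(3)[None],                     # identity rotation
    T=torch.zeros(1, 3),                      # zero translation
    focal_length=-torch.tensor([[fx, fy]]),   # note the negative sign
    principal_point=torch.tensor([[px, py]]),
    image_size=((W, H),),
)
out = camera.transform_points_screen(torch.tensor([[[X, Y, Z]]]), ((W, H),))
print(out[0, 0, :2])                          # PyTorch3D screen coordinates
print(fx * X / Z + px, fy * Y / Z + py)       # OpenCV pinhole projection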

Proof for the negative focal length
Now we discuss why the negative focal length gives the correct result. First, at the bottom of this official page, we see how to go from view coordinates to NDC coordinates. Following the convention defined above (fx, fy, px, py are in screen space), we get

x_ndc = (fx * 2 / W) * X / Z - (px - W / 2) * 2 / W
y_ndc = (fy * 2 / H) * Y / Z - (py - H / 2) * 2 / H

Then, if you check the transform_points_screen function, the screen-space coordinates are:

x_screen = (W - 1) / 2 * (1 - x_ndc)
y_screen = (H - 1) / 2 * (1 - y_ndc)

Now if you substitute x_ndc and y_ndc, you will obtain:

x_screen = (-fx * (W - 1) / W) * X / Z + (W - 1) / W * px
y_screen = (-fy * (H - 1) / H) * Y / Z + (H - 1) / H * py

Proved.
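The substitution can also be checked symbolically, e.g. with sympy (a quick sketch, not part of the original derivation; the y case is identical with fy, py, H):

import sympy as sp

fx, px, X, Z, W = sp.symbols("fx px X Z W")
x_ndc = (fx * 2 / W) * (X / Z) - (px - W / 2) * 2 / W
x_screen = (W - 1) / 2 * (1 - x_ndc)
claimed = (-fx * (W - 1) / W) * (X / Z) + (W - 1) / W * px
print(sp.simplify(x_screen - claimed))  # prints 0, confirming the proof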

@nikhilaravi I am wondering why not directly incorporate the negative focal length, so people would not spend a very long time figuring all this out like I did.

Best,
Songyou

@nikhilaravi
Contributor

@pengsongyou thank you for providing your detailed solution on this issue to help others. We are considering providing helper functions for converting from different coordinate system conventions to PyTorch3D, as this is a common source of confusion. cc @davnov134 @gkioxari

@MengXinChengXuYuan

@nikhilaravi Hi, I would like to ask: do we have this conversion method now?

@MengXinChengXuYuan

MengXinChengXuYuan commented Mar 31, 2021

@pengsongyou Hi, I tried using -f; the rendered result is very close to the OpenCV camera's, but still differs a little. I don't know what could be wrong, any clue?
It seems that the rotation of the camera is not correct.
Thanks in advance!

[attached images: img_crop / test, comparing the OpenCV render with the PyTorch3D render]

@MengXinChengXuYuan

MengXinChengXuYuan commented Apr 1, 2021

Solved...
For anyone who hits the same problem I did: the -f conversion only works when the camera's R is the identity matrix.
If not, you can apply the rotation matrix to the points first and then set R = np.eye(3), t = np.zeros(3) (for the new camera).
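A minimal sketch of this workaround in NumPy (the extrinsics and points below are illustrative placeholders):

import numpy as np

# Illustrative OpenCV extrinsics and world points (replace with your own)
theta = np.pi / 6
R = np.array([[np.cos(theta), 0, np.sin(theta)],
              [0, 1, 0],
              [-np.sin(theta), 0, np.cos(theta)]], dtype=np.float32)
t = np.array([0.0, 0.0, 3.0], dtype=np.float32)
pts_world = np.random.rand(10, 3).astype(np.float32)

# Apply the extrinsics to the points up front: p_cam = R @ p_world + t
pts_cam = pts_world @ R.T + t

# Then build PerspectiveCameras with identity extrinsics and -f as above,
# and call transform_points_screen on pts_cam instead of pts_world.
R_new = np.eye(3, dtype=np.float32)
t_new = np.zeros(3, dtype=np.float32)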

@pengsongyou
Author

Solved...
For anyone who hits the same problem I did: the -f conversion only works when the camera's R is the identity matrix.
If not, you can apply the rotation matrix to the points first and then set R = np.eye(3), t = np.zeros(3) (for the new camera).

That is strange because I can input R and t directly. I guess @nikhilaravi could provide some insights here.

@MengXinChengXuYuan

@pengsongyou That's just so strange... I just spent hours examining all these parameters (using synthetic camera R/t and SMPL parameters), including the camera R, t, f, c, and the SMPL translation; when using -f, it works only when R is np.eye(3).

@nikhilaravi I think the camera conversion is really needed, because in many CV fields we really have to use OpenCV cameras :(

@sailor-z

sailor-z commented May 5, 2021

Hi, I'm trying to render images using some specific extrinsics, such as the ground truth provided on some public datasets, instead of

R, T = look_at_view_transform(distance, elevation, azimuth, up=((0, 0, 1),), device=device)

The rendering part is as follows:

cameras = PerspectiveCameras(focal_length=(focal_length,), principal_point=(principal_point,), image_size=(image_size,), device=device)

silhouette_renderer = MeshRenderer(
    rasterizer=MeshRasterizer(
        cameras=cameras,
        raster_settings=raster_settings
    ),
    shader=SoftSilhouetteShader(blend_params=blend_params)
)

silhouette = silhouette_renderer(meshes_world=mesh, R=R, T=T)

I got some strange rendered images using both f and -f. Does anyone know how to perform rendering with a specific extrinsic? @nikhilaravi could you please give me a clue?

@sailor-z

sailor-z commented May 5, 2021

Solved, using MengXinChengXuYuan's solution.

@classner
Contributor

Hi everyone!

We integrated a function to convert camera descriptions in 75432a0! Now you can just use the function pytorch3d.utils.cameras_from_opencv_projection for this purpose (pytorch3d.utils.opencv_from_cameras_projection does the inverse, and pytorch3d.utils.pulsar_from_opencv_projection and pytorch3d.utils.pulsar_from_cameras_projection do the same for the Pulsar representations).
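A minimal usage sketch with illustrative values (all tensors are batched with N = 1; note that image_size is given as (height, width)):

import torch
from pytorch3d.utils import cameras_from_opencv_projection

R = torch.eye(3)[None]                        # (1, 3, 3) OpenCV rotation
tvec = torch.tensor([[0.0, 0.0, 3.0]])        # (1, 3) OpenCV translation
camera_matrix = torch.tensor([[[500.0, 0.0, 320.0],
                               [0.0, 500.0, 240.0],
                               [0.0, 0.0, 1.0]]])  # (1, 3, 3) intrinsics
image_size = torch.tensor([[480, 640]])       # (1, 2) as (height, width)

cameras = cameras_from_opencv_projection(R, tvec, camera_matrix, image_size)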

Good luck with your projects!

@Yang-L1

Yang-L1 commented Mar 15, 2022

@pengsongyou Hi, how do you compute the focal length and principal point from an intrinsic matrix?

fx fy # focal length in x and y axes
px py # principal point in x and y axes

I directly use the raw OpenCV intrinsics, which does not work:

fx = 443.676
fy = 443.676
px = 256.000
py = 256.000
Thank you.

@Yang-L1

Yang-L1 commented Mar 15, 2022

I directly use the raw OpenCV intrinsics, which does not work.

Passing in_ndc=False to PerspectiveCameras fixed this.
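For reference, a minimal sketch of that fix with the intrinsics above (the 512 x 512 image size is an assumption): with in_ndc=False, the focal length and principal point are interpreted in pixel (screen) units.

import torch
from pytorch3d.renderer import PerspectiveCameras

cameras = PerspectiveCameras(
    focal_length=torch.tensor([[443.676, 443.676]]),  # (fx, fy) in pixels
    principal_point=torch.tensor([[256.0, 256.0]]),   # (px, py) in pixels
    image_size=torch.tensor([[512, 512]]),            # (height, width), assumed
    in_ndc=False,
)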

@3a1b2c3

3a1b2c3 commented Jan 3, 2023

Any chance to add OpenGL?

@krahets

krahets commented Nov 12, 2024

Any chance to add OpenGL?

You can easily convert an OpenCV pose matrix to an OpenGL pose matrix by flipping the Y-axis and Z-axis:

c2w = torch.eye(4)  # Camera-to-world (i.e., camera pose) in OpenCV coordinate system
c2w[0:3, 1:3] *= -1  # Convert to OpenGL coordinate system

@xiaoc57

xiaoc57 commented Nov 13, 2024

Hello everyone! I am more concerned with how to verify whether the converted camera is correct. The method I am using now is back-projecting a depth map into a point cloud, but I don't know whether this is the standard way.
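One possible cross-check (a sketch, not necessarily the standard way): project the same 3D points with cv2.projectPoints and with the converted camera, and compare the resulting pixel coordinates.

import cv2
import numpy as np
import torch
from pytorch3d.utils import cameras_from_opencv_projection

# Illustrative camera and points
R = np.eye(3)
tvec = np.array([0.0, 0.0, 3.0])
K = np.array([[500.0, 0.0, 320.0], [0.0, 500.0, 240.0], [0.0, 0.0, 1.0]])
pts = np.random.randn(10, 3) * 0.1

# OpenCV projection (zero distortion)
uv_cv, _ = cv2.projectPoints(pts, cv2.Rodrigues(R)[0], tvec, K, None)

# PyTorch3D projection through the converted camera
cam = cameras_from_opencv_projection(
    R=torch.from_numpy(R).float()[None],
    tvec=torch.from_numpy(tvec).float()[None],
    camera_matrix=torch.from_numpy(K).float()[None],
    image_size=torch.tensor([[480, 640]]),  # (height, width)
)
uv_p3d = cam.transform_points_screen(torch.from_numpy(pts).float()[None])[0, :, :2]

print(np.abs(uv_cv.squeeze(1) - uv_p3d.numpy()).max())  # should be close to 0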
