
Overview

Run MoGe on a single image to estimate depth and geometry. This node processes an input image through the MoGe model to generate a 3D point cloud, depth map, camera intrinsics, a mask, and surface normals.

Inputs

| Parameter | Description | Data Type | Required | Range |
| --- | --- | --- | --- | --- |
| moge_model | The MoGe model to use for inference. | MOGE_MODEL | Yes | N/A |
| image | The input image for depth and geometry estimation. | IMAGE | Yes | N/A |
| resolution_level | Controls the processing resolution. 0 is fastest, 9 provides the most detail. (default: 9) | INT | Yes | 0 to 9 |
| fov_x_degrees | Horizontal field of view of the source camera in degrees. Sets the focal length used to unproject the depth map into 3D. Set to 0.0 to automatically recover the field of view from the predicted points. (default: 0.0) | FLOAT | Yes | 0.0 to 170.0 |
| batch_size | Number of images processed per inference call. Lower this value if you run out of memory when processing long videos or large image sets. (default: 4) | INT | Yes | 1 to 64 |
| force_projection | (Advanced) Forces projection of the predicted points. (default: True) | BOOLEAN | Yes | True/False |
| apply_mask | When enabled, sets masked-out (sky or invalid) pixels to infinity in the points and depth outputs. This helps meshing tools ignore these areas. Disable to keep the raw predicted geometry everywhere; the mask is still returned separately. (default: True) | BOOLEAN | Yes | True/False |
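
To illustrate how fov_x_degrees and apply_mask relate to the geometry, the sketch below shows a standard pinhole-camera calculation. It is not the node's implementation: it assumes square pixels, a principal point at the image center, and focal lengths in pixels, and the helper names (focal_from_fov_x, unproject, mask_depth) are hypothetical. It covers only the case where an explicit, non-zero field of view is supplied; the automatic recovery used when fov_x_degrees is 0.0 is not shown.

```python
import math


def focal_from_fov_x(fov_x_degrees, width):
    """Pinhole focal length in pixels for a horizontal FOV (assumed model)."""
    if not 0.0 < fov_x_degrees < 180.0:
        raise ValueError("fov_x_degrees must be strictly between 0 and 180")
    return (width / 2.0) / math.tan(math.radians(fov_x_degrees) / 2.0)


def unproject(u, v, depth, fx, fy, cx, cy):
    """Back-project pixel (u, v) at the given depth to a camera-space point."""
    x = (u - cx) / fx * depth
    y = (v - cy) / fy * depth
    return (x, y, depth)


def mask_depth(depth_rows, mask_rows):
    """Replace depth with infinity wherever the mask is False (invalid pixel)."""
    return [
        [d if valid else math.inf for d, valid in zip(d_row, m_row)]
        for d_row, m_row in zip(depth_rows, mask_rows)
    ]


# Example: a 640x480 image with a 90-degree horizontal field of view.
fx = focal_from_fov_x(90.0, 640)
point = unproject(640, 240, 2.0, fx, fx, 320.0, 240.0)
masked = mask_depth([[1.0, 2.0]], [[True, False]])
```

A wider field of view gives a shorter focal length, so the same depth map unprojects into a point cloud that spreads further sideways. Infinite depth values give downstream meshing tools a simple way to detect and skip sky or invalid pixels.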

Outputs

| Output Name | Description | Data Type |
| --- | --- | --- |
| moge_geometry | A dictionary containing the estimated geometry. It includes the original image, and may contain points (3D point cloud), depth (depth map), intrinsics (camera intrinsics matrix), mask (mask identifying valid pixels), and normal (surface normals). | MOGE_GEOMETRY |
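
Because several entries of moge_geometry are optional, downstream code should check for them before use. The following is a minimal sketch that assumes the dictionary keys match the names listed above (points, depth, intrinsics, mask, normal); the helper available_outputs is hypothetical and not part of the node.

```python
# Keys the description lists as optional; the exact names are an assumption.
OPTIONAL_KEYS = ("points", "depth", "intrinsics", "mask", "normal")


def available_outputs(geometry):
    """Return the optional keys that are present and not None."""
    # Compare against None instead of relying on truthiness, since
    # array-like values do not have an unambiguous boolean value.
    return [key for key in OPTIONAL_KEYS if geometry.get(key) is not None]


# Example with placeholder values standing in for real tensors.
example = {"image": "img", "depth": [[1.0]], "mask": [[True]], "normal": None}
present = available_outputs(example)
```

Checking with `is not None` keeps the helper safe if the values are tensors or arrays.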

Source fingerprint (SHA-256): 5213b280513850eeef2e22ae723ebb015789109435e28ddd79f91f9a4b4a1e79