Upload two images of the same object from different viewpoints and get the estimated relative 6DoF pose.
Pipeline: SAM3 (segmentation) → DepthAnything3 (depth) → ConceptPose (semantic 3D registration)
Paper | Code
Pre-cached categories: ape, banana, basket, benchvise, bike, book, bottle, bowl, box, cam, camera, can, car, cat, clamp, cracker box, croc, cup, cutlery, driller...
Override auto-generated semantic parts. Leave empty to use defaults.
Required only for new categories not in the pre-cached list.