Are you really into object recognition, but you are sick of looking at 2D boxes and 2D masks? Let's play with 3D shapes!
We build on Mask R-CNN and extend it to infer 3D meshes. Given an input image, we detect all objects, infer their 2D instance boxes and masks as well as their 3D object shapes, all end-to-end! Naturally, we call our approach Mesh R-CNN :D
Paper: https://arxiv.org/abs/1906.02739
Joint work with Jitendra Malik and Justin Johnson
P.S. This task is really hard!
Search
mask r-cnn paper 在 New, improved Detectron2 Mask R-CNN baselines 的美食出口停車場
The paper's highest-reported Mask R-CNN ResNet-50-FPN baseline is 47.2 Box AP and 41.8 Mask AP, which exceeds Detectron2's highest reported baseline of 41.0 ... ... <看更多>