Existing API allows to do this already. Bounding box is simply a label + 4 keypoints. Bounding Boxes need to support different formats, such as COCO.