Projects | Object Detection With a Unified Label Space From Multiple Datasets

MEDIA ANALYTICS

PROJECTS

PEOPLE

PUBLICATIONS

PATENTS

Object Detection With a Unified Label Space From Multiple Datasets

In this study, our objective is to develop a unified object detector capable of predicting across a diverse range of label categories found in multiple datasets. The advantages of having such a versatile object detector are clear and substantial, as it enables us to select and merge application-specific categories from various pre-existing datasets. Nevertheless, straightforwardly combining these datasets is not feasible due to inconsistencies in the way objects are annotated. To tackle this challenge, we have devised a framework tailored to handle these incomplete annotations. Additionally, we have employed a specialized pseudo-labeling approach customized for our particular scenario.

Collaborators: Xiangyun Zhao, Gaurav Sharma, Yi-Hsuan Tsai, Manmohan Chandraker, Ying Wu

Object Detection With a Unified Label Space From Multiple Datasets

(a) We train a single object detector from multiple datasets with heterogeneous label spaces. In contrast to prior work, our model unifies the label spaces of all datasets. (b) Illustration of the ambiguity of background in object detection when training from multiple datasets with different label spaces. Here, only “person” is consistent wrt. both datasets but “car” and “face” are missing in the other one, respectively. Naïve combination of the datasets leads to wrong training signals.

Project Site

As described in the main paper, for the evaluation over the unified label space, new bounding box annotations are required. Specifically, after unifying the label space, certain datasets contain object categories that were not annotated originally. While the task we propose involves handling such missing annotations during training, we still need to evaluate the model. Thus, we collect annotations for the missing categories in all respective datasets as the validation/test sets. In order to reproduce the results reported in the paper, the new annotations for the VOC, COCO, SUN-RGBD, LISA-Signs datasets are released. In addition, we annotated the Widerface and KITTI datasets recently, and we also release them for future research.

An example from the LISA-Signs dataset. In the original annotations, only traffic signs are annotated. All other categories are newly annotated.

Object Detection With a Unified Label Space From Multiple Datasets Paper

Xiangyun Zhao¹, Samuel Schulter², Gaurav Sharma², Yi-Hsuan Tsai²,
Manmohan Chandraker^2;3, Ying Wu¹

¹Northwestern University ²NEC Labs America ³UC San Diego
In European Conference on Computer Vision (ECCV) 2020

Material Links

[Download Paper] [Download Supplementary Paper] [PDF] [Supp] [Bibtex]

Abstract

Given multiple datasets with different label spaces, the goal of this work is to train a single object detector predicting over the union of all the label spaces. The practical benefits of such an object detector are obvious and significant—application-relevant categories can be picked and merged form arbitrary existing datasets. However, naïve merging of datasets is not possible in this case, due to inconsistent object annotations. Consider an object category like faces that is annotated in one dataset, but is not annotated in another dataset, although the object itself appears in the latter’s images. Some categories, like face here, would thus be considered foreground in one dataset, but background in another. To address this challenge, we design a framework which works with such partial annotations, and we exploit a pseudo-labeling approach that we adapt for our specific case. We propose loss functions that carefully integrate partial but correct annotations with complementary but noisy pseudo labels. Evaluation in the proposed novel setting requires full annotation on the test set. We collect the required annotations1 and define a new challenging experimental setup for this task based on existing public datasets. We show improved performances compared to competitive baselines and appropriate adaptations of existing work.

Projects | Object Detection With a Unified Label Space From Multiple Datasets

Object Detection With a Unified Label Space From Multiple Datasets

Project Site

Object Detection With a Unified Label Space From Multiple Datasets Paper

Abstract

Short Summarization

Contact Us

About Us

Our Pages

Read Our Blog Posts