LABIC - Bioinformatics and Computational Intelligence Laboratory

Local Repository of Research Datasets


UTFPR-SBD3: Soft Biometrics Dataset for clothing segmentation


 

  1. Introduction

Soft biometrics is an emerging area of research, mainly due to its extensive applicability in people surveillance. It is related to human characteristics that can be used for people tracking and identification based on appearance, including physical, behavioral or adhered (such as clothing) features. Semantic segmentation of clothes is still a challenge for researchers because of the wide variety of clothing styles, layering, and shapes. Datasets available for the clothing segmentation task, such as CFPD and Fashionista, are quite small, they have several annotation errors (at the pixel level) and a high unbalanced class distribution. To overcome these problems, we propose a new benchmark, named UTFPR-SBD3, containing 4,500 images manually annotated within 18 classes, plus the background.

  1. Dataset Description
This dataset was constructed by combining crawled images from Chictopia.com and images from three existing public datasets: CCP (Clothing Co-Parsing dataset), CFPD (Colorful Fashion Parsing Dataset) and Fashionista dataset.

All images in the dataset are standardized at 400x600 pixels in RGB, and they were manually annotated using the JS Segment Annotator, a free web-based image annotation tool. Clothes were grouped into classes, as follows:



Labels
Classes # of instances
Bag 2650
Belt 1364
Coat 1551
Dress 1065
Eyewear 1349
Footwear 4448
Hair 4446
Headwear 782
Neckwear 383
Pants 1464
Rompes/Jumpsuit 107
Shirt 3014
Shorts 747
Skin 4500
Skirt 1231
Socks 435
Stocking 450
Sweater 566

 

 
Considering that all images in the dataset were annotated in a per-pixel basis, the plot below shows the proportion of classes and annotated pixels in the dataset.
 

Label statistics per pixel and number of instances in the UTFPR-SBD3 dataset.

 

  1. Link to the dataset and code:

  2. Code and data are available at this paperswithcode site

  1. Related Papers:

  2. If you use of the UTFPR-SDB3 dataset, please cite the following reference in any publications:

     

  1. Licence

  2. This annotated data is released under the Creative Commons Attribution-NonCommercial license 4.0.