Fashionpedia (FP) — Normalized Dataset for FEL

1. Dataset Overview

Fashionpedia is a COCO-style fashion understanding dataset that combines instance annotations (object/box/segmentation) with attributes and an explicit category–part–attribute ontology. Its key contribution is enabling part-level attribute localization (e.g., “the sleeve is striped”) within a unified benchmark for detection, segmentation, and attribute prediction.

Research Objective

Earlier fashion datasets typically emphasized either categories or attributes, but did not explicitly model the relationship between garment parts and attributes. Fashionpedia addresses this by providing a structured ontology and detailed part-aware annotations.

Key Components

ComponentDescription
Categories27 high-level clothing categories
Parts19 clothing part labels (e.g., collar, sleeve)
Attributes46 attribute tags (e.g., solid, floral, knit)
OntologyHierarchical links between category–part–attribute
MasksPixel-level segmentation masks for garments and parts
Attributes (part-level)Attributes annotated at the part level

The dataset is distributed as COCO-style JSON annotations and supports multi-task learning and ontology-aware evaluation.


2. Folder and File Structure

(1) Original Fashionpedia Structure (Input)

fashionpedia_root/
├── instances_attributes_train2020.json   (core)
├── instances_attributes_val2020.json     (core)
├── attributes_train2020.json             (optional)
├── attributes_val2020.json               (optional)
├── info_test2020.json                    (optional)
└── images/                               (optional)
    ├── train/
    ├── val/
    └── test/

(2) Normalized Output Structure (FEL v1.3)

The FEL extractor normalizes Fashionpedia into ontology (terms), observation (instances/labels), and audit (manifest/QC) layers.

Core ontology tables

Core entity / observation tables

Run artifacts


3. Role in FEL

Fashionpedia strengthens FEL’s InstanceItem layer: meaning is expressed at the instance level (garment/part) with category + attributes + spatial evidence.

Node Construction

Relationship Construction

ImageItem ─contains────────▶ InstanceItem
InstanceItem ─has_category──▶ CategoryTerm
InstanceItem ─has_attribute─▶ AttributeTerm
ImageItem ─has_geometry────▶ GeometrySummary
(Separate) ImageItem ─has_attribute──▶ AttributeTerm (image-level sparse; fp_attr_sparse.csv.gz)


4. Extracted Benchmarks

Fashionpedia extraction is driven by the COCO-style JSONs; there is no benchmark folder split like DF1. Instead, the normalized schema supports multiple tasks (detection/segmentation/attributes) from one consistent graph layer.

Most important PK design

COCO ann.id is not globally unique across JSONs (e.g., train vs val), so it is not safe as a primary key.

FEL v1.3 solution (deterministic, globally unique):

instance_uid = FP:inst/<split>/<src_file_rel>/<row_index>

This ensures 100% uniqueness via split + dataset-relative provenance + JSON row position.


5. Graph Structure Description

The Fashionpedia normalized graph is image-hub with strong instance-level semantics.

1) Central hub: ImagesIndex

2) Instance & attribute layer

3) Taxonomy layer

4) Derived layers

5) Typed registry + run artifacts


FEL Integrity Checks (v1.3 Patch Verification Summary)

Based on the finalized v1.3 extractor policy, the following checks define “artifact correctness” for FEL ingestion:

Expected QC outcomes under this policy:


Unused Data and Rationale


QC Verdict

Under the stated FEL v1.3 QC policy, a “submittable” run satisfies:

➡️ This indicates PK stability, referential integrity, ontology separation, and auditability are all satisfied for FEL ingestion.


Fashionpedia Normalized Graph (Interactive)

Click below to open the interactive graph in a new window:

Open Fashionpedia Graph Interactive Editor

Final Summary

➡️ Core FEL input for ontology-aware segmentation, part-level attribute localization, and instance-level reasoning across fashion datasets.