Fashionpedia (FP) — Normalized Dataset for FEL

1. Dataset Overview

Fashionpedia is a COCO-style fashion understanding dataset that combines instance annotations (object/box/segmentation) with attributes and an explicit category–part–attribute ontology. Its key contribution is enabling part-level attribute localization (e.g., “the sleeve is striped”) within a unified benchmark for detection, segmentation, and attribute prediction.

Research Objective

Earlier fashion datasets typically emphasized either categories or attributes, but did not explicitly model the relationship between garment parts and attributes. Fashionpedia addresses this by providing a structured ontology and detailed part-aware annotations.

Key Components

Component	Description
Categories	27 high-level clothing categories
Parts	19 clothing part labels (e.g., collar, sleeve)
Attributes	46 attribute tags (e.g., solid, floral, knit)
Ontology	Hierarchical links between category–part–attribute
Masks	Pixel-level segmentation masks for garments and parts
Attributes (part-level)	Attributes annotated at the part level

The dataset is distributed as COCO-style JSON annotations and supports multi-task learning and ontology-aware evaluation.

2. Folder and File Structure

(1) Original Fashionpedia Structure (Input)

fashionpedia_root/
├── instances_attributes_train2020.json   (core)
├── instances_attributes_val2020.json     (core)
├── attributes_train2020.json             (optional)
├── attributes_val2020.json               (optional)
├── info_test2020.json                    (optional)
└── images/                               (optional)
    ├── train/
    ├── val/
    └── test/

(2) Normalized Output Structure (FEL v1.3)

The FEL extractor normalizes Fashionpedia into ontology (terms), observation (instances/labels), and audit (manifest/QC) layers.

Core ontology tables

fp_terms_categories.csv (category + part terms; includes term_role)
fp_terms_attributes.csv (attribute terms)

Core entity / observation tables

fp_images_index.csv.gz (ImageItem hub keyed by image_uid)
fp_instances.csv.gz (InstanceItem core table)
fp_attr_sparse.csv.gz (image-level attribute sparse; separated to avoid confusion)
fp_image_categories.csv.gz and fp_image_categories_agg.csv.gz (derived category log + aggregation)
fp_geometry.csv.gz (derived geometry summary per image)
items.csv.gz (typed registry for emitted entities)

Run artifacts

manifest.csv / manifest.jsonl
qc_summary.json
report.md

3. Role in FEL

Fashionpedia strengthens FEL’s InstanceItem layer: meaning is expressed at the instance level (garment/part) with category + attributes + spatial evidence.

Node Construction

ImageItem: image_uid from fp_images_index.csv.gz
InstanceItem: instance_uid from fp_instances.csv.gz
CategoryTerm: fp_cat_id from fp_terms_categories.csv
AttributeTerm: fp_attr_id from fp_terms_attributes.csv
GeometrySummary: derived per image_uid from fp_geometry.csv.gz

Relationship Construction

ImageItem ─contains────────▶ InstanceItem
InstanceItem ─has_category──▶ CategoryTerm
InstanceItem ─has_attribute─▶ AttributeTerm
ImageItem ─has_geometry────▶ GeometrySummary
(Separate) ImageItem ─has_attribute──▶ AttributeTerm (image-level sparse; fp_attr_sparse.csv.gz)

4. Extracted Benchmarks

Fashionpedia extraction is driven by the COCO-style JSONs; there is no benchmark folder split like DF1. Instead, the normalized schema supports multiple tasks (detection/segmentation/attributes) from one consistent graph layer.

Most important PK design

COCO ann.id is not globally unique across JSONs (e.g., train vs val), so it is not safe as a primary key.

FEL v1.3 solution (deterministic, globally unique):

instance_uid = FP:inst/<split>/<src_file_rel>/<row_index>

This ensures 100% uniqueness via split + dataset-relative provenance + JSON row position.

5. Graph Structure Description

The Fashionpedia normalized graph is image-hub with strong instance-level semantics.

1) Central hub: ImagesIndex

fp_images_index.csv.gz is the hub keyed by image_uid.
All major observation tables join through this key.

2) Instance & attribute layer

Instances (fp_instances.csv.gz): instance_uid + bbox + seg summary + instance-level attributes
AttrSparse (fp_attr_sparse.csv.gz): image-level sparse attributes (kept separate)

3) Taxonomy layer

TermsCategories (fp_terms_categories.csv) lookup by fp_cat_id
TermsAttributes (fp_terms_attributes.csv) lookup by fp_attr_id

4) Derived layers

ImageCategories → ImageCategoriesAgg: category log and aggregation
Geometry (fp_geometry.csv.gz): derived geometry summary per image_uid

5) Typed registry + run artifacts

Items (items.csv.gz) is a lightweight typed registry abstraction.
Manifest / Report / QCSummary are run-level artifacts (audit/validation), typically shown as dashed conceptual edges.

FEL Integrity Checks (v1.3 Patch Verification Summary)

Based on the finalized v1.3 extractor policy, the following checks define “artifact correctness” for FEL ingestion:

Patch A — items referential integrity: instance entries in items.csv.gz must be a subset of emitted fp_instances.instance_uid.
Patch B — dataset-relative provenance: fp_instances.src_file is POSIX, dataset-relative, and is embedded into instance_uid.
NEW-C — PK/QC hardening: instance_uid duplicates are hard-fail; instance_id duplicates are allowed but counted/reported.

Expected QC outcomes under this policy:

instance_uid duplicate = 0 (hard-fail if > 0)
hard_fail = False when required entities exist and PK/integrity conditions hold

Unused Data and Rationale

Raw segmentation polygon contents: not fully replicated/decoded in FEL due to size and resolution dependence; FEL keeps summaries for joinability and audit.
instance_id as PK: not used because it is not globally unique; replaced by deterministic instance_uid.
Image pixels: FEL prioritizes meaning graphs and evidence links; pixels are consumed downstream by models.

QC Verdict

Under the stated FEL v1.3 QC policy, a “submittable” run satisfies:

instance_uid duplicate = 0
categories > 0, attributes > 0, images > 0, instances_emitted > 0
manifest generated
hard_fail = False

➡️ This indicates PK stability, referential integrity, ontology separation, and auditability are all satisfied for FEL ingestion.

Fashionpedia Normalized Graph (Interactive)

Click below to open the interactive graph in a new window:

Open Fashionpedia Graph Interactive Editor

Final Summary

Fashionpedia adds strong instance-centric semantics to FEL: image → instance → (category/attribute) with spatial evidence.
Ontology terms (categories/parts/attributes) are separated from observations (instances/labels) for consistent graph learning.
Deterministic instance_uid eliminates COCO ID collisions across splits and files.
Manifest + QC + report provide reproducibility and auditability at the run level.

➡️ Core FEL input for ontology-aware segmentation, part-level attribute localization, and instance-level reasoning across fashion datasets.