DeepFashion (DF1) — Normalized Dataset for FEL

1. Dataset Overview

DeepFashion (DF1) is a large-scale fashion image dataset designed for clothing recognition and retrieval research.
Its images reflect real-world conditions such as pose variation, garment deformation, occlusion, and varying lighting.

In this pipeline, the original DF1 dataset has been restructured into a normalized schema for use in the
FEL (Feature Evidence Layer or Knowledge-Graph-based Learning Layer).

Key Characteristics

➡️ The normalization converts DF1’s heterogeneous annotation formats into a standardized schema optimized for multimodal learning and graph-based analysis.


2. Folder and File Structure

(1) Original DF1 Structure (Input)

Evaluation Splits

Bounding Box Files

Landmark Files

Category / Attribute Files

Text Descriptions


(2) Normalized Output Structure

Each benchmark is converted into:

benchmarks/
  df1_inshop/normalized/
  df1_consumer2shop/normalized/
  df1_capb_coarse/normalized/
  df1_capb_fine/normalized/
  df1_landmark/normalized/
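
As a minimal sketch of how a consumer of this layout might locate the per-benchmark directories and detect missing ones: the root location, the underscore spelling of the directory names, and the helper names `normalized_dirs` and `missing_benchmarks` are assumptions made for illustration, not part of the pipeline itself.

```python
from pathlib import Path

# Assumed benchmark directory names (the underscore spelling is an assumption).
BENCHMARKS = [
    "df1_inshop",
    "df1_consumer2shop",
    "df1_capb_coarse",
    "df1_capb_fine",
    "df1_landmark",
]


def normalized_dirs(root):
    """Map each benchmark name to <root>/benchmarks/<name>/normalized."""
    base = Path(root) / "benchmarks"
    return {name: base / name / "normalized" for name in BENCHMARKS}


def missing_benchmarks(root):
    """Return the benchmark names whose normalized directory does not exist."""
    return [
        name
        for name, path in normalized_dirs(root).items()
        if not path.is_dir()
    ]
```

Calling `missing_benchmarks(root)` before loading any tables gives an early, explicit failure when a benchmark has not been converted yet.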

Core Data Tables

Ontology / Term Tables

Management & Validation


3. Role in FEL

Node Construction

Relationship Construction

Image ─has_bbox─────────▶ BBoxEvidence
Image ─has_landmark─────▶ LandmarkEvidence
Image ─has_category─────▶ CategoryTerm
Image ─has_attribute────▶ AttributeTerm
Image ─has_description──▶ Text
Image(query) ─matches───▶ Image(gallery)
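
The relationships above could be materialized as typed edges roughly as follows. This is a sketch under stated assumptions: the `Edge` tuple, the `edges_for_image` helper, the underscore spelling of the relation names, and the example node ids are all hypothetical and only illustrate the shape of the data.

```python
from collections import namedtuple

# Hypothetical edge record: source node id, relation name, target node id.
Edge = namedtuple("Edge", ["src", "rel", "dst"])

# Relation names taken from the diagram (underscore spelling assumed).
RELATIONS = (
    "has_bbox",
    "has_landmark",
    "has_category",
    "has_attribute",
    "has_description",
    "matches",
)


def edges_for_image(image_uid, targets):
    """Build edges for one image.

    `targets` maps a relation name to an iterable of target node ids.
    Unknown relation names are rejected so typos do not create new edge types.
    """
    unknown = set(targets) - set(RELATIONS)
    if unknown:
        raise ValueError(f"unknown relations: {sorted(unknown)}")
    edges = []
    for rel in RELATIONS:  # fixed order keeps the output deterministic
        for dst in targets.get(rel, ()):
            edges.append(Edge(image_uid, rel, dst))
    return edges
```

For example, `edges_for_image("img1", {"has_category": ["cat:dress"], "matches": ["img9"]})` yields one `has_category` edge followed by one `matches` edge, both starting at `img1`.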

Core Design Principles

Single Join Key
All tables are linked via image_uid.

Evidence Separation
Ontology terms, labels, and evidence are kept separate.

Provenance Preservation
src_file and row_index are stored so that each record can be traced back to its source row.
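
To illustrate how the single join key and the provenance fields work together, here is a small sketch. Only `image_uid`, `src_file`, and `row_index` come from the schema described above; the function name `join_on_image_uid`, the in-memory dict layout, and the example file name are assumptions.

```python
def join_on_image_uid(images, evidence_rows):
    """Attach evidence rows to images using image_uid as the only join key.

    images:        dict mapping image_uid -> image record
    evidence_rows: iterable of dicts with image_uid, src_file, row_index

    Returns (joined, orphans). `orphans` lists the (src_file, row_index)
    provenance of every row whose image_uid has no matching image, so the
    offending source line can be looked up directly.
    """
    joined = {
        uid: {"image": record, "evidence": []}
        for uid, record in images.items()
    }
    orphans = []
    for row in evidence_rows:
        target = joined.get(row["image_uid"])
        if target is None:
            orphans.append((row["src_file"], row["row_index"]))
        else:
            target["evidence"].append(row)
    return joined, orphans


# Usage with made-up records (file name and values are illustrative only).
images = {"u1": {"path": "a.jpg"}}
rows = [
    {"image_uid": "u1", "src_file": "bbox.txt", "row_index": 0},
    {"image_uid": "u2", "src_file": "bbox.txt", "row_index": 1},
]
joined, orphans = join_on_image_uid(images, rows)
```

Because every table carries the same key, the same function can be reused for bounding boxes, landmarks, and labels without table-specific join logic.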


4. Extracted Benchmarks

In-shop Retrieval

Consumer-to-Shop Retrieval

CAPB

Coarse:
- Images: 289,222
- Categories: 50
- Attributes: 1,000

Fine:
- Images: 20,000
- Attributes: 26

Landmark Benchmark

Excluded Data

The original image files are not included, because the FEL design is metadata-focused.


5. Graph Structure Description

Central Node

Images — anchor for all connections

Data Ingestion

ImageSources → Images (canonicalization)

Labels & Attributes

CategoryTerms → CategoryLabels → Images
AttrTerms → AttrSparse → Images
ColorTerms → ColorLabels → Images
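
The term → label → image chains above could be resolved along these lines. This is only a sketch: it assumes a term table can be read as a mapping from a term id to a name and a label table as (image_uid, term id) pairs, and the `term_id` column and the `resolve_labels` helper are hypothetical names.

```python
def resolve_labels(terms, labels):
    """Resolve label rows to term names, grouped by image.

    terms:  dict mapping term_id -> term name (e.g. from CategoryTerms)
    labels: iterable of (image_uid, term_id) pairs (e.g. from CategoryLabels)

    A label that points at an unknown term_id raises KeyError, which
    surfaces dangling references instead of silently dropping them.
    """
    by_image = {}
    for image_uid, term_id in labels:
        by_image.setdefault(image_uid, []).append(terms[term_id])
    return by_image


# Usage with made-up ids and names.
terms = {1: "Dress", 2: "Tee"}
labels = [("a", 1), ("b", 2), ("a", 2)]
resolved = resolve_labels(terms, labels)
```

The same shape would apply to the attribute and color chains if they are stored as id pairs, which is an assumption here.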

Evaluation Structure

Reproducibility

Manifest → artifact audit
QCSummary → pipeline validation
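
One way an artifact audit against a manifest might look is sketched below. The manifest format is not specified in this document, so the sketch assumes a simple mapping from a relative file path to an expected SHA-256 hex digest; the `audit` and `sha256_of` helpers are hypothetical.

```python
import hashlib
from pathlib import Path


def sha256_of(path):
    """Compute the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def audit(manifest, root):
    """Compare files under `root` against an assumed manifest mapping.

    manifest: dict mapping relative path -> expected SHA-256 hex digest
    Returns a dict mapping relative path -> "ok", "missing", or "mismatch".
    """
    report = {}
    for rel_path, expected in manifest.items():
        path = Path(root) / rel_path
        if not path.is_file():
            report[rel_path] = "missing"
        elif sha256_of(path) != expected:
            report[rel_path] = "mismatch"
        else:
            report[rel_path] = "ok"
    return report
```

A report with any status other than "ok" would indicate that the normalized artifacts no longer match what the pipeline originally produced.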



Final Summary

The normalized DF1 dataset serves as a core input for FEL-based clothing recognition, retrieval, and recommendation systems.