Comprehensive Comparison of Major Fashion Resources

A structured, research-oriented overview of the DeepFashion family of datasets and related fashion resources.

1) Side-by-side Comparison

| Category | DeepFashion | DeepFashion2 | DeepFashion3D | MMFashion | Fashionpedia |
| --- | --- | --- | --- | --- | --- |
| Resource Type | Dataset + Model | Dataset + Benchmark | Dataset + Methodology | Platform / Toolkit | Ontology + Dataset |
| Year / Venue | 2016 / CVPR | 2019 / CVPR | 2020 / ECCV | 2020 / arXiv | 2020 / ECCV |
| Primary Goal | Large-scale 2D clothing recognition & retrieval | Real-world, multi-task fashion understanding | Single-image 3D garment reconstruction | Unified execution framework for fashion vision tasks | Structuring fashion knowledge via explicit ontology |
| Problem Motivation | Limited scale & labels in earlier datasets | Sparse landmarks & single-garment bias in DeepFashion | Severe lack of 3D garment data | Fragmented fashion AI codebases | Inconsistent definition of fashion concepts |
| Data Dimensionality | 2D images | 2D multi-instance scenes | 3D meshes + images | 2D (uses external datasets) | 2D pixel-level annotations |
| Dataset Scale | 800K+ images | 491K images / 801K garments | 2,078 3D garment models | No proprietary dataset | 48K+ images |
| Garment Categories | 50+ | 13 | ~10 garment types | Dataset-dependent | 27 |
| Core Annotations | Category, attributes, sparse landmarks | BBox, mask, dense landmarks (39), pose, pairing | Mesh, UV, camera, pose, feature lines | Models, configs, pipelines | Instance, part, attribute, mask |
| Landmark Concept | Sparse 2D keypoints | Dense pose-aware landmarks | 3D structural feature lines | Task-dependent module | Encoded via part boundaries |
| Pose Information | - | Clothing pose | Body–garment pose (SMPL) | Task-dependent | - |
| Segmentation | - | Instance mask | Implicit via mesh surface | Supported | Part-level masks |
| Attributes | Image-level | Image-level | Texture (not semantic) | Supported | Part-localized |
| Ontology Explicitness | - | - | - | - | ✅ Core contribution |
| Main Tasks | Classification, retrieval, attribute prediction | Detection, pose, segmentation, Re-ID | 3D reconstruction, registration, texture recovery | Attribute, retrieval, parsing, compatibility | Detection, segmentation, attribute localization |
| Proposed Model | FashionNet | Match R-CNN baseline | Hybrid mesh + implicit surface | Backbone–Head modular framework | Mask R-CNN baseline |
| Methodological Core | Landmark-aware pooling | Multi-task unified evaluation | Template adaptation + physics-aware features | Modular engineering design | Ontology-driven annotation |
| Evaluation Metrics | Top-k accuracy, recall | AP, OKS, PCK, ReID recall | Chamfer, EMD, Normal Consistency, 3D IoU | Task-standard metrics | mIoU, AP (det/attr) |
| Major Strength | First large-scale fashion dataset | Real-world complexity & multi-task scope | Provides true 3D structural evidence | Easy experimentation & extensibility | Explicit semantic structure of fashion |
| Main Limitation | Lacks structural & semantic depth | Limited semantic/ontological reasoning | Computationally heavy, frontal bias | Limited theoretical contribution | No aesthetic/emotional modeling |
| Emotion / Aesthetics Modeled | - | - | - | - | - |
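
Two of the metrics named in the Evaluation Metrics row are simple enough to sketch directly. The following is a minimal illustration, not code from any of the papers or from MMFashion: the function names are hypothetical, and the Chamfer variant shown (mean squared nearest-neighbour distance, summed over both directions) is only one of several conventions in use, so reported numbers may follow a different normalization.

```python
from math import dist


def top_k_accuracy(ranked_predictions, labels, k):
    """Fraction of samples whose true label is among the first k ranked predictions."""
    hits = sum(
        1
        for ranked, label in zip(ranked_predictions, labels)
        if label in ranked[:k]
    )
    return hits / len(labels)


def chamfer_distance(points_a, points_b):
    """Symmetric Chamfer distance between two point sets (brute force).

    Assumed variant: for each direction, average the squared Euclidean
    distance from every point to its nearest neighbour in the other set,
    then add the two directions together.
    """
    def one_way(src, dst):
        return sum(min(dist(p, q) ** 2 for q in dst) for p in src) / len(src)

    return one_way(points_a, points_b) + one_way(points_b, points_a)


# Toy inputs, invented for illustration only.
ranked = [["dress", "skirt", "coat"], ["tee", "coat", "dress"]]
truth = ["skirt", "dress"]
top2 = top_k_accuracy(ranked, truth, k=2)

cloud_a = [(0.0, 0.0, 0.0)]
cloud_b = [(3.0, 4.0, 0.0)]
cd = chamfer_distance(cloud_a, cloud_b)
```

The brute-force nearest-neighbour search is quadratic in the number of points, which is fine for a sanity check but would normally be replaced by a spatial index or a GPU implementation for full garment meshes.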


2) Conceptual Evolution in Fashion Vision

| Stage | Resource | Key Question Answered |
| --- | --- | --- |
| Appearance Level | DeepFashion | What clothing is visible? |
| Structural 2D Level | DeepFashion2 | Where are parts under real-world variations? |
| Semantic-Part Level | Fashionpedia | Which part has which property? |
| Geometric 3D Level | DeepFashion3D | What is the physical garment shape? |
| Engineering Layer | MMFashion | How do we implement and experiment efficiently? |

3) Research-Oriented Selection Guide

| Research Goal | Most Suitable Resource |
| --- | --- |
| Attribute learning | DeepFashion, Fashionpedia |
| Detection in complex scenes | DeepFashion2, Fashionpedia |
| Landmark / garment pose | DeepFashion2 |
| Part-based reasoning | Fashionpedia |
| Clothing retrieval / Re-ID | DeepFashion, DeepFashion2 |
| Virtual try-on / simulation | DeepFashion3D |
| 3D garment reconstruction | DeepFashion3D |
| Multi-task unified training | Fashionpedia + DeepFashion2 |
| Rapid experimentation | MMFashion |
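
The guide above can also be kept as a small lookup table, for example to pick datasets from an experiment script. This is a minimal sketch: `SELECTION_GUIDE` and `suggest_resources` are hypothetical names introduced here, and matching is a plain case-insensitive substring test on the goal text.

```python
# Research goal -> suggested resources, copied from the selection guide.
SELECTION_GUIDE = {
    "attribute learning": ["DeepFashion", "Fashionpedia"],
    "detection in complex scenes": ["DeepFashion2", "Fashionpedia"],
    "landmark / garment pose": ["DeepFashion2"],
    "part-based reasoning": ["Fashionpedia"],
    "clothing retrieval / re-id": ["DeepFashion", "DeepFashion2"],
    "virtual try-on / simulation": ["DeepFashion3D"],
    "3d garment reconstruction": ["DeepFashion3D"],
    "multi-task unified training": ["Fashionpedia", "DeepFashion2"],
    "rapid experimentation": ["MMFashion"],
}


def suggest_resources(goal):
    """Return resources for every guide entry whose goal text contains `goal`."""
    query = goal.strip().lower()
    if not query:
        return []
    matches = []
    for key, resources in SELECTION_GUIDE.items():
        if query in key:
            # Preserve guide order and skip duplicates across entries.
            for name in resources:
                if name not in matches:
                    matches.append(name)
    return matches


retrieval_choices = suggest_resources("retrieval")
```

A query that matches several rows returns the union of their resources in guide order; an unknown goal returns an empty list instead of raising.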

4) Big-Picture Insight

These resources form complementary layers rather than replacements. The progression DeepFashion → DeepFashion2 → Fashionpedia → DeepFashion3D moves fashion AI from visual appearance → structural layout → semantic understanding → physical geometry, while MMFashion provides the engineering backbone for running experiments across them.