Pretrained Reversible Generation as Unsupervised Visual Representation Learning

1 Xi'an Jiaotong University 2 Shanghai Artificial Intelligence Laboratory
3 The Chinese University of Hong Kong 4 Nanjing University of Aeronautics and Astronautics
5 SenseTime Research
Accepted to ICCV 2025
*Equal contribution

Demo

Overview of the PRG as Unsupervised Visual Representation pipeline. Swiss-roll data is generated via (x, y) = (t cos t, t sin t), t ∈ [0, 3π], with a blue→red gradient as t increases.
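The Swiss-roll toy data in the figure can be regenerated directly from the stated formula. The snippet below is a small NumPy sketch; the function name, sample count, and seed are our own choices for illustration.

```python
# Minimal sketch of the Swiss-roll toy data from the overview figure:
# (x, y) = (t cos t, t sin t), t in [0, 3*pi], colored blue -> red as t grows.
import numpy as np

def make_swiss_roll(n_points: int = 2000, seed: int = 0):
    rng = np.random.default_rng(seed)
    t = np.sort(rng.uniform(0.0, 3.0 * np.pi, size=n_points))  # parameter along the spiral
    xy = np.stack([t * np.cos(t), t * np.sin(t)], axis=1)       # 2-D coordinates
    color = t / t.max()                                          # 0 (blue) -> 1 (red)
    return xy, color

if __name__ == "__main__":
    points, color = make_swiss_roll()
    print(points.shape, color.min(), color.max())  # (2000, 2) 0.0 1.0
```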

Abstract

Recent generative models based on score matching and flow matching have significantly advanced generation tasks, but their potential in discriminative tasks remains underexplored. Previous approaches, such as generative classifiers, have not fully leveraged the capabilities of these models for discriminative tasks due to their intricate designs. We propose Pretrained Reversible Generation (PRG), which extracts unsupervised representations by reversing the generative process of a pretrained continuous generation model. PRG effectively reuses unsupervised generative models, leveraging their high capacity to serve as robust and generalizable feature extractors for downstream tasks. This framework enables the flexible selection of feature hierarchies tailored to specific downstream tasks. Our method consistently outperforms prior approaches across multiple benchmarks, achieving state-of-the-art performance among generative model based methods, including 78% top-1 accuracy on ImageNet at a resolution of 64×64. Extensive ablation studies, including out-of-distribution evaluations, further validate the effectiveness of our approach.

How does PRG work?

  • PRG turns a pretrained continuous-time flow/diffusion generator upside down: running the model backward produces multi-level features that, after light fine-tuning, serve as an unsupervised representation extractor.
  • Stage 1: We pretrain a reversible diffusion/flow model without labels, learning the forward generative trajectory from raw data to latent space.
  • Stage 2: We run the model backward along this trajectory and jointly fine-tune it with a lightweight classifier, turning each timestep’s features into ready-to-use representations for downstream tasks (see the sketch after this list).
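Concretely, the two-stage recipe boils down to integrating the pretrained velocity field from the data endpoint toward the latent prior and reading out an intermediate state as the feature. The sketch below is a minimal, hypothetical PyTorch rendering of that idea, not the authors' released code: `velocity_model`, its `v(x, t)` signature, the Euler solver, the stopping time, and the flatten-then-linear head are all illustrative choices.

```python
# Hypothetical PRG-style feature extraction (Stage 2), assuming a pretrained
# velocity field v(x, t) that defines the generative ODE dx/dt = v(x, t).
import torch
import torch.nn as nn

def extract_features(velocity_model: nn.Module, x: torch.Tensor,
                     t_end: float = 0.5, num_steps: int = 10) -> torch.Tensor:
    """Integrate from t = 0 (data) toward the latent prior with Euler steps.

    Wrap this call in torch.no_grad() for a Frozen-backbone probe; keep gradients
    on to fine-tune the backbone jointly with the classifier (No-Frozen variant).
    """
    dt = t_end / num_steps
    t = torch.zeros(x.shape[0], device=x.device)
    for _ in range(num_steps):
        x = x + dt * velocity_model(x, t)  # one Euler step along the learned trajectory
        t = t + dt
    return x.flatten(1)                    # intermediate state used as the representation

class LightweightClassifier(nn.Module):
    """Small head trained on top of the features read out at the chosen timestep."""
    def __init__(self, feat_dim: int, num_classes: int):
        super().__init__()
        self.head = nn.Linear(feat_dim, num_classes)

    def forward(self, feats: torch.Tensor) -> torch.Tensor:
        return self.head(feats)
```

The stopping time `t_end` plays the role of the feature-hierarchy knob mentioned in the abstract: stopping earlier keeps lower-level detail, stopping later yields more abstract states.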

Figure: accuracy curves. Left panel: No-Frozen curves only. Right panel: No-Frozen + Frozen.

Why choose PRG?

PRG delivers three standout benefits that make modern pretrained generative models easy to reuse as representation learners:

  • Architecture-agnostic flexibility. PRG decouples the latent variable Z from the backbone, so U-Nets, Transformers, or any future network can plug in without retraining the representation.
  • Infinite-layer expressiveness. Built on continuous-time stochastic flows, PRG enjoys the “infinite-layer” capacity behind models like SD3 and DALL·E 3, yet keeps parameter count lean and supports joint generative-and-discriminative training.
  • Robustness & generalizability. Features stay stable across solvers (Euler, RK, etc.) and time steps, transfer effortlessly to new datasets and community diffusion/flow models, and fine-tune rapidly for fresh tasks (a toy solver comparison follows this list).
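The solver-robustness point can be made concrete with a toy check: integrate the same velocity field with Euler and with Heun (a second-order Runge-Kutta method) and measure how far the resulting features drift apart. Everything below (the toy linear field, step counts, function names) is an illustrative stand-in, not the paper's evaluation code.

```python
# Toy comparison of ODE solvers on the same reverse trajectory.
import torch

def euler_step(v, x, t, dt):
    return x + dt * v(x, t)

def heun_step(v, x, t, dt):
    # Heun / RK2: average the slope at the start and at the Euler-predicted endpoint.
    k1 = v(x, t)
    k2 = v(x + dt * k1, t + dt)
    return x + dt * 0.5 * (k1 + k2)

def integrate(v, x, t_end=0.5, num_steps=10, step_fn=euler_step):
    dt = t_end / num_steps
    t = torch.zeros(x.shape[0])
    for _ in range(num_steps):
        x = step_fn(v, x, t, dt)
        t = t + dt
    return x

if __name__ == "__main__":
    v = lambda x, t: -x                      # toy linear velocity field as a stand-in
    x0 = torch.randn(4, 8)
    f_euler = integrate(v, x0, step_fn=euler_step)
    f_heun = integrate(v, x0, step_fn=heun_step)
    print(torch.norm(f_euler - f_heun) / torch.norm(f_euler))  # small relative gap
```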

Dive into our paper for the full technical breakdown and experimental results!

Experiments

CIFAR-10

Method                          Param. (M)    Acc. (%)
Discriminative methods
WideResNet-28-10 [70]           36            96.3
ResNeXt-29-16×64d [66]          68            96.4
Generative methods
GLOW [19]                       N/A           84.0
Energy model [18]               N/A           92.9
SBGC [73]                       N/A           95.0
HybViT [67]                     43            96.0
DDAE [65]                       36            97.2
Our methods
PRG-GVP-onlyPretrain            42            54.10
PRG-GVP-S                       42            97.35
PRG-ICFM-S                      42            97.59
PRG-OTCFM-S                     42            97.65

Tiny-ImageNet

Method                          Param. (M)    Acc. (%)
Discriminative methods
WideResNet-28-10 [70]           36            69.3
Generative methods
HybViT [67]                     43            56.7
DDAE [65]                       40            69.4
Our methods
PRG-GVP-onlyPretrain            42            15.34
PRG-GVP-S                       42            70.98
PRG-ICFM-S                      42            71.12
PRG-OTCFM-S                     42            71.33

ImageNet

Method                          Param. (M)    Acc. (%)
Discriminative methods
ViT-L/16 (384²) [17]            307           76.5
ResNet-152 (224²) [23]          60            77.8
Swin-B (224²) [39]              88            83.5
Generative methods
HybViT (32²) [67]               43            53.5
DMSZC-DiTXL2 (256²) [34]        338           77.5
iGPT-L (48²) [10]               1362          72.6
Our methods
PRG-GVP-onlyPretrain (64²)      122           20.18
PRG-GVP-XL (64²)                122           77.84
PRG-ICFM-XL (64²)               122           78.12
PRG-OTCFM-XL (64²)              122           78.13

BibTeX

@article{xue2024pretrained,
  title={Pretrained Reversible Generation as Unsupervised Visual Representation Learning},
  author={Xue, Rongkun and Zhang, Jinouwen and Niu, Yazhe and Shen, Dazhong and Ma, Bingqi and Liu, Yu and Yang, Jing},
  journal={arXiv preprint arXiv:2412.01787},
  year={2024}
}