Long-Tail-image-aesthetics-and-quality-assessment

ICML2024: 首篇针对IAA中的长尾问题提出解决方案的工作

ELTA: An Enhancer against Long-Tail for Aesthetics-oriented Models

**Limin Liu, Shuai He, Anlong Ming, Rui Xie, Huadong Ma
Beijing University of Posts and Telecommunications, Equal contribution**

Introduction

现实世界中的数据集往往呈现长尾分布，影响了模型的泛化能力和公平性。这个问题在图像美学评估（IAA）任务中尤为突出，由于特征和标签之间存在严重的分布不匹配，以及美学对图像变化的高度敏感性，这种不平衡很难得到缓解。为了解决这些问题，我们提出了一种面向美学模型的长尾增强器（ELTA）。ELTA 首先利用专门设计的Mixup技术来增强高层空间中的Minority特征表示，同时保留图像固有的美学品质。然后通过相似度一致性方法对齐特征和标签，有效缓解了分布不匹配问题。最后，ELTA 采用了一个自适应的锐化策略改善模型的输出logits分布，从而提高伪标签的质量。

环境

einops==0.4.1
matplotlib==3.3.4
nni==2.6.1
numpy==1.19.5
pandas==1.1.5
Pillow==10.2.0
scikit_learn==1.4.0
scipy==1.5.4
timm==0.6.12
torch==1.10.1
torchvision==0.11.2
tqdm==4.64.1

模型训练

python main.py --csv_path           [dataset annotation file path]
               --dataset_path       [dataset image path]
               --mixup              # optional, enable TFA module
               --simloss_weight 1   # optional, enable FLSA module and specify weight
               ...                  # other arguments

权重文件: https://drive.google.com/file/d/1pA7kOCPHEUR5oNnocBZHH41Erud9Y30S/view?usp=drive_link

模型推理

python main.py -e                   [dataset annotation file path]
               --test_dataset_path  [dataset image path]
               --resume             [checkpoint path]   # required!
               ...                  # other arguments

在第一轮训练之后启用自训练

python main.py --st                 # enable self-training
               ...                  # other arguments

建议: 使用 NNI 自动调参

# 请先修改main_nni.py文件中命令 'trial_command' 和搜索空间 'search_space'
python main_nni.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CN_README.md

CN_README.md

Long-Tail-image-aesthetics-and-quality-assessment

ELTA: An Enhancer against Long-Tail for Aesthetics-oriented Models

**Limin Liu, Shuai He, Anlong Ming, Rui Xie, Huadong Ma
Beijing University of Posts and Telecommunications, Equal contribution**

Introduction

环境

模型训练

模型推理

在第一轮训练之后启用自训练

建议: 使用 NNI 自动调参

Files

CN_README.md

Latest commit

History

CN_README.md

File metadata and controls

Long-Tail-image-aesthetics-and-quality-assessment

ELTA: An Enhancer against Long-Tail for Aesthetics-oriented Models

Limin Liu*, Shuai He*, Anlong Ming*, Rui Xie, Huadong Ma Beijing University of Posts and Telecommunications, *Equal contribution

Introduction

环境

模型训练

模型推理

在第一轮训练之后启用自训练

建议: 使用 NNI 自动调参

**Limin Liu, Shuai He, Anlong Ming, Rui Xie, Huadong Ma
Beijing University of Posts and Telecommunications, Equal contribution**