Preprint / Version 2

Rethinking Image Quality Assessment through the Lens of Task Utility in Embodied Settings

##article.authors##

  • Jirong Zha Tsinghua University
  • Yemin Wang
  • Xiangmin Yi
  • Siqi Peng
  • Yingfeng Chen
  • Chen Gao
  • Xinlei Chen

DOI:

https://doi.org/10.31224/6844

Abstract

Image quality assessment (IQA) underpins embodied imaging pipelines by judging whether visual quality satisfies downstream tasks, yet most methods learn task-agnostic scores aligned with generic human ratings on static benchmarks. This objective mismatches the embodied and interactive settings, where image adequacy depends on task goals, context, and action requirements that shape an agent’s decisions. We argue that IQA should shift from score regression to goal-conditioned judgment defined by the utility of embodied tasks. Such utility-aware assessment demands models with strong reasoning, grounding, and tool-use capabilities, as enabled by multimodal large language models (MLLMs) agent. We advocate rethinking IQA from the perspective of embodied task utility and outline benchmarks, evaluation protocols, and research directions for developing MLLM-based embodied IQA agents.

Downloads

Download data is not yet available.

Downloads

Posted

2026-04-17 — Updated on 2026-04-23

Versions

Version justification

complete the contributor list for browser