
Premium content
Access to this content requires a subscription. You must be a premium user to view this content.

Would you like to see your presentation here, made available to a global audience of researchers?
Add your own presentation or have us affordably record your next conference.
keywords:
low level physics based vision
cv
In recent years, significant progress has been achieved in No-Reference Point Cloud Quality Assessment (NR-PCQA) research. However, existing methods mostly seek a direct mapping function from visual data to the Mean Opinion Score (MOS), which is contradictory to the mechanism of practical subjective evaluation. In response to this challenge, we propose a novel language-driven PCQA method named CLIP-PCQA. On the one hand, considering that human beings prefer to describe visual quality using discrete quality descriptions (e.g., excellent" andpoor") rather than specific scores, we adopt a retrieval-based mapping strategy to simulate the process of subjective assessment. More specifically, based on the philosophy of CLIP, we calculate the cosine similarity between the visual feature and multiple textual features corresponding to different quality descriptions, in which process an effective contrastive loss and learnable prompts are introduced to enhance the feature extraction. Meanwhile, given the personal limitations and bias in subjective experiments, we further covert the feature similarities into probabilities and consider the Opinion Score Distribution (OSD) rather than a single MOS as the final target. Experiment results show that our CLIP-PCQA outperforms other State-Of-The-Art (SOTA) approaches.