TY - GEN
T1 - Visual-verbal consistency of image saliency
AU - Liang, Haoran
AU - Jiang, Ming
AU - Liang, Ronghua
AU - Zhao, Qi
PY - 2017/11/27
Y1 - 2017/11/27
AB - When looking at an image, humans shift their attention towards interesting regions, making sequences of eye fixations. When describing an image, they also come up with simple sentences that highlight the key elements in the scene. What is the correlation between where people look and what they describe in an image? To investigate this problem, we look into eye fixations and image captions, two types of subjective annotations that are relatively task-free and natural. From the annotations, we extract visual and verbal saliency ranks to compare against each other. We then propose a number of low-level and semantic-level features relevant to the visual-verbal consistency. Integrated into a computational model, the proposed features effectively predict the consistency between the two modalities on a large dataset with both types of annotations, namely SALICON [1].
KW - Correlation
KW - Image caption
KW - Visual saliency
KW - Visual-verbal consistency
UR - http://www.scopus.com/inward/record.url?scp=85044378447&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85044378447&partnerID=8YFLogxK
U2 - 10.1109/SMC.2017.8123171
DO - 10.1109/SMC.2017.8123171
M3 - Conference contribution
AN - SCOPUS:85044378447
T3 - 2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017
SP - 3489
EP - 3494
BT - 2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2017 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2017
Y2 - 5 October 2017 through 8 October 2017
ER -