In this paper, we would like to discuss the connection between visual processing and the understanding of an image. While the information of image viewing can be obtained from subjects' eye fixation, the understanding of an image can be obtained from the subjects' description of the given image. Furthermore, we proposed a new image labeling method based on the connection between eye fixation and image description by humans. By generating this new kind of labeling method, we can construct an image dataset with labels that are closer to how humans understand the incoming image. In addition, we would like to discuss the proof that the proposed labels better describe the image compared to other types of labeling systems. Research about the relationship between images and human descriptions can be applied to several different applications. For instance, by analyzing the pairwise similarity of user descriptions, we could have a measurement of the complexity of image content. Another possible application is to use this dataset as a criterion to find the difference in visual processing of individuals with or without certain psychological characteristic.
Bibliographical notePublisher Copyright:
© 2018 Copyright for the individual papers remains with the authors.
- Computer vision
- Eye fixation
- Image annotation
- Image representation
- Scene analysis
- Vision and scene understanding
- Visual attention