SALICON: Saliency in Context

Ming Jiang; Shengsheng Huang; Juanyong Duan; Qi Zhao

doi:10.1109/CVPR.2015.7298710

SALICON: Saliency in Context

Ming Jiang, Shengsheng Huang, Juanyong Duan, Qi Zhao

Computer Science and Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

495 Scopus citations

Abstract

Saliency in Context (SALICON) is an ongoing effort that aims at understanding and predicting visual attention. This paper presents a new method to collect large-scale human data during natural explorations on images. While current datasets present a rich set of images and task-specific annotations such as category labels and object segments, this work focuses on recording and logging how humans shift their attention during visual exploration. The goal is to offer new possibilities to (1) complement task-specific annotations to advance the ultimate goal in visual understanding, and (2) understand visual attention and learn saliency models, all with human attentional data at a much larger scale. We designed a mouse-contingent multi-resolutional paradigm based on neurophysiological and psychophysical studies of peripheral vision, to simulate the natural viewing behavior of humans. The new paradigm allowed using a general-purpose mouse instead of an eye tracker to record viewing behaviors, thus enabling large-scale data collection. The paradigm was validated with controlled laboratory as well as large-scale online data. We report in this paper a proof-of-concept SALICON dataset of human 'free-viewing' data on 10,000 images from the Microsoft COCO (MS COCO) dataset with rich contextual information. We evaluated the use of the collected data in the context of saliency prediction, and demonstrated them a good source as ground truth for the evaluation of saliency algorithms.

Original language	English (US)
Title of host publication	IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015
Publisher	IEEE Computer Society
Pages	1072-1080
Number of pages	9
ISBN (Electronic)	9781467369640
DOIs	https://doi.org/10.1109/CVPR.2015.7298710
State	Published - Oct 14 2015
Event	IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015 - Boston, United States Duration: Jun 7 2015 → Jun 12 2015

Publication series

Name	Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition
Volume	07-12-June-2015
ISSN (Print)	1063-6919

Other

Other	IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015
Country/Territory	United States
City	Boston
Period	6/7/15 → 6/12/15

Access

10.1109/CVPR.2015.7298710

OpenUrl availability

Full text

Cite this

Jiang, M., Huang, S., Duan, J., & Zhao, Q. (2015). SALICON: Saliency in Context. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015 (pp. 1072-1080). Article 7298710 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; Vol. 07-12-June-2015). IEEE Computer Society. https://doi.org/10.1109/CVPR.2015.7298710

SALICON: Saliency in Context. / Jiang, Ming; Huang, Shengsheng; Duan, Juanyong et al.
IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015. IEEE Computer Society, 2015. p. 1072-1080 7298710 (Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition; Vol. 07-12-June-2015).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Jiang, M, Huang, S, Duan, J & Zhao, Q 2015, SALICON: Saliency in Context. in IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015., 7298710, Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 07-12-June-2015, IEEE Computer Society, pp. 1072-1080, IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, Boston, United States, 6/7/15. https://doi.org/10.1109/CVPR.2015.7298710

@inproceedings{e9e079e944174663adb6f4fa56767c70,

title = "SALICON: Saliency in Context",

abstract = "Saliency in Context (SALICON) is an ongoing effort that aims at understanding and predicting visual attention. This paper presents a new method to collect large-scale human data during natural explorations on images. While current datasets present a rich set of images and task-specific annotations such as category labels and object segments, this work focuses on recording and logging how humans shift their attention during visual exploration. The goal is to offer new possibilities to (1) complement task-specific annotations to advance the ultimate goal in visual understanding, and (2) understand visual attention and learn saliency models, all with human attentional data at a much larger scale. We designed a mouse-contingent multi-resolutional paradigm based on neurophysiological and psychophysical studies of peripheral vision, to simulate the natural viewing behavior of humans. The new paradigm allowed using a general-purpose mouse instead of an eye tracker to record viewing behaviors, thus enabling large-scale data collection. The paradigm was validated with controlled laboratory as well as large-scale online data. We report in this paper a proof-of-concept SALICON dataset of human 'free-viewing' data on 10,000 images from the Microsoft COCO (MS COCO) dataset with rich contextual information. We evaluated the use of the collected data in the context of saliency prediction, and demonstrated them a good source as ground truth for the evaluation of saliency algorithms.",

author = "Ming Jiang and Shengsheng Huang and Juanyong Duan and Qi Zhao",

year = "2015",

month = oct,

day = "14",

doi = "10.1109/CVPR.2015.7298710",

language = "English (US)",

series = "Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition",

publisher = "IEEE Computer Society",

pages = "1072--1080",

booktitle = "IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015",

note = "IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015 ; Conference date: 07-06-2015 Through 12-06-2015",

}

TY - GEN

T1 - SALICON

T2 - IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015

AU - Jiang, Ming

AU - Huang, Shengsheng

AU - Duan, Juanyong

AU - Zhao, Qi

PY - 2015/10/14

Y1 - 2015/10/14

N2 - Saliency in Context (SALICON) is an ongoing effort that aims at understanding and predicting visual attention. This paper presents a new method to collect large-scale human data during natural explorations on images. While current datasets present a rich set of images and task-specific annotations such as category labels and object segments, this work focuses on recording and logging how humans shift their attention during visual exploration. The goal is to offer new possibilities to (1) complement task-specific annotations to advance the ultimate goal in visual understanding, and (2) understand visual attention and learn saliency models, all with human attentional data at a much larger scale. We designed a mouse-contingent multi-resolutional paradigm based on neurophysiological and psychophysical studies of peripheral vision, to simulate the natural viewing behavior of humans. The new paradigm allowed using a general-purpose mouse instead of an eye tracker to record viewing behaviors, thus enabling large-scale data collection. The paradigm was validated with controlled laboratory as well as large-scale online data. We report in this paper a proof-of-concept SALICON dataset of human 'free-viewing' data on 10,000 images from the Microsoft COCO (MS COCO) dataset with rich contextual information. We evaluated the use of the collected data in the context of saliency prediction, and demonstrated them a good source as ground truth for the evaluation of saliency algorithms.

AB - Saliency in Context (SALICON) is an ongoing effort that aims at understanding and predicting visual attention. This paper presents a new method to collect large-scale human data during natural explorations on images. While current datasets present a rich set of images and task-specific annotations such as category labels and object segments, this work focuses on recording and logging how humans shift their attention during visual exploration. The goal is to offer new possibilities to (1) complement task-specific annotations to advance the ultimate goal in visual understanding, and (2) understand visual attention and learn saliency models, all with human attentional data at a much larger scale. We designed a mouse-contingent multi-resolutional paradigm based on neurophysiological and psychophysical studies of peripheral vision, to simulate the natural viewing behavior of humans. The new paradigm allowed using a general-purpose mouse instead of an eye tracker to record viewing behaviors, thus enabling large-scale data collection. The paradigm was validated with controlled laboratory as well as large-scale online data. We report in this paper a proof-of-concept SALICON dataset of human 'free-viewing' data on 10,000 images from the Microsoft COCO (MS COCO) dataset with rich contextual information. We evaluated the use of the collected data in the context of saliency prediction, and demonstrated them a good source as ground truth for the evaluation of saliency algorithms.

UR - http://www.scopus.com/inward/record.url?scp=84959225954&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84959225954&partnerID=8YFLogxK

U2 - 10.1109/CVPR.2015.7298710

DO - 10.1109/CVPR.2015.7298710

M3 - Conference contribution

AN - SCOPUS:84959225954

T3 - Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition

SP - 1072

EP - 1080

BT - IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015

PB - IEEE Computer Society

Y2 - 7 June 2015 through 12 June 2015

ER -

SALICON: Saliency in Context

Abstract

Publication series

Other

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this