Advances in learning visual saliency: From image primitives to semantic contents

Qi Zhao, Christof Koch

Research output: Chapter in Book/Report/Conference proceeding › Chapter


Abstract

Humans and other primates shift their gaze to allocate processing resources to a subset of the visual input. Understanding and emulating the way that human observers free-view a natural scene has both scientific and economic impact. While previous research on saliency focused on low-level image features, the problem of the “semantic gap” has recently attracted attention from vision researchers, and higher-level features have been proposed to fill the gap. Building on these features, machine learning has become a popular computational tool for mining human fixation data to explore how people direct their gaze when inspecting a visual scene. While learning consistently boosts the performance of a saliency model, insight into what is learned inside the black box is also of great interest to both the human vision and computer vision communities. This chapter introduces recent advances in features that determine saliency, reviews related learning methods and the insights drawn from learning outcomes, and discusses resources and metrics in saliency prediction.

Original language: English (US)
Title of host publication: Neural Computation, Neural Devices, and Neural Prosthesis
Publisher: Springer New York
Pages: 335-360
Number of pages: 26
ISBN (Electronic): 9781461481515
ISBN (Print): 9781461481508
DOIs
State: Published - Jan 1 2014

Bibliographical note

Publisher Copyright:
© Springer Science+Business Media New York 2014.
