Multiview supervision by registration

Yilun Zhang; Hyun Soo Park

doi:10.1109/WACV45572.2020.9093591

Multiview supervision by registration

Yilun Zhang, Hyun Soo Park

Computer Science and Engineering

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

4 Scopus citations

Abstract

This paper presents a semi-supervised learning framework to train a keypoint detector using multiview image streams given the limited number of labeled instances (typically <4%). We leverage three self-supervisionary signals in multiview tracking to utilize the unlabeled data: (1) a keypoint in one view can be supervised by other views via epipolar geometry; (2) a keypoint detection must be consistent across time; (3) a visible keypoint in one view is likely to be visible in the adjacent view. We design a new end-to-end network that can propagate these self-supervisionary signals across the unlabeled data from the labeled data in a differentiable manner. We show that our approach outperforms existing detectors including DeepLabCut tailored to the keypoint detection of non-human species such as monkeys, dogs, and mice.

Original language	English (US)
Title of host publication	Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020
Publisher	Institute of Electrical and Electronics Engineers Inc.
Pages	409-417
Number of pages	9
ISBN (Electronic)	9781728165530
DOIs	https://doi.org/10.1109/WACV45572.2020.9093591
State	Published - Mar 2020
Event	2020 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2020 - Snowmass Village, United States Duration: Mar 1 2020 → Mar 5 2020

Publication series

Name	Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020

Conference

Conference	2020 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2020
Country/Territory	United States
City	Snowmass Village
Period	3/1/20 → 3/5/20

Bibliographical note

Funding Information:
8. Acknowledgements This work is supported by NSF IIS 1846031 and NSF IIS 1755895.

Publisher Copyright:
© 2020 IEEE.

Access

10.1109/WACV45572.2020.9093591

OpenUrl availability

Full text

Cite this

Zhang, Y., & Park, H. S. (2020). Multiview supervision by registration. In Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020 (pp. 409-417). Article 9093591 (Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/WACV45572.2020.9093591

Multiview supervision by registration. / Zhang, Yilun; Park, Hyun Soo.
Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020. Institute of Electrical and Electronics Engineers Inc., 2020. p. 409-417 9093591 (Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Zhang, Y & Park, HS 2020, Multiview supervision by registration. in Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020., 9093591, Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020, Institute of Electrical and Electronics Engineers Inc., pp. 409-417, 2020 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2020, Snowmass Village, United States, 3/1/20. https://doi.org/10.1109/WACV45572.2020.9093591

@inproceedings{b3a40a9d14d0425897a080b61151be3c,

title = "Multiview supervision by registration",

abstract = "This paper presents a semi-supervised learning framework to train a keypoint detector using multiview image streams given the limited number of labeled instances (typically <4%). We leverage three self-supervisionary signals in multiview tracking to utilize the unlabeled data: (1) a keypoint in one view can be supervised by other views via epipolar geometry; (2) a keypoint detection must be consistent across time; (3) a visible keypoint in one view is likely to be visible in the adjacent view. We design a new end-to-end network that can propagate these self-supervisionary signals across the unlabeled data from the labeled data in a differentiable manner. We show that our approach outperforms existing detectors including DeepLabCut tailored to the keypoint detection of non-human species such as monkeys, dogs, and mice.",

author = "Yilun Zhang and Park, {Hyun Soo}",

note = "Funding Information: 8. Acknowledgements This work is supported by NSF IIS 1846031 and NSF IIS 1755895. Publisher Copyright: {\textcopyright} 2020 IEEE.; 2020 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2020 ; Conference date: 01-03-2020 Through 05-03-2020",

year = "2020",

month = mar,

doi = "10.1109/WACV45572.2020.9093591",

language = "English (US)",

series = "Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020",

publisher = "Institute of Electrical and Electronics Engineers Inc.",

pages = "409--417",

booktitle = "Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020",

}

TY - GEN

T1 - Multiview supervision by registration

AU - Zhang, Yilun

AU - Park, Hyun Soo

PY - 2020/3

Y1 - 2020/3

N2 - This paper presents a semi-supervised learning framework to train a keypoint detector using multiview image streams given the limited number of labeled instances (typically <4%). We leverage three self-supervisionary signals in multiview tracking to utilize the unlabeled data: (1) a keypoint in one view can be supervised by other views via epipolar geometry; (2) a keypoint detection must be consistent across time; (3) a visible keypoint in one view is likely to be visible in the adjacent view. We design a new end-to-end network that can propagate these self-supervisionary signals across the unlabeled data from the labeled data in a differentiable manner. We show that our approach outperforms existing detectors including DeepLabCut tailored to the keypoint detection of non-human species such as monkeys, dogs, and mice.

AB - This paper presents a semi-supervised learning framework to train a keypoint detector using multiview image streams given the limited number of labeled instances (typically <4%). We leverage three self-supervisionary signals in multiview tracking to utilize the unlabeled data: (1) a keypoint in one view can be supervised by other views via epipolar geometry; (2) a keypoint detection must be consistent across time; (3) a visible keypoint in one view is likely to be visible in the adjacent view. We design a new end-to-end network that can propagate these self-supervisionary signals across the unlabeled data from the labeled data in a differentiable manner. We show that our approach outperforms existing detectors including DeepLabCut tailored to the keypoint detection of non-human species such as monkeys, dogs, and mice.

UR - http://www.scopus.com/inward/record.url?scp=85085520552&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85085520552&partnerID=8YFLogxK

U2 - 10.1109/WACV45572.2020.9093591

DO - 10.1109/WACV45572.2020.9093591

M3 - Conference contribution

AN - SCOPUS:85085520552

T3 - Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020

SP - 409

EP - 417

BT - Proceedings - 2020 IEEE Winter Conference on Applications of Computer Vision, WACV 2020

PB - Institute of Electrical and Electronics Engineers Inc.

T2 - 2020 IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2020

Y2 - 1 March 2020 through 5 March 2020

ER -

Multiview supervision by registration

Abstract

Publication series

Conference

Bibliographical note

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this