Reducing model bias in a deep learning classifier using domain adversarial neural networks in the MINERvA experiment

Nuruzzaman; G. N. Perdue; A. Ghosh; M. Wospakrik; F. Akbar; D. A. Andrade; M. Ascencio; L. Bellantoni; A. Bercellie; M. Betancourt; G. F.R.Caceres Vera; T. Cai; M. F. Carneiro; J. Chaves; D. Coplowe; H. Da Motta; G. A. Díaz; J. Felix; L. Fields; R. Fine; A. M. Gago; R. Galindo; T. Golan; R. Gran; J. Y. Han; D. A. Harris; D. Jena; J. Kleykamp; M. Kordosky; X. G. Lu; E. Maher; W. A. Mann; C. M. Marshall; K. S. McFarland; A. M. McGowan; B. Messerly; J. Miller; J. K. Nelson; C. Nguyen; A. Norrick; Nuruzzaman Nuruzzaman; A. Olivier; R. Patton; M. A. Ramírez; R. D. Ransome; H. Ray; L. Ren; D. Rimal; D. Ruterbories; H. Schellman; C. J.Solano Salinas; H. Su; S. Upadhyay; E. Valencia; J. Wolcott; B. Yaeggy; S. Young

doi:10.1088/1748-0221/13/11/P11020

Reducing model bias in a deep learning classifier using domain adversarial neural networks in the MINERvA experiment

Nuruzzaman, G. N. Perdue, A. Ghosh, M. Wospakrik, F. Akbar, D. A. Andrade, M. Ascencio, L. Bellantoni, A. Bercellie, M. Betancourt, G. F.R.Caceres Vera, T. Cai, M. F. Carneiro, J. Chaves, D. Coplowe, H. Da Motta, G. A. Díaz, J. Felix, L. Fields, R. FineA. M. Gago, R. Galindo, T. Golan, R. Gran, J. Y. Han, D. A. Harris, D. Jena, J. Kleykamp, M. Kordosky, X. G. Lu, E. Maher, W. A. Mann, C. M. Marshall, K. S. McFarland, A. M. McGowan, B. Messerly, J. Miller, J. K. Nelson, C. Nguyen, A. Norrick, Nuruzzaman Nuruzzaman, A. Olivier, R. Patton, M. A. Ramírez, R. D. Ransome, H. Ray, L. Ren, D. Rimal, D. Ruterbories, H. Schellman, C. J.Solano Salinas, H. Su, S. Upadhyay, E. Valencia, J. Wolcott, B. Yaeggy, S. Young

Physics & Astronomy (Duluth)

Research output: Contribution to journal › Article › peer-review

18 Scopus citations

Abstract

We present a simulation-based study using deep convolutional neural networks (DCNNs) to identify neutrino interaction vertices in the MINERvA passive targets region, and illustrate the application of domain adversarial neural networks (DANNs) in this context. DANNs are designed to be trained in one domain (simulated data) but tested in a second domain (physics data) and utilize unlabeled data from the second domain so that during training only features which are unable to discriminate between the domains are promoted. MINERvA is a neutrino-nucleus scattering experiment using the NuMI beamline at Fermilab. A-dependent cross sections are an important part of the physics program, and these measurements require vertex finding in complicated events. To illustrate the impact of the DANN we used a modified set of simulation in place of physics data during the training of the DANN and then used the label of the modified simulation during the evaluation of the DANN. We find that deep learning based methods offer significant advantages over our prior track-based reconstruction for the task of vertex finding, and that DANNs are able to improve the performance of deep networks by leveraging available unlabeled data and by mitigating network performance degradation rooted in biases in the physics models used for training.

Original language	English (US)
Article number	P11020
Journal	Journal of Instrumentation
Volume	13
Issue number	11
DOIs	https://doi.org/10.1088/1748-0221/13/11/P11020
State	Published - Nov 26 2018

Bibliographical note

Funding Information:
This document was prepared by the MINERνA collaboration using the resources of the Fermi National Accelerator Laboratory (Fermilab), a U.S. Department of Energy, Office of Science, HEP User Facility. Fermilab is managed by Fermi Research Alliance, LLC (FRA), acting under Contract No. DE-AC02-07CH11359, which included the MINERνA construction project. The research here was aslo sponsored by the Laboratory Directed Research and Development Program of Oak Ridge National Laboratory, managed by UT-Battelle, LLC, for the U.S. Department of Energy. This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725. Construction support also was granted by the United States National Science Foundation under Award PHY-0619727 and by the University of Rochester. Additional support for participating scientists was provided by NSF and DOE (U.S.A.) by CAPES and CNPq (Brazil), by CoNaCyT (Mexico), by Proyecto Basal FB 0821, CONICYT PIA ACT1413, Fondecyt 3170845 and 11130133 (Chile), by PIIC (DGIP-UTFSM), by CONCYTEC, DGI-PUCP and IDI/IGI-UNI (Peru), by Latin American Center for Physics (CLAF), by RAS and the Russian Ministry of Education and Science (Russia), and by the National Science Centre of Poland, grant number DEC-2017/01/X/ST2/00128. We thank the MINOS Collaboration for use of its near detector data. Finally, we thank the staff of Fermilab for support of the beamline and the detector.

Publisher Copyright:
© 2018 IOP Publishing Ltd and Sissa Medialab.

Keywords

Analysis and statistical methods
Neutrino detectors
Pattern recognition, cluster finding, calibration andfitting methods

Access

10.1088/1748-0221/13/11/P11020

OpenUrl availability

Full text

Cite this

Nuruzzaman, Perdue, G. N., Ghosh, A., Wospakrik, M., Akbar, F., Andrade, D. A., Ascencio, M., Bellantoni, L., Bercellie, A., Betancourt, M., Vera, G. F. R. C., Cai, T., Carneiro, M. F., Chaves, J., Coplowe, D., Motta, H. D., Díaz, G. A., Felix, J., Fields, L., ... Young, S. (2018). Reducing model bias in a deep learning classifier using domain adversarial neural networks in the MINERvA experiment. Journal of Instrumentation, 13(11), Article P11020. https://doi.org/10.1088/1748-0221/13/11/P11020

Nuruzzaman, Perdue, GN, Ghosh, A, Wospakrik, M, Akbar, F, Andrade, DA, Ascencio, M, Bellantoni, L, Bercellie, A, Betancourt, M, Vera, GFRC, Cai, T, Carneiro, MF, Chaves, J, Coplowe, D, Motta, HD, Díaz, GA, Felix, J, Fields, L, Fine, R, Gago, AM, Galindo, R, Golan, T, Gran, R, Han, JY, Harris, DA, Jena, D, Kleykamp, J, Kordosky, M, Lu, XG, Maher, E, Mann, WA, Marshall, CM, McFarland, KS, McGowan, AM, Messerly, B, Miller, J, Nelson, JK, Nguyen, C, Norrick, A, Nuruzzaman, N, Olivier, A, Patton, R, Ramírez, MA, Ransome, RD, Ray, H, Ren, L, Rimal, D, Ruterbories, D, Schellman, H, Salinas, CJS, Su, H, Upadhyay, S, Valencia, E, Wolcott, J, Yaeggy, B & Young, S 2018, 'Reducing model bias in a deep learning classifier using domain adversarial neural networks in the MINERvA experiment', Journal of Instrumentation, vol. 13, no. 11, P11020. https://doi.org/10.1088/1748-0221/13/11/P11020

@article{a5c31adaf29b440ab097866db4db6c94,

title = "Reducing model bias in a deep learning classifier using domain adversarial neural networks in the MINERvA experiment",

abstract = "We present a simulation-based study using deep convolutional neural networks (DCNNs) to identify neutrino interaction vertices in the MINERvA passive targets region, and illustrate the application of domain adversarial neural networks (DANNs) in this context. DANNs are designed to be trained in one domain (simulated data) but tested in a second domain (physics data) and utilize unlabeled data from the second domain so that during training only features which are unable to discriminate between the domains are promoted. MINERvA is a neutrino-nucleus scattering experiment using the NuMI beamline at Fermilab. A-dependent cross sections are an important part of the physics program, and these measurements require vertex finding in complicated events. To illustrate the impact of the DANN we used a modified set of simulation in place of physics data during the training of the DANN and then used the label of the modified simulation during the evaluation of the DANN. We find that deep learning based methods offer significant advantages over our prior track-based reconstruction for the task of vertex finding, and that DANNs are able to improve the performance of deep networks by leveraging available unlabeled data and by mitigating network performance degradation rooted in biases in the physics models used for training.",

keywords = "Analysis and statistical methods, Neutrino detectors, Pattern recognition, cluster finding, calibration andfitting methods",

author = "Nuruzzaman and Perdue, {G. N.} and A. Ghosh and M. Wospakrik and F. Akbar and Andrade, {D. A.} and M. Ascencio and L. Bellantoni and A. Bercellie and M. Betancourt and Vera, {G. F.R.Caceres} and T. Cai and Carneiro, {M. F.} and J. Chaves and D. Coplowe and Motta, {H. Da} and D{\'i}az, {G. A.} and J. Felix and L. Fields and R. Fine and Gago, {A. M.} and R. Galindo and T. Golan and R. Gran and Han, {J. Y.} and Harris, {D. A.} and D. Jena and J. Kleykamp and M. Kordosky and Lu, {X. G.} and E. Maher and Mann, {W. A.} and Marshall, {C. M.} and McFarland, {K. S.} and McGowan, {A. M.} and B. Messerly and J. Miller and Nelson, {J. K.} and C. Nguyen and A. Norrick and Nuruzzaman Nuruzzaman and A. Olivier and R. Patton and Ram{\'i}rez, {M. A.} and Ransome, {R. D.} and H. Ray and L. Ren and D. Rimal and D. Ruterbories and H. Schellman and Salinas, {C. J.Solano} and H. Su and S. Upadhyay and E. Valencia and J. Wolcott and B. Yaeggy and S. Young",

note = "Funding Information: This document was prepared by the MINERνA collaboration using the resources of the Fermi National Accelerator Laboratory (Fermilab), a U.S. Department of Energy, Office of Science, HEP User Facility. Fermilab is managed by Fermi Research Alliance, LLC (FRA), acting under Contract No. DE-AC02-07CH11359, which included the MINERνA construction project. The research here was aslo sponsored by the Laboratory Directed Research and Development Program of Oak Ridge National Laboratory, managed by UT-Battelle, LLC, for the U.S. Department of Energy. This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725. Construction support also was granted by the United States National Science Foundation under Award PHY-0619727 and by the University of Rochester. Additional support for participating scientists was provided by NSF and DOE (U.S.A.) by CAPES and CNPq (Brazil), by CoNaCyT (Mexico), by Proyecto Basal FB 0821, CONICYT PIA ACT1413, Fondecyt 3170845 and 11130133 (Chile), by PIIC (DGIP-UTFSM), by CONCYTEC, DGI-PUCP and IDI/IGI-UNI (Peru), by Latin American Center for Physics (CLAF), by RAS and the Russian Ministry of Education and Science (Russia), and by the National Science Centre of Poland, grant number DEC-2017/01/X/ST2/00128. We thank the MINOS Collaboration for use of its near detector data. Finally, we thank the staff of Fermilab for support of the beamline and the detector. Publisher Copyright: {\textcopyright} 2018 IOP Publishing Ltd and Sissa Medialab.",

year = "2018",

month = nov,

day = "26",

doi = "10.1088/1748-0221/13/11/P11020",

language = "English (US)",

volume = "13",

journal = "Journal of Instrumentation",

issn = "1748-0221",

publisher = "IOP Publishing Ltd.",

number = "11",

}

TY - JOUR

T1 - Reducing model bias in a deep learning classifier using domain adversarial neural networks in the MINERvA experiment

AU - Nuruzzaman,

AU - Perdue, G. N.

AU - Ghosh, A.

AU - Wospakrik, M.

AU - Akbar, F.

AU - Andrade, D. A.

AU - Ascencio, M.

AU - Bellantoni, L.

AU - Bercellie, A.

AU - Betancourt, M.

AU - Vera, G. F.R.Caceres

AU - Cai, T.

AU - Carneiro, M. F.

AU - Chaves, J.

AU - Coplowe, D.

AU - Motta, H. Da

AU - Díaz, G. A.

AU - Felix, J.

AU - Fields, L.

AU - Fine, R.

AU - Gago, A. M.

AU - Galindo, R.

AU - Golan, T.

AU - Gran, R.

AU - Han, J. Y.

AU - Harris, D. A.

AU - Jena, D.

AU - Kleykamp, J.

AU - Kordosky, M.

AU - Lu, X. G.

AU - Maher, E.

AU - Mann, W. A.

AU - Marshall, C. M.

AU - McFarland, K. S.

AU - McGowan, A. M.

AU - Messerly, B.

AU - Miller, J.

AU - Nelson, J. K.

AU - Nguyen, C.

AU - Norrick, A.

AU - Nuruzzaman, Nuruzzaman

AU - Olivier, A.

AU - Patton, R.

AU - Ramírez, M. A.

AU - Ransome, R. D.

AU - Ray, H.

AU - Ren, L.

AU - Rimal, D.

AU - Ruterbories, D.

AU - Schellman, H.

AU - Salinas, C. J.Solano

AU - Su, H.

AU - Upadhyay, S.

AU - Valencia, E.

AU - Wolcott, J.

AU - Yaeggy, B.

AU - Young, S.

N1 - Funding Information: This document was prepared by the MINERνA collaboration using the resources of the Fermi National Accelerator Laboratory (Fermilab), a U.S. Department of Energy, Office of Science, HEP User Facility. Fermilab is managed by Fermi Research Alliance, LLC (FRA), acting under Contract No. DE-AC02-07CH11359, which included the MINERνA construction project. The research here was aslo sponsored by the Laboratory Directed Research and Development Program of Oak Ridge National Laboratory, managed by UT-Battelle, LLC, for the U.S. Department of Energy. This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725. Construction support also was granted by the United States National Science Foundation under Award PHY-0619727 and by the University of Rochester. Additional support for participating scientists was provided by NSF and DOE (U.S.A.) by CAPES and CNPq (Brazil), by CoNaCyT (Mexico), by Proyecto Basal FB 0821, CONICYT PIA ACT1413, Fondecyt 3170845 and 11130133 (Chile), by PIIC (DGIP-UTFSM), by CONCYTEC, DGI-PUCP and IDI/IGI-UNI (Peru), by Latin American Center for Physics (CLAF), by RAS and the Russian Ministry of Education and Science (Russia), and by the National Science Centre of Poland, grant number DEC-2017/01/X/ST2/00128. We thank the MINOS Collaboration for use of its near detector data. Finally, we thank the staff of Fermilab for support of the beamline and the detector. Publisher Copyright: © 2018 IOP Publishing Ltd and Sissa Medialab.

PY - 2018/11/26

Y1 - 2018/11/26

N2 - We present a simulation-based study using deep convolutional neural networks (DCNNs) to identify neutrino interaction vertices in the MINERvA passive targets region, and illustrate the application of domain adversarial neural networks (DANNs) in this context. DANNs are designed to be trained in one domain (simulated data) but tested in a second domain (physics data) and utilize unlabeled data from the second domain so that during training only features which are unable to discriminate between the domains are promoted. MINERvA is a neutrino-nucleus scattering experiment using the NuMI beamline at Fermilab. A-dependent cross sections are an important part of the physics program, and these measurements require vertex finding in complicated events. To illustrate the impact of the DANN we used a modified set of simulation in place of physics data during the training of the DANN and then used the label of the modified simulation during the evaluation of the DANN. We find that deep learning based methods offer significant advantages over our prior track-based reconstruction for the task of vertex finding, and that DANNs are able to improve the performance of deep networks by leveraging available unlabeled data and by mitigating network performance degradation rooted in biases in the physics models used for training.

AB - We present a simulation-based study using deep convolutional neural networks (DCNNs) to identify neutrino interaction vertices in the MINERvA passive targets region, and illustrate the application of domain adversarial neural networks (DANNs) in this context. DANNs are designed to be trained in one domain (simulated data) but tested in a second domain (physics data) and utilize unlabeled data from the second domain so that during training only features which are unable to discriminate between the domains are promoted. MINERvA is a neutrino-nucleus scattering experiment using the NuMI beamline at Fermilab. A-dependent cross sections are an important part of the physics program, and these measurements require vertex finding in complicated events. To illustrate the impact of the DANN we used a modified set of simulation in place of physics data during the training of the DANN and then used the label of the modified simulation during the evaluation of the DANN. We find that deep learning based methods offer significant advantages over our prior track-based reconstruction for the task of vertex finding, and that DANNs are able to improve the performance of deep networks by leveraging available unlabeled data and by mitigating network performance degradation rooted in biases in the physics models used for training.

KW - Analysis and statistical methods

KW - Neutrino detectors

KW - Pattern recognition, cluster finding, calibration andfitting methods

UR - http://www.scopus.com/inward/record.url?scp=85057619219&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85057619219&partnerID=8YFLogxK

U2 - 10.1088/1748-0221/13/11/P11020

DO - 10.1088/1748-0221/13/11/P11020

M3 - Article

AN - SCOPUS:85057619219

SN - 1748-0221

VL - 13

JO - Journal of Instrumentation

JF - Journal of Instrumentation

IS - 11

M1 - P11020

ER -

Reducing model bias in a deep learning classifier using domain adversarial neural networks in the MINERvA experiment

Abstract

Bibliographical note

Keywords

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this