What analogies reveal about word vectors and their compositionality

Gregory P. Finley; Stephanie Farmer; Serguei V.S. Pakhomov

doi:10.18653/v1/s17-1001

What analogies reveal about word vectors and their compositionality

Gregory P. Finley, Stephanie Farmer, Serguei V.S. Pakhomov

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

16 Scopus citations

Abstract

Analogy completion via vector arithmetic has become a common means of demonstrating the compositionality of word embeddings. Previous work have shown that this strategy works more reliably for certain types of analogical word relationships than for others, but these studies have not offered a convincing account for why this is the case. We arrive at such an account through an experiment that targets a wide variety of analogy questions and defines a baseline condition to more accurately measure the efficacy of our system. We find that the most reliably solvable analogy categories involve either 1) the application of a morpheme with clear syntactic effects, 2) male-female alternations, or 3) named entities. These broader types do not pattern cleanly along a syntactic- semantic divide. We suggest instead that their commonality is distributional, in that the difference between the distributions of two words in any given pair encompasses a relatively small number of word types. Our study offers a needed explanation for why analogy tests succeed and fail where they do and provides nuanced insight into the relationship between word distributions and the theoretical linguistic domains of syntax and semantics.

Original language	English (US)
Title of host publication	*SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings
Publisher	Association for Computational Linguistics (ACL)
Pages	1-11
Number of pages	11
ISBN (Electronic)	9781945626531
DOIs	https://doi.org/10.18653/v1/s17-1001
State	Published - 2017
Event	6th Joint Conference on Lexical and Computational Semantics, *SEM 2017 - Vancouver, Canada Duration: Aug 3 2017 → Aug 4 2017

Publication series

Name	*SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings

Other

Other	6th Joint Conference on Lexical and Computational Semantics, *SEM 2017
Country/Territory	Canada
City	Vancouver
Period	8/3/17 → 8/4/17

Bibliographical note

Funding Information:
This work was partially supported by a University of Minnesota Academic Health Center Faculty Development Award and by the National Institute of General Medical Sciences (GM102282).

Publisher Copyright:
© 2017 Association for Computational Linguistics.

Access

10.18653/v1/s17-1001

OpenUrl availability

Full text

Cite this

Finley, G. P., Farmer, S., & Pakhomov, S. V. S. (2017). What analogies reveal about word vectors and their compositionality. In *SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings (pp. 1-11). (*SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/s17-1001

What analogies reveal about word vectors and their compositionality. / Finley, Gregory P.; Farmer, Stephanie; Pakhomov, Serguei V.S.
*SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings. Association for Computational Linguistics (ACL), 2017. p. 1-11 (*SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings).

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Finley, GP, Farmer, S & Pakhomov, SVS 2017, What analogies reveal about word vectors and their compositionality. in *SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings. *SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings, Association for Computational Linguistics (ACL), pp. 1-11, 6th Joint Conference on Lexical and Computational Semantics, *SEM 2017, Vancouver, Canada, 8/3/17. https://doi.org/10.18653/v1/s17-1001

Finley GP, Farmer S, Pakhomov SVS. What analogies reveal about word vectors and their compositionality. In *SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings. Association for Computational Linguistics (ACL). 2017. p. 1-11. (*SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings). doi: 10.18653/v1/s17-1001

Finley, Gregory P. ; Farmer, Stephanie ; Pakhomov, Serguei V.S. / What analogies reveal about word vectors and their compositionality. *SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings. Association for Computational Linguistics (ACL), 2017. pp. 1-11 (*SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings).

@inproceedings{724c2bb99e614b7094861177699ac019,

title = "What analogies reveal about word vectors and their compositionality",

abstract = "Analogy completion via vector arithmetic has become a common means of demonstrating the compositionality of word embeddings. Previous work have shown that this strategy works more reliably for certain types of analogical word relationships than for others, but these studies have not offered a convincing account for why this is the case. We arrive at such an account through an experiment that targets a wide variety of analogy questions and defines a baseline condition to more accurately measure the efficacy of our system. We find that the most reliably solvable analogy categories involve either 1) the application of a morpheme with clear syntactic effects, 2) male-female alternations, or 3) named entities. These broader types do not pattern cleanly along a syntactic- semantic divide. We suggest instead that their commonality is distributional, in that the difference between the distributions of two words in any given pair encompasses a relatively small number of word types. Our study offers a needed explanation for why analogy tests succeed and fail where they do and provides nuanced insight into the relationship between word distributions and the theoretical linguistic domains of syntax and semantics.",

author = "Finley, {Gregory P.} and Stephanie Farmer and Pakhomov, {Serguei V.S.}",

note = "Funding Information: This work was partially supported by a University of Minnesota Academic Health Center Faculty Development Award and by the National Institute of General Medical Sciences (GM102282). Publisher Copyright: {\textcopyright} 2017 Association for Computational Linguistics.; 6th Joint Conference on Lexical and Computational Semantics, *SEM 2017 ; Conference date: 03-08-2017 Through 04-08-2017",

year = "2017",

doi = "10.18653/v1/s17-1001",

language = "English (US)",

series = "*SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings",

publisher = "Association for Computational Linguistics (ACL)",

pages = "1--11",

booktitle = "*SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings",

}

TY - GEN

T1 - What analogies reveal about word vectors and their compositionality

AU - Finley, Gregory P.

AU - Farmer, Stephanie

AU - Pakhomov, Serguei V.S.

N1 - Funding Information: This work was partially supported by a University of Minnesota Academic Health Center Faculty Development Award and by the National Institute of General Medical Sciences (GM102282). Publisher Copyright: © 2017 Association for Computational Linguistics.

PY - 2017

Y1 - 2017

N2 - Analogy completion via vector arithmetic has become a common means of demonstrating the compositionality of word embeddings. Previous work have shown that this strategy works more reliably for certain types of analogical word relationships than for others, but these studies have not offered a convincing account for why this is the case. We arrive at such an account through an experiment that targets a wide variety of analogy questions and defines a baseline condition to more accurately measure the efficacy of our system. We find that the most reliably solvable analogy categories involve either 1) the application of a morpheme with clear syntactic effects, 2) male-female alternations, or 3) named entities. These broader types do not pattern cleanly along a syntactic- semantic divide. We suggest instead that their commonality is distributional, in that the difference between the distributions of two words in any given pair encompasses a relatively small number of word types. Our study offers a needed explanation for why analogy tests succeed and fail where they do and provides nuanced insight into the relationship between word distributions and the theoretical linguistic domains of syntax and semantics.

AB - Analogy completion via vector arithmetic has become a common means of demonstrating the compositionality of word embeddings. Previous work have shown that this strategy works more reliably for certain types of analogical word relationships than for others, but these studies have not offered a convincing account for why this is the case. We arrive at such an account through an experiment that targets a wide variety of analogy questions and defines a baseline condition to more accurately measure the efficacy of our system. We find that the most reliably solvable analogy categories involve either 1) the application of a morpheme with clear syntactic effects, 2) male-female alternations, or 3) named entities. These broader types do not pattern cleanly along a syntactic- semantic divide. We suggest instead that their commonality is distributional, in that the difference between the distributions of two words in any given pair encompasses a relatively small number of word types. Our study offers a needed explanation for why analogy tests succeed and fail where they do and provides nuanced insight into the relationship between word distributions and the theoretical linguistic domains of syntax and semantics.

UR - http://www.scopus.com/inward/record.url?scp=85036617466&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85036617466&partnerID=8YFLogxK

U2 - 10.18653/v1/s17-1001

DO - 10.18653/v1/s17-1001

M3 - Conference contribution

AN - SCOPUS:85036617466

T3 - *SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings

SP - 1

EP - 11

BT - *SEM 2017 - 6th Joint Conference on Lexical and Computational Semantics, Proceedings

PB - Association for Computational Linguistics (ACL)

T2 - 6th Joint Conference on Lexical and Computational Semantics, *SEM 2017

Y2 - 3 August 2017 through 4 August 2017

ER -

What analogies reveal about word vectors and their compositionality

Abstract

Publication series

Other

Bibliographical note

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this