Efficient grounding of abstract spatial concepts for natural language interaction with robot platforms

Rohan Paul; Jacob Arkin; Derya Aksaray; Nicholas Roy; Thomas M. Howard

doi:10.1177/0278364918777627

Efficient grounding of abstract spatial concepts for natural language interaction with robot platforms

Rohan Paul, Jacob Arkin, Derya Aksaray, Nicholas Roy, Thomas M. Howard

Aerospace Engineering and Mechanics

Research output: Contribution to journal › Article › peer-review

39 Scopus citations

Abstract

Our goal is to develop models that allow a robot to efficiently understand or “ground” natural language instructions in the context of its world representation. Contemporary approaches estimate correspondences between language instructions and possible groundings such as objects, regions, and goals for actions that the robot should execute. However, these approaches typically reason in relatively small domains and do not model abstract spatial concepts such as as “rows,” “columns,” or “groups” of objects and, hence, are unable to interpret an instruction such as “pick up the middle block in the row of five blocks.” In this paper, we introduce two new models for efficient natural language understanding of robot instructions. The first model, which we call the adaptive distributed correspondence graph (ADCG), is a probabilistic model for interpreting abstract concepts that require hierarchical reasoning over constituent concrete entities as well as notions of cardinality and ordinality. Abstract grounding variables form a Markov boundary over concrete groundings, effectively de-correlating them from the remaining variables in the graph. This structure reduces the complexity of model training and inference. Inference in the model is posed as an approximate search procedure that orders factor computation such that the estimated probable concrete groundings focus the search for abstract concepts towards likely hypothesis, pruning away improbable portions of the exponentially large space of abstractions. Further, we address the issue of scalability to complex domains and introduce a hierarchical extension to a second model termed the hierarchical adaptive distributed correspondence graph (HADCG). The model utilizes the abstractions in the ADCG but infers a coarse symbolic structure from the utterance and the environment model and then performs fine-grained inference over the reduced graphical model, further improving the efficiency of inference. Empirical evaluation demonstrates accurate grounding of abstract concepts embedded in complex natural language instructions commanding a robotic torso and a mobile robot. Further, the proposed approximate inference method allows significant efficiency gains compared with the baseline, with minimal trade-off in accuracy.

Original language	English (US)
Pages (from-to)	1269-1299
Number of pages	31
Journal	International Journal of Robotics Research
Volume	37
Issue number	10
DOIs	https://doi.org/10.1177/0278364918777627
State	Published - Sep 1 2018

Bibliographical note

Publisher Copyright:
© The Author(s) 2018.

Keywords

Human-Robot interaction
abstract spatial concepts
language grounding
robot learning

Access

10.1177/0278364918777627

OpenUrl availability

Full text

Cite this

@article{052f894f89984e4f87d22f8dfbb3dda9,

title = "Efficient grounding of abstract spatial concepts for natural language interaction with robot platforms",

abstract = "Our goal is to develop models that allow a robot to efficiently understand or “ground” natural language instructions in the context of its world representation. Contemporary approaches estimate correspondences between language instructions and possible groundings such as objects, regions, and goals for actions that the robot should execute. However, these approaches typically reason in relatively small domains and do not model abstract spatial concepts such as as “rows,” “columns,” or “groups” of objects and, hence, are unable to interpret an instruction such as “pick up the middle block in the row of five blocks.” In this paper, we introduce two new models for efficient natural language understanding of robot instructions. The first model, which we call the adaptive distributed correspondence graph (ADCG), is a probabilistic model for interpreting abstract concepts that require hierarchical reasoning over constituent concrete entities as well as notions of cardinality and ordinality. Abstract grounding variables form a Markov boundary over concrete groundings, effectively de-correlating them from the remaining variables in the graph. This structure reduces the complexity of model training and inference. Inference in the model is posed as an approximate search procedure that orders factor computation such that the estimated probable concrete groundings focus the search for abstract concepts towards likely hypothesis, pruning away improbable portions of the exponentially large space of abstractions. Further, we address the issue of scalability to complex domains and introduce a hierarchical extension to a second model termed the hierarchical adaptive distributed correspondence graph (HADCG). The model utilizes the abstractions in the ADCG but infers a coarse symbolic structure from the utterance and the environment model and then performs fine-grained inference over the reduced graphical model, further improving the efficiency of inference. Empirical evaluation demonstrates accurate grounding of abstract concepts embedded in complex natural language instructions commanding a robotic torso and a mobile robot. Further, the proposed approximate inference method allows significant efficiency gains compared with the baseline, with minimal trade-off in accuracy.",

keywords = "Human-Robot interaction, abstract spatial concepts, language grounding, robot learning",

author = "Rohan Paul and Jacob Arkin and Derya Aksaray and Nicholas Roy and Howard, {Thomas M.}",

note = "Publisher Copyright: {\textcopyright} The Author(s) 2018.",

year = "2018",

month = sep,

day = "1",

doi = "10.1177/0278364918777627",

language = "English (US)",

volume = "37",

pages = "1269--1299",

journal = "International Journal of Robotics Research",

issn = "0278-3649",

publisher = "SAGE Publications Inc.",

number = "10",

}

TY - JOUR

T1 - Efficient grounding of abstract spatial concepts for natural language interaction with robot platforms

AU - Paul, Rohan

AU - Arkin, Jacob

AU - Aksaray, Derya

AU - Roy, Nicholas

AU - Howard, Thomas M.

N1 - Publisher Copyright: © The Author(s) 2018.

PY - 2018/9/1

Y1 - 2018/9/1

N2 - Our goal is to develop models that allow a robot to efficiently understand or “ground” natural language instructions in the context of its world representation. Contemporary approaches estimate correspondences between language instructions and possible groundings such as objects, regions, and goals for actions that the robot should execute. However, these approaches typically reason in relatively small domains and do not model abstract spatial concepts such as as “rows,” “columns,” or “groups” of objects and, hence, are unable to interpret an instruction such as “pick up the middle block in the row of five blocks.” In this paper, we introduce two new models for efficient natural language understanding of robot instructions. The first model, which we call the adaptive distributed correspondence graph (ADCG), is a probabilistic model for interpreting abstract concepts that require hierarchical reasoning over constituent concrete entities as well as notions of cardinality and ordinality. Abstract grounding variables form a Markov boundary over concrete groundings, effectively de-correlating them from the remaining variables in the graph. This structure reduces the complexity of model training and inference. Inference in the model is posed as an approximate search procedure that orders factor computation such that the estimated probable concrete groundings focus the search for abstract concepts towards likely hypothesis, pruning away improbable portions of the exponentially large space of abstractions. Further, we address the issue of scalability to complex domains and introduce a hierarchical extension to a second model termed the hierarchical adaptive distributed correspondence graph (HADCG). The model utilizes the abstractions in the ADCG but infers a coarse symbolic structure from the utterance and the environment model and then performs fine-grained inference over the reduced graphical model, further improving the efficiency of inference. Empirical evaluation demonstrates accurate grounding of abstract concepts embedded in complex natural language instructions commanding a robotic torso and a mobile robot. Further, the proposed approximate inference method allows significant efficiency gains compared with the baseline, with minimal trade-off in accuracy.

AB - Our goal is to develop models that allow a robot to efficiently understand or “ground” natural language instructions in the context of its world representation. Contemporary approaches estimate correspondences between language instructions and possible groundings such as objects, regions, and goals for actions that the robot should execute. However, these approaches typically reason in relatively small domains and do not model abstract spatial concepts such as as “rows,” “columns,” or “groups” of objects and, hence, are unable to interpret an instruction such as “pick up the middle block in the row of five blocks.” In this paper, we introduce two new models for efficient natural language understanding of robot instructions. The first model, which we call the adaptive distributed correspondence graph (ADCG), is a probabilistic model for interpreting abstract concepts that require hierarchical reasoning over constituent concrete entities as well as notions of cardinality and ordinality. Abstract grounding variables form a Markov boundary over concrete groundings, effectively de-correlating them from the remaining variables in the graph. This structure reduces the complexity of model training and inference. Inference in the model is posed as an approximate search procedure that orders factor computation such that the estimated probable concrete groundings focus the search for abstract concepts towards likely hypothesis, pruning away improbable portions of the exponentially large space of abstractions. Further, we address the issue of scalability to complex domains and introduce a hierarchical extension to a second model termed the hierarchical adaptive distributed correspondence graph (HADCG). The model utilizes the abstractions in the ADCG but infers a coarse symbolic structure from the utterance and the environment model and then performs fine-grained inference over the reduced graphical model, further improving the efficiency of inference. Empirical evaluation demonstrates accurate grounding of abstract concepts embedded in complex natural language instructions commanding a robotic torso and a mobile robot. Further, the proposed approximate inference method allows significant efficiency gains compared with the baseline, with minimal trade-off in accuracy.

KW - Human-Robot interaction

KW - abstract spatial concepts

KW - language grounding

KW - robot learning

UR - http://www.scopus.com/inward/record.url?scp=85049839251&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85049839251&partnerID=8YFLogxK

U2 - 10.1177/0278364918777627

DO - 10.1177/0278364918777627

M3 - Article

AN - SCOPUS:85049839251

SN - 0278-3649

VL - 37

SP - 1269

EP - 1299

JO - International Journal of Robotics Research

JF - International Journal of Robotics Research

IS - 10

ER -

Efficient grounding of abstract spatial concepts for natural language interaction with robot platforms

Abstract

Bibliographical note

Keywords

Access

OpenUrl availability

Other files and links

Fingerprint

Cite this