Indoor scene knowledge acquisition using a natural language interface
Published: 2012
Publication Name: Proceedings of the international Workshop on Spatial Knowledge Acquisition with Limited Information Displays (SKALID'12)
Abstract:
Abstract: This paper proposes an interface that uses automatically-generated Natural Language (NL) descriptions to describe indoor scenes based on photos taken of that scene from smartphones or other portable camera-equipped mobile devices. The goal is to develop a non-visual interface based on spatio-linguistic descriptions which could assist blind people in knowing the contents of an indoor scene (e.g., room structure, furniture, landmarks, etc.) and supporting efficient navigation of this space based on these descriptions. In this paper, we concentrate on understanding the most salient content of a stereotypic indoor scene that is described by an observer, categorizing the description strategies employed in this process, and evaluating the best presentation of directional information using NL descriptions in order to support the most accurate spatial behaviors and mental representations of these scenes by means of human behavioral experiments. This knowledge will then be used to develop a domain specific indoor scene ontology, which in turn will be used to generate automated NL descriptions of indoor scenes based on their photographs, which will finally be integrated into a real-time non-visual scene description system.
Citation: Kesavan, S. & Giudice, N.A. (2012). Indoor scene knowledge acquisition using a natural language interface. In C. Graf, N.A. Giudice, & F. Schmid (Eds.) Proceedings of the international Workshop on Spatial Knowledge Acquisition with Limited Information Displays (SKALID’12), pp. 1-6. August, Monastery Seeon, Germany.