Imageability estimation

Project: Imageability estimation

The semantic gap is the lack of coincidence between the information one can extract from data and its interpretation. For a machine it is often challenging to select the best fitting wording out of a group of candidates: For an image of a car, the tag “Vehicle” might be too vague, while the model number might be too specific. An understanding of the visual implications of tag can help to decide the correct degree of abstractness.

In this research, we estimate a measurement for the perceived differences between words. Abstract words have a broad mental image due to them being less visually defined, while concrete words with a rather narrow visual feature space are visually easier to grasp. We cooporate this idea into the model and look for feature variety differences of large datasets for different words.

Comparing various modalities like vision, language, and phonemes, we explore differences of perception regarding words when it comes to each modality.

[Sources and pretrained model] https://github.com/mkasu/imageabilityestimation

[Our results and dataset] https://github.com/mkasu/imageabilitycorpus

Main project members

Chihaya Matsuhira

Finished Doctor in AY2024

Dr. Marc A. Kastner

Cooperative Research Fellow (Hiroshima City University)

Recent publications

A multi-modal dataset for analyzing the imageability of concepts across modalities

Marc A. Kastner, Chihaya Matsuhira, Ichiro Ide, Shin'ichi Satoh
Proc. 4th IEEE Int. Conf. on Multimedia Information Processing and Retrieval (MIPR2021), pp.213-218, Online, September 2021.

Imageability estimation using visual and language features

Chihaya Matsuhira, Marc Aurel Kastner, Ichiro Ide, Yasutomo Kawanishi, Takatsugu Hirayama, Keisuke Doman, Daisuke Deguchi, Hiroshi Murase
Proc ACM Int. Conf. on Multimedia Retrieval (ICMR) 2020, pp.306-310, Online, October 2020.

Estimating the imageability of words by mining visual characteristics from crawled image data

Marc Aurel Kastner, Ichiro Ide, Frank Nack, Yasutomo Kawanishi, Takatsugu Hirayama, Daisuke Deguchi, Hiroshi Murase
Multimedia Tools and Applications, vol.79, no.25, pp.18167-18199, July 2020.

On the quantification of the mental image of visual concepts for multi-modal applications

Marc Aurel Kastner, Ichiro Ide, Yasutomo Kawanishi, Takatsugu Hirayama, Daisuke Deguchi, Hiroshi Murase
IPSJ Tech. Rep. Computer Vision and Image Media, 2020-CVIM-222-5, Online, May 2020.

TBA (in Japanese)

Chihaya Matsuhira, Marc Aurel Kastner, Ichiro Ide, Yasutomo Kawanishi, Takatsugu Hirayama, Keisuke Doman, Daisuke Deguchi, Hiroshi Murase
26th ANLP Annual Meeting, no.P1-12; pp.45-48, Online, March 2020.

Estimating the visual variety of concepts by referring to Web popularity

Marc Aurel Kastner, Ichiro Ide, Yasutomo Kawanishi, Takatsugu Hirayama, Daisuke Deguchi, Hiroshi Murase
Multimedia Tools and Applications, vol.78, no.7, pp.9463-9488, April 2019.

A preliminary study on estimating word imageability labels using Web image data mining

Marc Aurel Kastner, Ichiro Ide, Yasutomo Kawanishi, Takatsugu Hirayama, Daisuke Deguchi, Hiroshi Murase
Proc. 25th ANLP Annual Meeting, no.A4-7; pp.747-750, Nagoya Univ, March 2019.

TBA (in Japanese)

Marc Aurel Kastner, Ichiro Ide, Yasutomo Kawanishi, Takatsugu Hirayama, Daisuke Deguchi, Hiroshi Murase
IPSJ Tech. Rep. Computer Vision and Image Media, 2018-CVIM-211-4, Ritsumeikan Univ., Biwako-Kusatsu Campus, March 2018.

Contents Science Lab

Project: Imageability estimation

Main project members

Chihaya Matsuhira

Dr. Marc A. Kastner

Recent publications

A multi-modal dataset for analyzing the imageability of concepts across modalities

Imageability estimation using visual and language features

Estimating the imageability of words by mining visual characteristics from crawled image data

On the quantification of the mental image of visual concepts for multi-modal applications

TBA (in Japanese)

Estimating the visual variety of concepts by referring to Web popularity

A preliminary study on estimating word imageability labels using Web image data mining

TBA (in Japanese)