Section 01
[Introduction] FoodSense: Innovative Research Connecting Food Images and Multisensory Experiences
This article introduces the FoodSense dataset (containing 66,842 human-annotated entries covering four sensory dimensions: taste, smell, texture, and sound), aiming to fill the gap in AI food understanding where deep cognitive awareness of sensory experiences is lacking; it trains the FoodSense-VL vision-language model to enable multisensory reasoning and discusses its application scenarios and cognitive science significance.