| Organizer | Submission Deadline | Notification of Acceptance | Submission Email | Download |
|---|---|---|---|---|
| University of Strathclyde | August 7, 2026 | 7-20 workdays | sympo_glasgow@confcds.org | Manuscript Template |
The shift toward personalized and adaptive media communication requires a deeper understanding of the viewer’s cognitive state during content consumption, which can be effectively supported by integrating visual perception, image understanding, and scene analysis technologies. Traditional methods for assessing cognitive load still rely largely on intrusive physiological sensors (e.g., EEG, eye-tracking), which are impractical for consumer-scale communication applications. This creates a critical gap in enabling dynamic media adaptation, including intelligent ad insertion, personalized pacing, and user-centered content delivery. With recent advances in computer vision, target recognition, and multimodal feature extraction, it is now possible to estimate cognitive load using only audio-visual cues from the media itself— where target recognition helps identify key visual elements, image understanding deciphers the semantic meaning of visual content, and scene analysis contextualizes the overall visual environment to capture viewer engagement.
By leveraging indicators such as speech rate, motion intensity, visual transitions, and shot pacing, and insights derived from visual perception, this approach moves beyond hardware dependency and opens new possibilities for building cognitively-aware media communication systems that seamlessly integrate computer vision-driven analysis with adaptive communication strategies.
This symposium aims to introduce and explore a multimodal framework for segment-level cognitive load estimation that functions without physiological sensors. The central goal is to address the problem of bridging low-level audio-visual features and high-level cognitive dimensions, and to demonstrate how computer vision-based cues can support adaptive media communication.
The symposium will rationalize the need for such a framework by highlighting the limitations of current sensor-based methods and the untapped potential of semantic and perceptual features (derived from visual perception and image understanding) embedded in video content. The symposium will discuss the model’s architecture, its validation using narrative-aligned datasets, and its implications for applications such as non-intrusive ad insertion, cognitively-sensitive interfaces, adaptive streaming, and user-oriented pacing-all supported by computer vision, target recognition, and scene analysis technologies. The symposium encourages discussions on expanding the application scope of this approach and further integrating visual perception and communication innovations to enhance the framework’s applicability, facilitating technological breakthroughs and development in the field of cognitive load estimation.
The scope of this symposium centers on multimodal cognitive load estimation for adaptive media communication, with an emphasis on computer vision-based methods that deeply integrate relevant technologies such as target recognition, image understanding, scene analysis, and visual perception to facilitate technological innovation in adaptive media within the communication field. Specific themes include: audio-visual feature extraction for cognitive load modeling based on visual perception and image understanding; reasoning-assisted fusion mechanisms combined with computer vision technologies; segment-level narrative transition analysis relying on scene analysis and target recognition; lightweight estimation frameworks for consumer applications; and applications in intelligent ad insertion, personalized pacing, and cognitively-aware media interfaces, with the collaborative integration of computer vision and communication technologies running through the entire process.
Accepted papers of this symposium will be published in Applied and Computational Engineering (Print ISSN: 2755-2721), and will be submitted to Conference Proceedings Citation Index (CPCI), Crossref, Portico, Google Scholar, CNKI, and other databases for indexing. The situation may be affected by factors among databases like processing time, workflow, policy, etc.
The papers will be exported to production and publication on a regular basis. Early-registered papers are expected to be published online earlier.
This symposium is organized by CONF-CDS 2026 and will independently proceed the submission and publication process.