MultiMedia Modeling : 27th International Conference, MMM 2021, Prague, Czech Republic, June 22-24, 2021 : proceedings. Part II

The two-volume set LNCS 12572 and 12573 constitutes the thoroughly refereed proceedings of the 27th International Conference on MultiMedia Modeling, MMM 2021, held in Prague, Czech Republic, in June 2021. Of the 211 submitted regular papers, 40 papers were selected for oral presentation and 33 for pos...

Corporate Authors: International Conference on Multi-Media Modeling (Prague, Czech Republic)
Other Authors: International Conference on Multi-Media Modeling; Lokoč, Jakub; Skopal, Tomas; Schoeffmann, Klaus; Mezaris, Vasileios; Li, Xirong; Vrochidis, Stefanos, 1975-; Patras, Ioannis; SpringerLink (Online service)
Format: eBook
Language: English
Published: Cham : Springer, [2021]
Physical Description: 1 online resource (xxv, 501 pages) : illustrations (chiefly color).
Series: Lecture notes in computer science ; 12573.
LNCS sublibrary. Information systems and applications, incl. Internet/Web, and HCI.
Table of Contents:
  • Intro
  • Preface
  • Organization
  • Contents, Part II
  • Contents, Part I
  • MSCANet: Adaptive Multi-scale Context Aggregation Network for Congested Crowd Counting
  • 1 Introduction
  • 2 Related Work
  • 3 Proposed Method
  • 3.1 Multi-scale Context Aggregation Module
  • 3.2 Multi-scale Context Aggregation Network
  • 3.3 Compared to Other Context Modules
  • 4 Experiments
  • 4.1 Datasets
  • 4.2 Implementation Details
  • 4.3 Evaluation Metrics
  • 4.4 Comparison with State-of-the-Arts
  • 4.5 Ablation Study
  • 5 Conclusion
  • References
  • Tropical Cyclones Tracking Based on Satellite Cloud Images: Database and Comprehensive Study
  • 1 Introduction
  • 2 The Proposed TCTSCI Database
  • 2.1 Data Preprocessing
  • 2.2 Annotation
  • 2.3 Attributes
  • 3 Evaluation
  • 3.1 Evaluation Metric
  • 3.2 Evaluated Trackers
  • 3.3 Evaluation Results with OPE
  • 3.4 Evaluation Results with EAO
  • 3.5 Analysis
  • 4 Conclusion
  • References
  • Image Registration Improved by Generative Adversarial Networks
  • 1 Introduction
  • 2 Proposed Method
  • 2.1 Background
  • 2.2 Proposed Network Structure
  • 2.3 Loss Function
  • 3 Experiments
  • 3.1 Dataset
  • 3.2 Implementation Details
  • 3.3 Results
  • 4 Conclusion
  • References
  • Deep 3D Modeling of Human Bodies from Freehand Sketching
  • 1 Introduction
  • 2 Related Work
  • 3 Our Method
  • 3.1 Intermediate Skeleton Construction
  • 3.2 Joint-Wise Pose Regression
  • 3.3 Loss
  • 4 Experiments and Discussion
  • 4.1 Dataset
  • 4.2 Network Details and Training Settings
  • 4.3 Results and Discussion
  • 4.4 Body Modeling by Freehand Sketching
  • 5 Conclusions
  • References
  • Two-Stage Real-Time Multi-object Tracking with Candidate Selection
  • 1 Introduction
  • 2 Related Work
  • 2.1 Tracking-by-Detection Methods
  • 2.2 Simultaneous Detection and Tracking Methods
  • 3 Proposed Method
  • 3.1 Backbone Network
  • 3.2 Two Branches
  • 3.3 Candidate Selection
  • 3.4 Cascade Data Association
  • 4 Experiments
  • 4.1 Datasets and Metrics
  • 4.2 Implementation Details
  • 4.3 Experimental Results
  • 5 Conclusion
  • References
  • Tell as You Imagine: Sentence Imageability-Aware Image Captioning
  • 1 Introduction
  • 2 Related Work
  • 3 Image Captioning Considering Imageability
  • 3.1 Data Augmentation
  • 3.2 Sentence Imageability Calculation
  • 3.3 Image Captioning
  • 4 Evaluation
  • 4.1 Environment
  • 4.2 Analysis on the Sentence Imageability Scores
  • 4.3 Evaluation of Image Captioning Results
  • 4.4 Subjective Evaluation
  • 5 Conclusion
  • References
  • Deep Face Swapping via Cross-Identity Adversarial Training
  • 1 Introduction
  • 2 Related Works
  • 3 Our Approach
  • 3.1 Network Architecture
  • 3.2 Model Objective
  • 4 Implementation
  • 5 Experiments and Analysis
  • 5.1 Qualitative Results
  • 5.2 Quantitative Results
  • 5.3 Ablation Study
  • 5.4 Difficult Cases
  • 6 Conclusion
  • References