MultiMedia Modeling: 24th International Conference, MMM 2018, Bangkok, Thailand, February 5-7, 2018, Proceedings, Part I (Lecture Notes in Computer Science)

The two-volume set LNCS 10704 and 10705 constitutes the thoroughly refereed proceedings of the 24th International Conference on Multimedia Modeling, MMM 2018, held in Bangkok, Thailand, in February 2018.

Of the 185 full papers submitted, 46 were selected for oral presentation and 28 for poster presentation. In addition, 5 papers were accepted for Multimedia Analytics: Perspectives, Techniques, and Applications, 12 extended abstracts for demonstrations, and 9 papers for Video Browser Showdown 2018. All papers presented were carefully reviewed.

Chapter 1. A Markov Network Based Passage Retrieval Method for Multimodal Question Answering in the Cultural Heritage Domain
Chapter 2. A Method of Weather Radar Echo Extrapolation Based on Convolutional Neural Networks
Chapter 3. A Motion-Driven Approach for Fine-Grained Temporal Segmentation of User-Generated Videos
Chapter 4. A Novel Human Action Recognition Framework for Video Content Analysis
Chapter 5. Adaptive Image Representation Using Information Gain and Saliency: Application to Cultural Heritage Datasets
Chapter 6. AGO: Accelerating Global Optimization for Accurate Stereo Matching
Chapter 7. An RNN-Based Speech-Music Discrimination Used for Hybrid Audio Coder
Chapter 8. Co-occurrent Structural Edge Detection for Color-Guided Depth Map Super-Resolution
Chapter 9. Collision-Free LSTM for Human Trajectory Prediction
Chapter 10. Convolution with Logarithmic Filter Groups for Efficient Shallow CNN
Chapter 11. Cost-Sensitive Deep Metric Learning for Fine-Grained Image Classification
Chapter 12. Crowd Distribution Estimation with Multi-scale Recursive Convolutional Neural Network
Chapter 13. Deep Convolutional Neural Network for Correlating Images and Sentences
Chapter 14. Deep Pedestrian Detection Using Contextual Information and Multi-level Features
Chapter 15. Dual-Way Guided Depth Image Inpainting with RGBD Image Pairs
Chapter 16. Efficient and Interactive Spatial-Semantic Image Retrieval
Chapter 17. Evaluation of Visual Content Descriptors for Supporting Ad-Hoc Video Search Tasks at the Video Browser Showdown
Chapter 18. Find Me a Sky: A Data-Driven Method for Color-Consistent Sky Search and Replacement
Chapter 19. Font Recognition in Natural Images via Transfer Learning
Chapter 20. Frame-Based Classification of Operation Phases in Cataract Surgery Videos
Chapter 21. High-Precision 3D Coarse Registration Using RANSAC and Randomly-Picked Rejections
Chapter 22. Image Aesthetic Distribution Prediction with Fully Convolutional Network
Chapter 23. Improving the Quality of Video-to-Language by Optimizing Annotation of the Material
Chapter 24. Iterative Active Classification of Large Image Collection
Chapter 25. Learning to Index in Large-Scale Datasets
Chapter 26. Light Field Foreground Matting Based on Defocus and Correspondence
Chapter 27. LOCO: Local Context Based Faster R-CNN for Small Traffic Sign Detection
Chapter 28. Multi-hypothesis-Based Error Concealment for Whole Frame Loss in HEVC
Chapter 29. Multi-stream Fusion Model for Social Relation Recognition from Videos
Chapter 30. Multimodal Augmented Reality – Augmenting Auditory-Tactile Feedback to Change the Perception of Thickness
Chapter 31. Parameter Selection for Denoising Algorithms Using NR-IQA with CNN
Chapter 32. Real-Time Polyps Segmentation for Colonoscopy Video Frames Using Compressed Fully Convolutional Network
Chapter 33. Recursive Pyramid Network with Joint Attention for Cross-Media Retrieval
Chapter 34. Reinforcing Pedestrian Parsing on Small Scale Dataset
Chapter 35. Remote Sensing Image Fusion Based on Two-Stream Fusion Network
Chapter 36. REVT: Robust and Efficient Visual Tracking by Region-Convolutional Regression Network
Chapter 37. Shallow-Water Image Enhancement Using Relative Global Histogram Stretching Based on Adaptive Parameter Acquisition
Chapter 38. Spatiotemporal 3D Models of Aging Fruit from Multi-view Time-Lapse Videos
Chapter 39. Stitch-Based Image Stylization for Thread Art Using Sparse Modeling
Chapter 40. Teacher and Student Joint Learning for Compact Facial Landmark Detection Network
Chapter 41. Text Image Deblurring via Intensity Extremums Prior
Chapter 42. The CAMETRON Lecture Recording System: High Quality Video Recording and Editing with Minimal Human Supervision
Chapter 43. Towards Demographic-Based Photographic Aesthetics Prediction for Portraitures
Chapter 44. Triplet Convolutional Network for Music Version Identification
Chapter 45. Two-Level Segment-Based Bitrate Control for Live ABR Streaming
Chapter 46. Uyghur Text Localization with Fast Component Detection
Chapter 47. Approaches for Segmentation of Visual Lifelog Data
Chapter 48. Category Specific Post Popularity Prediction
Chapter 49. Image Aesthetics and Content in Selecting Memorable Keyframes from Lifelogs
Chapter 50. On the Traceability of Results from Deep Learning-Based Cloud Services
Chapter 51. Rethinking Summarization and Storytelling for Modern Social Multimedia

  • Title: MultiMedia Modeling: 24th International Conference, Part I
  • Length: 648 pages
  • Edition: 1st ed. 2018
  • Language: English
  • Publisher:
  • Publication Date: 2018-02-17
  • ISBN-10: 3319736027
  • ISBN-13: 9783319736020

