Technical Program

PDF Version: Program Detail | Timetable

Monday, 5 February PROGRAM

 

 

8.30-10.00

Session 0: Opening Speech & Keynote (1)

(venue)

10.00-10.30

Refreshment Break

(venue)

10.30-12.00

Session 1: Best Paper Session

(venue)

Chair: TBA

 

1 .  [18] LOCO: Local Context based Faster R-CNN for Small Traffic Sign Detection

2 .  [86] Crowd Distribution Estimation with Multi-scale Recursive Convolutional Neural Network

3 .  [110] A Markov Network Based Passage Retrieval Method for Multimodal Question Answering in the Cultural Heritage Domain

4 .  [170] Frame-Based Classification of Operation Phases in Cataract Surgery Videos

12.00-14.00

Lunch

(venue)

VBS preparation

(venue)

14.00-15.30

Session 2: [TBA]

(venue)

15.30-16.00

Refreshment Break

(venue)

16.00-18.00

Session 3: Video Browser Showdown (VBS) Competition

(venue)

Chairs: Cathal Gurrin, Werner Bailer, Klaus Schoeffmann, Jakub Lokoc, Kunwadee Sripanidkulchai

 

1 .     [195] Enhanced VIREO KIS at VBS 2018

2 .     [184] Competitive Video Retrieval with vitrivr

3 .     [202] The ITEC Collaborative Video Search System at the Video Browser Showdown 2018

4 .     [192] Sketch-based Similarity Search for Collaborative Feature Maps               

5 .     [200] Video Search Based on Semantic Extraction and Locally Regional Object Proposal         

6 .     [189] Revisiting SIRET Video Retrieval Tool 

7 .     [201] Sloth Search System 

8 .     [188] VERGE IN VBS 2018   

9 .     [196] Fusing keyword search and visual exploration for untagged videos     

17.00-20.00

Welcome reception [International Cocktails]

(venue)


Tuesday, 6 February PROGRAM

 

 

 

9.00-10.00

Session 4: Keynote (2)

(venue)

 

10.00-10.30

Refreshment Break

(venue)

 

10.30-12.00

Session 5A: Music, Audio, Art

(venue)

Chair: TBA

Session 5B: Multimedia Application

(venue)

Chair: TBA

 

1 .  [21] Stitch-Based Image Stylization for Thread Art Using Sparse Modeling

2 .  [141] Towards Demographic-based Photographic Aesthetics Prediction for Portraitures

3 .  [48] An RNN-Based Speech-Music Discrimination Used for Hybrid Audio Coder

4 .  [5] Triplet Convolutional Network for Music Version Identification

1 .  [171] Multimodal Augmented Reality – Augmenting Auditory-Tactile Feedback to Change the Perception of Thickness

2 .  [173] Spatiotemporal 3D Models of Aging Fruit from Multi-View Time-Lapse Videos

3 .  [91] Two-level Segment-based Bitrate Control for Live ABR Streaming

4 .  [9] Multi-Hypothesis-based Error Concealment for Whole Frame Loss in HEVC 

12.00-13.30

Lunch

(venue)

VBS preparation

(venue)

13.30-15.00 

Session 6A: Quality Enhancement

(venue)

Chair: TBA

Session 6B: SS – Multimedia Analytics: Perspectives, Techniques and Applications

Chairs: Laurent Amsaleg, Cathal Gurrin, Björn Þór Jónsson, Stevan Rudinac 

 

1 .  [118] Parameter Selection for Denoising Algorithms Using NR-IQA with CNN

2 .  [100] Co-occurrent Structural Edge Detection for Color-Guided Depth Map Super-resolution

3 .  [19] Shallow-water Image Enhancement Using Relative Global Histogram Stretching Based on Adaptive Parameter Acquisition

4 .  [97] The CAMETRON lecture recording system: High quality video recording and editing with minimal human supervision 

1 .  [158] Rethinking Summarization and Storytelling for Modern Social Multimedia

2 .  [63] On the Traceability of Results from Deep Learning-based Cloud Services

3 .  [64] Category Specific Post Popularity Prediction

4 .  [167] Image Aesthetics and Content in Selecting Memorable Keyframes from Lifelogs

5 .  [46] Approaches for Event Segmentation of Visual Lifelog Data

15.00-16.30 

Session 7A: Face and Text Processing

(venue)

Chair: TBA 

Session 7B: Indexing and Classification

(venue)

Chair: TBA

 

1 .  [150] Teacher and Student Joint Learning for Compact Facial Landmark Detection Network

2 .  [34] Font Recognition in Natural Images via Transfer Learning

3 .  [147] Uyghur Text Localization with Fast Component Detection

4 .  [94] Text Image Deblurring via Intensity Extremums Prior

1 .  [84] Learning to Index in Large-Scale Datasets

2 .  [10] Iterative Active Classification of Large Image Collection

3 .  [36] Cost-sensitive Deep Metric Learning for Fine-Grained Image Classification

16.30-17.00

Refreshment Break

(venue)

 

17.00-18.30

Session 8A: Video Search and Recognition

(venue)

Chair: TBA

Session 8B: Video and Image Processing

(venue)

Chair: TBA

 

1 .  [180] Evaluation of Visual Content Descriptors for Supporting Ad-Hoc Video Search Tasks at the Video Browser Showdown

2 .  [104] A Novel 3D Human Action Recognition Framework for Video Content Analysis

3 .  [102] Multi-Stream Fusion Model for Social Relation Recognition from Videos

1 .  [132] A motion-driven approach for fine-grained temporal segmentation of user-generated videos

2 .  [164] Real-Time Polyps Segmentation for Colonoscopy Video Frames Using Compressed Fully Convolutional Network

3 .  [124] Find me a sky: a data-driven method for color-consistent sky search & replacement

4 .  [39] Remote Sensing Image Fusion Based on Two-stream Fusion Network

19.00-21.00

 

MMM 2018 Banquet

(venue) 

 

 

Wednesday, 7 February PROGRAM

 

 

 

9.00-10.00

Session 9: Keynote (3)

(venue)

 

10.00-10.30

Refreshment Break

(venue)

 

10.30-12.00

Session 10A: Cross-modal Retrieval and Processing

(venue)

Chair: TBA

Session 10B: Visual Parsing and Tracking

(venue)

Chair: TBA

 

1 .  [159] Efficient and Interactive Spatial-Semantic Image Retrieval

2 .  [35] Recursive Pyramid Network with Joint Attention for Cross-media Retrieval

3 .  [172] Improving the quality of video-to-language models by optimizing annotation of the training material

4 .  [52] Deep Convolutional Neural Network for Correlating Images and Sentences

1 .  [82] Reinforcing Pedestrian Parsing on Small Scale Dataset

2 .  [69] Deep Pedestrian Detection using Contextual Information and Multi-level Features

3 .  [65] REVT: Robust and Efficient Visual Tracking by Region-Convolutional Regression Network

4 .  [25] Collision-Free LSTM for Human Trajectory Prediction

12.00-13.30

Lunch

(venue)

VBS preparation

(venue)

13.30-15.00 

Session 11A: Vision & Graphics

(venue)

Chair: TBA

Session 11B: Deep Learning and Its Applications

(venue)

Chair: TBA

 

1 .  [50] Light Field Foreground Matting Based on Defocus and Correspondence

2 .  [3] Dual-way Guided Depth Image Inpainting With RGBD Image Pairs

3 .  [15] High-Precision 3D Coarse Registration Using RANSAC and Randomly-Picked Rejections

4 .  [22] AGO: Accelerating Global Optimization for Accurate Stereo Matching

1 .  [149] Convolution with Logarithmic Filter Groups for Efficient Shallow CNN

2 .  [14] Image Aesthetic Distribution Prediction with Fully Convolutional Network

3 .  [140] A Method of weather radar echo extrapolation based on convolutional neural networks

4 .  [162] Adaptive Image Representation using Information Gain and Saliency: Application to Cultural Heritage Datasets

15.00-17.00 

Refreshment Break

(venue)

 

Session 12A: Poster Session

(venue)

Chairs: TBA

Session 12B: Demonstration Session

(venue)

Chairs: Wolfgang Huerst, Joemon Jose, Suvit Nakpeerayuth

 

1 .    [2] Scalable Bag of Selected Deep Features for Visual Instance Retrieval

2 .    [27] Efficient Two-layer Model towards Cover Song Identification

3 .    [32] Source Distortion Estimation for Wyner-Ziv Distributed Video Coding                 

4 .    [37] A Novel Frontal Facial Synthesis Algorithm based on Individual Residual Faces

5 .    [43] A Text Recognition and Retrieval System for e-Business Image Management

6 .    [53] ShapeCreator: 3D Shape Generation from Isomorphic Datasets Based on Autoencoder   

7 .     [57] k-Labelsets for Multimedia Classification with Global and Local Label Correlation

8 .    [58] Domain Invariant Subspace Learning for Cross-modal Retrieval                 

9 .    [72] Coarse-to-Fine Image Super-Resolution Using Convolutional Neural Networks

10 . [76] CNN-Based DCT-Like Transform for Image Compression

11 . [83] Data Augmentation for EEG-Based Emotion Recognition with Deep Convolutional Neural Networks

12 . [88] A New Accurate Image Denoising Method Based on Sparse Coding Coefficients

13 . [90] Fusion Networks for Air-writing Recognition

14 . [96] Cloud of Line Distribution and Random Forest Text Detection from Natural/Video Scene Images         

15 .  [98] Room Floor Plan Generation on a Project Tango Device

16 . [101] Person Re-identification by Incorporating PCA Loss in CNN

17 . [105] Ensemble System for First Person Interactive Action Recognition

18 . [108] SRN: The Movie Character Relationship Analysis via Social Network

19 . [111] Effective Action Detection using Temporal Context and Posterior Probability of Length

20 . [115] Vehicle Semantics Extraction and Retrieval for Long-term Carpark Video Surveillance

21 . [117] Implicit Affective Video Tagging using Pupillary Response                

22 .  [122] Accurate Detection for Scene Texts with a Cascaded CNN Networks

23 . [127] Robust and Real-Time Visual Tracking based on Complementary Learners

24 . [135] The Long Tail of Web Video

25 . [151] Venue Prediction for Social Images by Exploiting Rich Temporal Patterns in LBSNs              

26 . [155] LVFS: a Lightweight Video Storage File System for IP Camera-based Surveillance Applications

27 . [160] Food Photo Recognition for Dietary Tracking: System and Experiment         

28 .  [169] SeqSense: video recommendation using topic sequence mining

 

1 .  [126] Lifelog Exploration Prototype in Virtual Reality              

2 .  [181] Auto accessory segmentation and interactive try-on system

3 .  [185] ActionVis: An Explorative Tool to Visualize Surgical Actions in Gynecologic Laparoscopy

4 .  [186] Automatic Smoke Classification in Endoscopic Video      

5 .  [187] Depth Representation of LiDAR Point Cloud with Adaptive Surface Patching for Object Classification

6 .  [190] Ontlus: 3D Content Collaborative Creation via Virtual Reality    

7 .  [191] Programmatic 3D Printing of a Revolving Camera Track to Automatically Capture Dense Images for 3D Scanning of Objects           

8 .  [193] Video Browsing on a Circular Timeline        

9 .  [194] Multi-Camera Microenvironment to Capture Multi-view Time-lapse Videos for 3D Analysis of Aging Objects 

10 .   [197] A Virtual Reality Interface for Interactions with Spatiotemporal 3D Data              

11 .   [198] AR DeepCalorieCam: An iOS App for Food Calorie Estimation with Augmented Reality         

12 .   [199] ImageX – Explore and search local/private images               

 

17.00-17.30

 

Closing Session

(venue)