August 29 – Workshop Day

09:00-09:10 Welcome and objectives of the workshop.
09:10-09:30 ResnetCrowd: A Residual Deep Learning Architecture for Crowd Counting, Violent Behaviour Detection and Crowd Density Level Classification
Mark A Marsden, Kevin McGuinness, Suzanne Little and Noel O'Connor
09:30-09:50 Background Modelling Based on Generative Unet.
Ye Tao, Petar Palasek, Zhihao Ling and Ioannis Patras
09:50-10:10 Assessing Post-Detection Filters for a Generic Pedestrian Detector in a Tracking-By-Detection Scheme.
Volker Eiselein, Erik Bochinski and Thomas Sikora
10:10-10:30 Semantic Filtering for Video Stabilization.
Konstantinos Karageorgos, Anastasios Dimou, Apostolos Axenopoulos, Federico Alvarez and Petros Daras
11:00-11:20 Background initialisation by spatio-temporal motion estimation.
Sriram Varadarajan, Hui Wang, Bryan Scotney and Omar Nibouche
11:20-11:40 A Robust method for the Recognition of Palmprints.
Omar Nibouche, Hui Wang, Sriram Varadarajan and Bryan Scotney
14:00 - 17:00
14:00-14:15 Welcome from organizers
14:15-14:45 Prediction of learning space occupation through WLAN access point data using Kalman filter and Gradient Boosting Regression
Stefan Selzer, Stylianos Asteriadis, Marius Politze
14:45-15:15 Person Tracking Association Using Multimodal Systems
A. Belmonte-Hernández, V. Solachidis, T. Theodoridis, G. Hernández-Penaloza, G. Conti, N. Vretos, F. Álvarez and P. Daras
16:00-16:30 Multimodal monitoring of Parkinson’s and Alzheimer’s patients using the ICT4LIFE platform
Federico Alvarez, Mirela Popa, Nicholas Vretos, Alberto Belmonte-Hernández, Stelios Asteriadis, Vassilis Solachidis, Triana Mariscal, Dario Dotti, Petros Daras
16:30-17:00 ICT4Life Open Source Libraries supporting Multimodal Analysis of different diseases
Thomas Theodoridis, Vassilis Solachidis, Nicholas Vretos, Petros Daras
17:00 Discussion and conclusions
9:00 - 19:00
09:00-09:10 Organizers Welcome Speech
09:10-09:30 Talk from Nvidia
Serge Palaric
09:30-09:50 Holistic Recognition of Low Quality License Plates by CNN using Track Annotated Data
Jakub Špaňhel, Jakub Sochor, Roman Juránek, Adam Herout, Lukáš Maršík, Pavel Zemčík
09:50-10:10 Park Smart
Daniele Di Mauro, Marco Moltisanti, Giovanni Patanè, Sebastiano Battiato, Giovanni Maria Farinella
10:10-10:30 Abnormal Crowd Behavior Detection Using Novel Optical Flow-Based Features
Cem Direkoglu, Melike Sah, Noel O'Connor
11:00-11:20 Spatial Pyramid Context-Aware Moving Vehicle Detection and Tracking in Urban Aerial Imagery
Mahdieh Poostchi, Kannappan Palaniappan, Guna Seetharaman
11:20-11:40 Online pedestrian tracking with multi-stage re-identification
Yifan Jiang, Hyunhak Shin, Jaeyong Ju, Hanseok Ko
11:40-12:00 Multi-Pedestrian Detection and Tracking using Unified Multi-channel Features
Young Chul Lim, Min Sung Kang
12:00-12:20 Deep Trajectory Representation-Based Clustering for Motion Pattern Extraction in Videos
Jonathan Boyle, Tahir Nawaz, James Ferryman
12:20-12:40 Semantic Labeling for improved Vehicle Detection in Aerial Imagery
Lars Sommer, Kun Nie, Arne Schumann, Tobias Schuchert, Jürgen Beyerer
14:00-14:20 Dynamic representations for autonomous driving
Juan Sebastian Olier, Pablo Marín-Plaza, David Martín, Lucio Marcenaro, Emilia Barakova, Matthias Rauterberg, Carlo Regazzoni
14:20-14:40 Combining LiDAR Space Clustering and Convolutional Neural Networks for Pedestrian Detection
Damien Matti, Hazım Kemal Ekenel, Jean-Philippe Thiran
14:40-15:00 An Open-data, Agent-based Model of Alcohol Related Crime
Joseph Redfern, Kirill Sidorov, Paul L. Rosin, Simon C. Moore, Padraig Corcoran, David Marshall
15:00-15:20 Multimodal Vehicle Type Classification Using Convolutional Neural Network and Statistical Representations of MFCC
Berkay Selbes, Mustafa Sert
16:00-16:20 Challenge Presentation
Siwei Lyu
16:20-17:30 Challenge spotlight presentations
17:30-19:00 Poster session for challenge works
Robust Multi-object Tracking with Semantic Color Correlation
Noor Al-Shakarji, Filiz Bunyak, Guna Seetharaman, Kannappan Palaniappan
Joint Tracking with Event Grouping and Temporal Constraints
Wei Tian, Martin Lauer
High-Speed Tracking-by-Detection Without Using Image Information
Erik Bochinski, Volker Eiselein, Thomas Sikora
Sequential Sensor Fusion Combining Probability Hypothesis Density and Kernelized Correlation Filters for Multi-Object Tracking in Video Data
Tino Kutschbach, Erik Bochinski, Volker Eiselein, Thomas Sikora
Geometric Proposals for Faster R-CNN
Sikandar Amin, Fabio Galasso
Vehicle Detection with Sub-Class Training using R-CNN for the DETRAC Benchmark
Sitapa Rujikietgumjorn, Nattachai Watcharapinchai
Evolving boxes for fast vehicle detection (E-B)
Li Wang, Yao Lu, Hong Wang, Yingbin Zheng, Hao Ye, Xiangyang Xue
Towards lightweight convolutional neural networks for object detection (SSDR)
Dmitriy Anisimov, Tatiana Khanova
Faster R-CNN with ResNet101 (FRCNN-Res).
Nenghui Song, Yi Wei, Ming-Ching Chang
Region-based Deformable Fully Convolutional Network (DFCN).
Shuo Wang, Koray Ozcan
Higher-order Graph and Flow network based Tracker (HGFT).
Xiaoyi Yu, Guang Han
Multi-task Deep Learning for Fast Online Multiple Object Tracking (MTT)
Yuqi Zhang, Yongzhen Huang, Liang Wang
9:00 - 12:40
9:00-9:10 Opening Remarks
Regular papers
9:10-9:30 Preventive Maintenance of Critical Infrastructures using 5G Networks & drones
Theodore Zahariadis, Lambros Sarakis, Artemis Voulkidis, Panagiotis Karkazis, Panagiotis Trakadas
9:30-9:50 Tracking and following a moving object with a quadcopter
Ricardo Fonseca, Werner Creixell
9:50-10:10 Ordered Minimum Distance Bag-of-Words Approach for Aerial Object Identification
Eren Unlu, Emmanuel Zenou, Nicolas Riviere
10:10-10:30 Flying Object Detection for Automatic UAV Recognition
Lars Sommer, Arne Schumann, Thomas Muller, Tobias Schuchert, Jurgen Beyerer
11:00-11:10 Presentation of the challenge
Challenge papers
11:10-11:30 A Study on Detecting Drones Using Deep Convolutional Neural Networks
Muhammad Saqib, Nabin Sharma, Sultan Daud Khan Makkah, Michael Blumenstein
11:30-11:50 Using Deep Networks for Drone Detection
Cemal Aker, Sinan Kalkan
11:50-12:10 Drone Detection Using Combined Motion and Shape Features
Mohammad Farhadi, Ruhallah Amandi
12:10-12:30 Deep Cross-Domain Flying Object Classification for Robust UAV Detection
Arne Schumann, Lars Sommer, Johannes Klatte, Tobias Schuchert, Jurgen Beyerer
12:30-12:40 Closing remarks
9:00 - 12:30
Session 1
09:00-09:10 Workshop on Signal Processing for Understanding Crowd Dynamics: Welcome and Introduction
L. Marcenaro and N. Conci
09:10-09:30 Weakly supervised training of deep convolutional neural networks for overhead pedestrian localization in depth fields
A. Corbetta*, V. Menkovski, and F. Toschi
09:30-09:50 Active Estimation of Motivational Spots for Modeling Dynamic Interactions
J.S. Olier*, D.A. Campo, L. Marcenaro, E. Barakova, M. Rauterberg, and C.S. Regazzoni
09:50-10:10 A novel crowd density estimation technique using local binary pattern and Gabor features
A.K. Pai*, K.A. Kotegar, and R. U
10:10-10:30 Active learning for high-density crowd count regression
J. Vandoni, E. Aldea*, and S. Le Hégarat-Mascle
Session 2
11:00-11:20 Gathering of Data under Laboratory Conditions for the Deep Analysis of Pedestrian Dynamics in Crowds
M. Boltes*, J. Schumann, and D. Salden
11:20-11:40 Data-driven Crowd Simulation
N. Bisagno*, B. Zhang, and N. Conci
11:40-12:30 Panel discussion. Crowd analytics: where are we now, and what's next?
9:00 - 12:30
9:00-9:30 Opening Session
9:30-10:00 Real-time HOG-based Pedestrian Detection in Thermal Images for an Embedded System
Sitapa Rujikietgumjorn and Nattachai Watcharapinchai
10:00-10:30 Software Framework for Tensor Stream Processing on Embedded Vision Platforms
Bogusław Cyganek
11:00-11:30 A real-time system for audio source localization with cheap sensor device
Alessia Saggese, Nicola Strisciuglio, Mario Vento, Nicolai Petkov
11:30-12:00 An Enhanced System on Chip-Based Sobel Edge Detector
Ahmed S. Khalil, Mohamed Shalaby, Emad Hegazi
12:00-12:30 Discussion and Closing Session
14:00 - 17:00
First Session
14:00-14:30 Improving Automation of Subsea Robotics Tasks by Integration of Visual Cues in 2-D Optical and FL Sonar Images
Invited talk Prof. S. Negahdaripour
14:30-15:00 Information Centric Networking Architectures in Safety Enforcement Services: perspectives and challenges
Invited talk Prof. L. Alfredo Grieco
15:00-15:15 Monitoring and Mapping with Robot Swarms for Agricultural Applications
Dario Albani, Joris IJsselmuiden, Ramon Haken, Vito Trianni
15:15-15:30 A Multisensor Platform for Comprehensive Detection of Crop Status: Results from two Case Studies
Stefan Rilling, Michael Nielsen, Annalisa Milella, Christian Jestel, Peter Fröhlich, Giulio Reina
Second Session
16:00-16:15 Review on Research Studies and Monitoring System Applied to Cetaceans in the Gulf of Taranto
R. Carlucci, R. Maglietta, G. Buscaino, G. Cipriano, A. Milella, V. Pollazzon, P. Bondanese, C. De Leonardis, S. Mona, M. Nitti, E. Papale, V. Renò, P. Ricci, E. Stella, C. Fanizza
16:15-16:30 Information-Centric Networking in Environmental Monitoring: an overview on publish-subscribe implementations
Agnese V. Ventrella, Luigi Alfredo Grieco, and Giuseppe Piro
16:30-16:45 Survey and navigation in agricultural environments using robotic technologies
Rocco Galati, Giulio Reina, Arcangelo Messina, Angelo Gentile
16:45-17:00 What has been missed for Real Life Driving? An Inspirational Thinking from Human Innate Biases
Jiawei Xu, Yu-An Chen, Kun Guo, Jiheng Wang, Federica Menchinelli, Chao Jiang, Chuang Zhang, Ling Shao

August 30 – Conference Day 1


08:50 - 09:00 Welcome (General Chairs)
09:00 - 10:00 Keynote Talk 1
Seeing Objects and People in the 3D world: Visual Intelligence in Perspective

Silvio Savarese
Stanford University

Abstract: Computers can now recognize objects from images, classify simple human activities or reconstruct the 3D geometry of an environment. However, these achievements are far from the kind of coherent and integrated interpretations that humans are capable of from just a quick glance at the complex 3D world. When we look at an environment, we don't just recognize the objects in isolation, but rather perceive a rich scenery of the 3D space, its objects, the people and all the relations among them. This allows us to effortlessly navigate through the environment, or to interact with objects in the scene with amazing precision or to predict what is about to happen next. In this talk I will give an overview of the research from my group and discuss our latest work on designing visual models that can process different sensing modalities and enable intelligent understanding of the sensing data. I will also demonstrate that our models are potentially transformative in application areas related to autonomous or assisted navigation, smart environments, social robotics, augmented reality, and large-scale information management.
10:30 - 12:10 Oral Session 1 - Person Re-identification
Session Chair: Giovanni Maria Farinella
10:30 - 10:55 Deep Spatial Pyramid for Person Re-identification
Slawomir Bak, Peter Carr
10:55 - 11:20 Exploiting Gaussian Mixture Importance for Person Re-identification
Xiangping Zhu, Amran Bhuiyan, Mohamed Lamine Mekhalfi, Vittorio Murino
11:20 - 11:45 Multi-region Bilinear Convolutional Neural Networks for Person Re-Identification
Evgeniya Ustinova, Victor Lempitsky
11:45 - 12:10 Triplet CNN and Pedestrian Attribute Recognition for Improved Person Re-identification
Yiqiang Chen, Stefan Duffner, Andrei STOIAN, Jean-yves DUFOUR, Atilla BASKURT
14:00 - 15:40 Industrial Surveillance Day
Session 1 – Industrial Presentations

Session Chair: Ming-Ching Chang, University at Albany
Welcome & Introduction by Session Chairs
NVIDIA: Serge PALARIC (Senior Director Sales and Marketing EMEAI)
Holger Fillbrandt (BOSCH)
MN: Eric Karmouch (Lead Research Engineer USA)
OSRAM: Fabio Galasso (Germany)
Leonardo-Finmeccanica: Francesco Calabrò (Italy)
AIIC - Italian Association of Critical Infrastructures' Experts
Priscilla Inzerilli (Italy)
GE: Ming-Ching Chang (USA)
16:10 - 17:30 Industrial Surveillance Day
Session 2 - AVSS Challenges: Dataset & Poster Presentations

Session Chair: Ming-Ching Chang, University at Albany Research
- Challenge on Advanced Traffic Monitoring
Siwei Lyu (University at Albany, USA)
- Challenge on Drone-vs-Bird Detection
Angelo Coluccia (University of Salento, Italy)
Challenge Poster Presentations
17:30 Closing Remarks
18:00 - 19:30 Guided City Tour
19:30 - 23:00 Welcome Reception at Alex Restaurant


August 31 – Conference Day 2

08:30 Registration & Poster Set-up
08:50-09:00 Announcements by Organizers
09:00 - 10:00 Keynote Talk 2
Saliency and Personalization in Deep Models of Human Activity

Stan Sclaroff
Boston University

Abstract: What is visually salient in models for classification of human activities? How can we adapt and better personalize models of human movements, activities, and gestures? In this talk, I will report on our recent research related to computer-vision based tracking and analysis of human actions, interactions and communicative behaviors. I will describe new methods we have developed for top-down saliency estimation in convolutional neural networks and recurrent neural network models, with applications to space-time localization and classification of human activities in video. I will also describe our new formulation for personalizing gesture recognition using hierarchical Bayesian neural networks (HBNNs). Our HBNN models can adapt themselves to new subjects when only a small amount of subject-specific personalization data is available.
10:30 - 12:10 Oral Session 2 - Action and Event Recognition
Session Chair: Stan Sclaroff
10:30 - 10:55 Structured LSTM for Human-Object Interaction Detection and Anticipation
Anh Truong, Atsuo Yoshitaka
10:55 - 11:20 Action Localization in Video using a Graph-based Feature Representation
Iveel Jargalsaikhan, Noel O’Connor, Suzanne Little
11:20 - 11:45 Enhancing audio surveillance with Deep Neural Networks
Federico Colangelo, Federica Battisti, Marco Carli, Alessandro Neri, Francesco Calabrò
11:45 - 12:10 Inferring State Transition from Bystander to Participant in Free-style Conversational Interaction
Tatsuya Era, Hiroki Yoshimura, Masashi Nishiyama, Yoshio Iwai
12:10 - 12:40 Poster Spotlight
14:00 - 15:30 Poster Session 1 - Detection and Tracking
NOTE: posters of oral papers presented in ORAL SESSION 1 and ORAL SESSION 2 will be hosted in this poster session
Active Collaborative Ensemble Tracking
Kourosh Meshgi, Maryam Sadat Mirzaei, Shigeyuki Oba, Shin Ishii
Background Modeling using Adaptive Properties of Hybrid Features
Jaemyun Kim, Adin Ramirez Rivera, Byeongwoo Kim, Kaushik Roy, Oksam Chae
Combining Spatial and Temporal Features for Crowd Counting with Point Supervision
Haiying Jiang
Analytics of Deep Neural Network in Change Detection
Tsubasa Minematsu, Atsushi Shimada, Rin-Ichiro Taniguchi
Movies Tags Extraction Using Deep Learning
Umair Khan, Miguel Amor, Naveed Ejaz, Heiko Sparenberg
Multi-scale Histogram Tone Mapping Algorithm Enables Better Object Detection in Wide Dynamic Range Images
Jie Yang
Robust License Plate Detection In The Wild
Gee-Sern Hsu, ArulMurugan Ambikapa, Sheng-Luen Chung
Approximate License Plate String Matching for Vehicle Re-Identification

Motion Compensation of Submillimeter Wave 3D Imaging Radar Data for Security Screening
Maria Axelsson, Mikael Karlsson, Henrik Peterson
Attributes Co-occurrence Pattern Mining for Video-based Person Re-identification
Xiu Zhang, Federico Pala, Bir Bhanu
Semantic Annotation of Surveillance Videos for Abnormal Crowd Behaviour Search and Analysis
Melike Sah, Cem Direkoglu
A Signal Detection Theory Approach for Camera Tamper Detection
Pranav Mantini, Shishir K. Shah
Suspected Vehicle Detection for Driving without License Plate Using Symmelets and Edge Connectivity
Jun-Wei Hsieh
Joint Cost Minimization for Multi-Object Tracking
Fast gender recognition in videos using a novel descriptor based on the gradient magnitudes of facial landmarks
George Azzopardi, Antonio Greco, Alessia Saggese, Mario Vento
Aerial Video Surveillance System for Small-Scale UAV Environment Monitoring
Danilo Avola, Gian Luca Foresti, Niki Martinel, Christian Micheloni, Daniele Pannone, Claudio Piciarelli
16:00 - 18:05 Oral Session 3 - Object and people detection and tracking
Session Chair: Francois Bremond
16:00 - 16:25 People Detection in Top-View Fisheye Imaging
Oded Krams, Nahum Kiryati
16:25 - 16:50 CNN-based Cascaded Multi-task Learning of High-level Prior and Density Estimation for Crowd Counting
Vishwanath Sindagi, Vishal Patel
16:50 - 17:15 Multi-Object tracking using Multi-Channel Part Appearance Representation
Thi Lan Anh NGUYEN, Francois Bremond, Furqan Muhammed Khan, Farhood Negin
17:15 - 17:40 Active visual tracking in multi-agent scenarios
Yiming Wang, Andrea Cavallaro
17:40 - 18:05 An efficient and effective method for people detection from top-view depth cameras
Vincenzo Carletti, Luca Del Pizzo, Gennaro Percannella, Mario Vento
18:05 Closing Remarks
19:30 - 20:30 Bus to Castro Marina
20:30 - 00:00 Gala Dinner at Grotta del Conte Restaurant


September 1 – Conference Day 3

08:30 Registration & Poster Set-up
08:50-09:00 Announcements by Organizers
09:00 - 10:00 Keynote Talk 3
Geometry, Uncertainty and Deep Learning

Roberto Cipolla
Cambridge University

Abstract: Understanding what a model does not know is a critical part of safe machine learning systems. New tools, such as Bayesian deep learning, provide a framework for understanding uncertainty in deep learning models, aiding interpretability and safety of such systems. Additionally, knowledge of geometry is an important consideration in designing effective algorithms. In particular, we will explore the use of geometry to help design networks that can be trained with unlabelled data for stereo and for human body pose and shape recovery.
10:30 - 12:10 Oral Session 4 - Low Level Image Processing
Session Chair: to be announced
10:30 - 10:55 ADM-HIPaR: An Efficient Background Subtraction Approach
Thien Huynh-The, Sungyoung Lee, Cam-Hao Hua
10:55 - 11:20 Background Subtraction Using Encoder-Decoder Structured Convolutional Neural Network
Kyungsun Lim, Won-Dong Jang, Chang-Su Kim
11:20 - 11:45 An Evidential Framework for Pedestrian Detection in High-Density Crowds
Jennifer Vandoni, Emanuel Aldea, Sylvie Le Hégarat
11:45 - 12:10 An Adaptive Fusion Scheme of Color and Edge Features for Background Subtraction
Kaushik Roy, Jaemyun Kim, Md Tauhid Bin Iqbal, Farkhod Makhmudkhujaev, Byungyong Ryu, Oksam Chae
14:00 - 15:15 Oral Session 5 - Soft Biometrics
Session Chair: Gianluca Foresti
14:00 - 14:25 Generative Adversarial Models for People Attribute Recognition in Surveillance
Matteo Fabbri, Simone Calderara, Rita Cucchiara
14:25 - 14:50 Video-Based Single Sample Face Recognition Using Face Frontalization via Autoencoders Deep Neural Networks
Saman Bashbaghi, Mostafa Parchami, Eric Granger
14:50 - 15:15 Convolutional NNs for Face Recognition in Video Surveillance Using a Single Training Sample Per Person
Mostafa Parchami, Saman Bashbaghi, Eric Granger
15:15 - 15:45 Poster Spotlight
16:15 - 17:45 Poster Session 2 - Surveillance Systems
NOTE: posters of oral papers presented in ORAL SESSION 3, 4 and 5 will be hosted in this poster session
Action Recognition from Extremely Low-Resolution Thermal Image Sequence
Takayuki Kawashima, Yasutomo Kawanishi, Daisuke Deguchi, Ichiro Ide, Hiroshi Murase, Tomoyoshi Aizawa, Masato Kawade
Learning to Detect Violent Videos using Convolutional Long Short-Term Memory
Swathikiran Sudhakaran, Oswald Lanz
Latent Embeddings for Collective Activity Recognition
Yongyi Tang, Peizhen Zhang, Jian-Fang Hu, Wei-Shi Zheng
PASS: Privacy Aware Secure Signature Scheme for Surveillance Systems
Jihye Kim, Seunghwa Lee, Jungjun Yoon, Hankyung Ko, Seungri Kim, Hyunok Oh
Modeling and classification of trajectories based on a Gaussian process decomposition into discrete components
Damian Campo, Mohamad Baydoun, Lucio Marcenaro, Andrea Cavallaro, Carlo Regazzoni
A 3D-Autism Dataset for Repetitive Behaviours with Kinect Sensor
Omar RIHAWI, Djemal MERAD, Jean Luc Damoiseaux
Action Recognition based on a mixture of RGB and Depth based skeleton
Srijan Das, Michal Koperski, Francois Bremond, Gianpiero Francesca
Abnormal behavior detection in LWIR surveillance of railway platforms
Kristof Van Beeck, Kristof Van Engeland, Joost Vennekens, Toon Goedemé
Applying Audio Description for Context Understanding of Surveillance Videos by People With Visual Impairments
Virginia Campos, Luiz Goncalves, Tiago Araujo
Learning Feature Representation for Face Verification
Sangwoo Park, Jongmin Yu, Moongu Jeon
A batch asynchronous tracker for wireless smart-camera networks
Sandeep Katragadda, Andrea Cavallaro
Hyper-optimization tools comparison for parameter tuning applications
Camille Maurice, Jorge Francisco Madrigal Diaz, Frédéric Lerasle
Activity Recognition Using a Panoramic Camera for Homecare
Oscal T.-C. Chen, Ching-Han Tsai, Hung Ha Manh, Wei-Chih Lai
Building an Intelligent Video and Image Analysis Evaluation Platform for Public Security
Chuanping Hu, Gengjian Xue, Lin Mei, Li Qi, Jie Shao, Yanfeng Shang, Jian Wang
A knowledge-based approach for video event detection using spatio-temporal sliding windows
Danilo Cavaliere, Sabrina Senatore, Pierluigi Ritrovato, Luca Greco
17:45 Closing Remarks