IEEE International Workshop on Socially Intelligent Surveillance and Monitoring (SISM 2011)


In general terms, surveillance and monitoring technologies aim at understanding what people do in a given environment, whether this means to ensure the safety of workers on the factory floor, to detect crimes occurring in indoor or outdoor settings, or to monitor the flow of large crowds through public spaces. Computer vision & pattern recognition are the main technologies used for automatic monitoring of public spaces. Effective approaches for tracking people, recognizing poses, postures, gestures, collective crowd phenomena in public environments have been developed in the last years, especially in the video surveillance context, aimed at classifying (suspect, unusual, abnormal) behaviors.

However, surveillance and monitoring technologies rarely consider that they analyze human behavior, a phenomenon subject to principles and laws rigorous enough to produce stable and predictable patterns corresponding to social, affective, and psychological phenomena. On the other hand, these phenomena are the subject of other computing domains, in particular Social Signal Processing and Affective Computing, that typically neglect scenarios relevant to surveillance and monitoring technologies, especially when it comes to social and affective dimensions of space in human activities.

The mission of SISM  is to fill this gap by gathering researchers active in computer vision and pattern recognition, human sciences and automatic behavior understanding, in the same spirit as it has been pursued in the first edition of the workshop. Joint research across these communities will have a major impact on any technology that can benefit from automatic monitoring approaches, including video-surveillance, architecture, ambient intelligence, marketing, office space design, urbanism, etc..

Keynote speakers

Luc Van Gool (Computer Vision Laboratory, ETH Zurich, Switzerland)

Walking with groups

Tracking individuals in crowds is a difficult task. Thus, better models about how people normally walk can be of help. Such a model – LTA or Linear Trajectory Avoidance – is presented. We show examples of how LTA improves the predicted paths of individuals in crowds. Then, we move on to remedy some weaknesses of the model. It still treats all people in a crowd as being individually guided by LTA. In reality, some people walk in groups and therefore tend to stay closer together. Moreover, also other people are typically aware of such groups and behave differently with respect to a group of people vs. the same number of individuals walking along the same trajectories.

Frank E. Pollick (Department of Psychology, University of Glasgow, UK)

Effects of Experience in CCTV Surveillance Monitoring

It would seem obvious that individuals who daily monitor CCTV activity in urban environments would show signs of expertise in watching human activity.  However, this has not widely been studied and little is known about how performance changes with experience or the underlying mechanisms.  In a series of experiments we examined eye movements, behavioral judgments and brain activity (fMRI) of novice and experienced CCTV operators while they watched scenes of street activity.  These included both tasks of passive viewing and judging hostile intentions.  Results showed that operators had overall more efficient eye movements, and brain activity consistent with more extensive processing of the viewed activities.  These results will be discussed in the context of mechanisms for judging hostile intent.


09:15 Opening

09:30 Invited talk: Walking with groups
Luc Van Gool (Computer Vision Laboratory, ETH Zurich, Switzerland)

10:30 Coffee break

11:00 Who Knows Who – Inverting the Social Force Model for Finding Groups
Jan Sochman (Czech Technical University in Prague, Czech Republic)
David C. Hogg (University of Leeds, United Kingdom)

11:30 Do They Like Me? Using Video Cues to Predict Desires during Speed-dates
Arno Veenstra, Hayley Hung (University of Amsterdam, The Netherlands)

12:00 Differentiating Spontaneous From Posed Facial Expressions Within a Generic Facial Expression Recognition Framework
Tomas Pfister, Xiaobai Li, Guoying Zhao, Matti Pietikainen (University of Oulu, Finland)

12:30 Lunch Break

14:30 Invited talk: Effects of Experience in CCTV Surveillance Monitoring
Frank E. Pollick (Department of Psychology, University of Glasgow, UK)

15:30 An Efficient IP Approach to Constrained Multiple Face Tracking and Recognition
Andre Cohen, Vladimir Pavlovic (Rutgers University, USA)

16:00 Coffee Break

16.30 Human Pose Estimation Using Structural Support Vector Machines
Ke Chen, Shaogang Gong, Tao Xiang (Queen Mary University of London, United Kingdom)

17.00 A Joint Estimation of Head and Body Orientation Cues in Surveillance Video
Cheng Chen, Alexandre Heili, Jean-Marc Odobez (Idiap Research Institute, Switzerland)

17:30 Panel discussion

18.30 Closing


Call for Papers

Interested participants are invited to submit papers that should describe high-quality original research joining computer vision and pattern recognition, human sciences and automatic behavior understanding areas. Topics of interest include (but are by no means limited to):

  • Proxemics
  • Human ethology
  • Kinesics
  • Spatial Empathy
  • Territoriality
  • Expressions and emotions
  • Tracking: multi-person, multi-camera, group/crowd
  • Motion segmentation and analysis
  • Crowd/group analysis and simulation
  • Social force models
  • Collective and emergent behaviour
  • Gesture/Action recognition
  • Activity analysis
  • Multi-person/group/crowd interaction analysis
  • Spatial and temporal reasoning
  • Sensory integration and data fusion
  • Situation awareness and understanding
  • Applications: Ambient Intelligence, Surveillance and Monitoring, Domotics, Intelligent, Perceptual Marketing



The Workshop is held in conjunction with the International Conference on Computer Vision (Barcellona, November 6-13, 2011).

Program committee

A. Camurri (University of Genova)
R. Chellappa (University of Maryland)
I. Cohen (Honeywell)
R. Cucchiara (University of Modena Reggio-Emilia)
J. Ferryman (University of Reading, UK)
D. Gatica-Perez (IDIAP Research Institute, Switzerland)
S. Gong (Queen Mary, University of London)
K. Grammer (Ludwig-Boltzmann-Institute for Urban Ethology, Austria)
H. Högni Vilhjálmsson (Reykjavík University)
Y. Ivanov (MERL, USA)
J-M. Odobez (IDIAP Research Institute)
M. Pantic (Imperial College London/University of Twente)
C.Pelachaud (CNRS)
I.Poggi (Universita’ Roma Tre)
F. Pollick (University of Glasgow)
I. Reid (University of Oxford)
A. K. Roy Chowdhury (University of California, Riverside)
N. Sebe (University of Trento)
M. Turk (University of California, Santa Barbara)
T. Xiang (Queen Mary, University of London)

Submission policy

In submitting a manuscript to the workshop, authors acknowledge that no paper with substantially similar content has been submitted to another conference or workshop. Double submission to ICCV 2011 and SISM is the only exception allowed.

Manuscripts have to be submitted in the ICCV 2011 paper format workshops. Papers accepted for the workshop will be allocated 8 pages in the ICCV 2011 Proceedings.

SISM reviewing is double blind. Reviewing will be made by members of the program committee, and each paper will receive at least two reviews. Acceptance will be based on relevance to the workshop, novelty, and technical quality.

The papers can be submitted at the following link:

Important dates

June 27, 2011: Paper submission
August 10, 2011: Notification of acceptance
Camera-ready papers due: TBA


Vittorio Murino, University of Verona/ Italian Institute of Technology
Marco Cristani, University of Verona/ Italian Institute of Technology
Alessandro Vinciarelli, University of Glasgow/ IDIAP Research Institute


Email for all inquiries: vittorio.murino AT

1 comment to IEEE International Workshop on Socially Intelligent Surveillance and Monitoring (SISM 2011)

Leave a Reply




You can use these HTML tags

<a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <strike> <strong>