Tunç Aydın is a Research Scientist at Disney Research located at the Zürich Lab. My current research primarily focuses on image and video processing problems that address various movie production challenges, such as natural matting, green-screen keying, color grading, edge-aware filtering, and temporal coherence, among others. I have also been interested in analyzing visual content in terms of visual quality and aesthetic plausibility by utilizing knowledge of the human visual system. In my work I tend to utilize High Dynamic Range, Stereoscopic 3D, and High Frame-rate content, in addition to standard 8-bit images and videos.
Prior to joining Disney Research, I worked as a Research Associate at the Max-Planck-Institut für Informatik from 2006-2011, where I obtained my PhD degree under the supervision of Karol Myszkowski and Hans-Peter Seidel. I received the Eurographics PhD award in 2012 for my dissertation. I hold a Master’s degree in Computer Science from the College of Computing at Georgia Institute of Technology, and a Bachelor’s degree in Civil Engineering from Istanbul Teknik Universitesi. His talk takes place on Wednesday, November 1, 2017 at 13:00 in room A112.
Themos Stafylakis is a Marie Curie Research Fellow on audiovisual automatic speech recognition at the Computer Vision Laboratory of University of Nottingham (UK). He holds a PhD from Technical University of Athens (Greece) on Speaker Diarization for Broadcast News. He has a strong publication record on speaker recognition and diarization, as a result of his 5-year post-doc at CRIM (Montreal, Canada), under the supervision of Patrick Kenny. He is currently working on lip-reading and audiovisual speech recognition using deep learning methods. His talk takes place on Wednesday, November 22, 2017.
Deep Word Embeddings for Audiovisual Speech Recognition
During the last few years, visual and audiovisual automatic speech recognition (ASR) are witnessing a renaissance, which can largely be attributed to the advent of deep learning methods. Deep architectures and learning algorithms initially proposed for audio-based ASR are combined with powerful computer vision models and are finding their way to lipreading and audiovisual ASR. In my talk, I will go through some of the most recent advances in audiovisual ASR, with emphasis on those based on deep learning. I will then present a deep architecture for visual and audiovisual ASR which attains state-of-the-art results in the challenging lipreading-in-the-wild database. Finally, I will focus on how this architecture can generalize to words unseen during training and discuss its applicability in continuous speech audiovisual ASR.
Vlastimil Havran is Associate professor at the Czech Technical University in Prague. His research interests include data structures and algorithms for rendering images and videos, visibility calculations, geometric range searching for global illumination, software architectures for rendering, applied Monte Carlo methods, data compression etc. His talk takes place on Monday, December 4, 2017 at 12:00 in room E105.
Surface reflectance in rendering algorithms
The rendering of images by computers, i.e., computationally solving a rendering equation, consists of three components: computing visibility for example by ray tracing, the interaction of light with surface and efficient Monte Carlo sampling algorithms. In this talk, we focus on various aspects of surface reflectance. That is a key issue to get high fidelity of objects’ visual appearance in the rendered images not only in the movie industry but also in real time applications of virtual and augmented reality. First, we recall the initial concepts of surface reflectance and its use in rendering equation. Then we will present our results on the surface reflectance characterization and its possible use in rendering algorithms. Further, we will show why the standard surface reflectance model usually represented as bidirectional reflectance distribution function needs to be extended spatially to achieve high fidelity of visual appearance. As this spatial extension leads to a big data problems, we will describe our algorithm for compression of spatially varying surface reflectance data. We also will describe an effective perceptually motivated method to compare two similar surface reflectance datasets, where one can be the reference data and the second one the result of its compression. As the last topic, we will describe the concepts and problems when we measure such surface reflectance datasets for real-world applications.
Miloslav Druckmüller is a Professor of Applied Mathematics at the Institute of mathematics, Faculty of Mechanical Engineering, Brno University of Technology and the head of the Department of Computer Graphics and Geometry. His main interests are numerical methods of image analysis, digital image processing, computer graphics and complex variable analysis. During the last 10 years he has been cooperating widely with the Institute for Astronomy, University of Hawaii in the field of solar coronal plasma research. He created a large archive of K-corona (photospheric light scattered on free electrons) images and temperature maps based on Fe and Ni ions observing based on data obtained during total solar eclipses during last two decades. Nowadays his research is mainly focused on processing and analysis of data obtained by NASA SDO spacecraft. His talk takes place in POSTPONED.
Kevin Köser is a senior researcher at the GEOMAR Helmholtz Centre for Ocean Research, Kiel. His main research interest lies in novel camera-based measurement techniques for (deep) sea environments and processes (3D underwater vision). These help to study resources, to explore and monitor (deep) sea habitats or to assess hazards, e.g. with respect to gas flux or seafloor dynamics. In the past years Dr. Köser has taught the classes 3D Photography and Computer Vision Lab at the Swiss Federal Institute of Technology (ETH Zurich) and has worked as a senior researcher in ETH’s Computer Vision and Geometry Lab on shape and motion extraction from photos and videos, geolocalization and image registration. POSTPONED