Online Short Courses (OSCs)


Online Short Courses (OSCs) are offered for training purposes within the network. Researchers interested in joining an OSC may get in touch with the contact person. The schedule for the OSCs is as follows:

ID | What | # Class | Lead | Coordinator | Schedule
OSC1 | Current and emerging standards – ISO | 1 | EPFL | ESR-3 (KU) | Nov. 23, 2015, 14–16h CET
OSC2 | Current and emerging standards – ITU | 1 | TUB | ESR-3 (KU) | Nov. 30, 2015, 14–16h CET
OSC3 | Current and emerging standards – IEEE | 1 | YU | ESR-3 (KU) | Dec. 14, 2015, 14–16h CET
OSC4 | Current and emerging standards – SID | 1 | HOL | ESR-3 (KU) | Dec. 7, 2015, 14–16h CET
OSC5 | The psychology of vision | 2 | KU | ESR-3 (KU) | M12-18
OSC6 | 3D perception | 2 | YU | ESR-3 (KU) | Jan. 12, 2016, 10–12h CET
OSC7 | HDR perception | 2 | DRZ | ESR-6 (DT) | Apr. 26, 2016, 14–16h CEST
OSC8 | QoE in Games + Subjective quality assessment | 2 | TUB | ESR-6 (DT) | June 20, 2016, 16–18h CEST
OSC9 | QoE-aware Future Internet architecture | 2 | UniCA | ESR-5 (DT) | Dec. 19, 2016
OSC10 | QoE adaptation and control mechanisms in multimedia applications | 2 | UoP | ESR-7 (UoP) | February 21, 2017
OSC11 | 3D display technologies | 2 | HOL | ESR-7 (UoP) | April 04, 2017
OSC12 | 3D content generation and processing | 2 | DRZ | ESR-7 (UoP) | June 27, 2017
OSC13 | Immersive Media Technology Experiences | 2 | NTNU | ESR-8 (HOL) | M24-32
OSC14 | 3D image and Video coding | 2 | EPFL | ESR-9 (DRZ) | M24-32
OSC15 | HDR image and video coding | 2 | EPFL | ESR-9 (DRZ) | M24-32
OSC16 | Quality and usability | 2 | TUB | ESR-10 (EPFL) | M32-36
OSC17 | Concept of models for Quality of Business | 2 | UoP | ESR-10 (EPFL) | M32-36

OSC 1

Date: Nov. 23, 2015, 14–16h CET
Title: Current and emerging standards – ISO
Speaker: Touradj Ebrahimi (EPFL)

Abstract

ISO is one of the major standardisation organisations in the world, headquartered in Geneva, Switzerland. This short course will provide an overview of the ISO standards committees JPEG and MPEG.
After a general introduction to the structure and standardisation procedures within ISO and the JPEG and MPEG committees, an overview of past standards developed by ISO in the field of multimedia is provided.
Particular emphasis will be given to Legacy JPEG, JPEG-LS, JPEG 2000, JPEG XR, and JPEG XT by the JPEG Committee and MPEG-1, MPEG-2, MPEG-4, AVC, and HEVC by the MPEG Committee.
In addition, we will see in detail how technologies are selected through calls for evidence, calls for proposals and core experiments, and especially which quality assessment methodologies and metrics are used. The short course will then provide an outlook on emerging standards such as JPEG PLENO, JPEG XS, MPEG-FTV, and MPEG-3DGC and their current status.

Speaker’s biography

Touradj Ebrahimi is a professor at EPFL and head of its multimedia signal processing group. He is also Convenor (chairman) of the JPEG standardisation committee and a key member of the MPEG standardisation committee. Prof. Ebrahimi has received numerous awards and distinctions for his achievements in the standardisation of image and video, in particular ISO certificates for outstanding contributions to MPEG-4, JPEG 2000, JPSEC, JPEG XR and JPSearch, to mention some. Prof. Ebrahimi has played a key role in the assessment of JPEG 2000, JPEG XR, JPEG XT, MPEG-4, AVC, MVC, HEVC, 3DVC, and MPEG HDR/WCG.

OSC 2

Date: Nov. 30, 2015, 14–16h CET
Title: Current and emerging standards – ITU
Speaker: Alexander Raake (TU Ilmenau, Telekom Innovation Laboratories)

Abstract

The ITU (International Telecommunication Union) is an agency of the United Nations, based in Geneva, Switzerland. It is one of the most important standardization bodies in the telecommunication world.
This talk will give a brief overview of the general ITU organization, its two main sectors ITU-R and ITU-T (the latter being the main standardization sector), and their divisions into Study Groups and Questions. We will show how an ITU recommendation is created, from its inception and the drafting process within working groups to its final approval.
Also, we will outline the most important previously published recommendations related to multimedia codecs (e.g., ITU-T H.264/H.265 for video and the G.7xx series for speech), QoS and QoE monitoring (e.g., from the ITU-T J-series and the ITU-T P.12xx-series for audiovisual media, PESQ and POLQA for speech), and subjective quality assessment (ITU-T P.8xx/P.9xx series). Further, ITU-T’s liaisons to other organizations are highlighted.
Finally, we will show the status and future plans of selected working groups with regard to current and emerging multimedia services, such as ITU-T SG12/Q14’s P.NATS and P.INQX, and ITU-T SG9/Q12’s ongoing work on recommendations for subjective quality assessment.

Speaker’s biography

Professor Alexander Raake was appointed head of the Audiovisual Technology Group at TU Ilmenau in July 2015. Before that, he was a Professor at TU Berlin, heading the Assessment of IP-based Applications group at TU Berlin’s An-Institut T-Labs, a joint venture between Deutsche Telekom AG and TU Berlin.
His research interests are in speech, audio and video transmission, Quality of Experience assessment, audiovisual and multimedia services and user perception. Since 1999, he has been involved in the standardization activities of the International Telecommunication Union (ITU-T) on transmission performance of telephone networks and terminals, where he currently acts as a Co-Rapporteur for question Q.14/12 on monitoring models for audiovisual services.

OSC 3

Date: Dec. 14, 2015, 14–16h CET
Title: Current and emerging standards – IEEE
Speaker: Sanghoon Lee (YU)

Abstract

IEEE-SA is an organization within IEEE that has developed global standards for over a century. IEEE-SA covers a broad range of industries, including biomedical and healthcare, IT, nanotechnology, power and energy, and telecommunications. This talk introduces the standards processes and management in IEEE-SA and gives an overview of some notable standards. Then, I will talk about the international standard currently under development, named “IEEE P3333.1: Quality Assessment of Three Dimensional (3D) Displays, 3D Contents and 3D Devices based on Human Factors”. This standard covers visual saliency prediction, subjective assessment for quantifying the visual discomfort and quality of experience (QoE) of 3D image and video, and methods of quality assessment of 3D and UHD content. To help understand the current work on this standard, I will introduce the background knowledge of 3D content analysis in terms of the human visual system, accommodation and binocular fusion.

Speaker’s biography

Sanghoon Lee is a full-time professor at Yonsei University, Associate Editor of IEEE Signal Processing Letters, Chair of the IEEE P3333.1 Quality Assessment Working Group and member of the IEEE IVMSP Technical Committee. He has held significant positions in many groups working on information networking and 3D technology and has served as an editor of several reputable journals. Prof. Lee has been awarded numerous domestic and international prizes for his contributions to scientific research. His major research interests focus on image/video quality assessment, medical image processing, cloud computing, sensors and sensor networks, wireless multimedia communications and wireless networks.

OSC 4

Date: Dec. 7, 2015, 14–16h CET
Title: Current and emerging standards – SID
Speaker: Péter Tamás Kovács (HOL)

Abstract

The Society for Information Display (SID) exists to foster display commerce, education, research and standards. The International Committee for Display Metrology (ICDM) is a committee within SID.
Members of the ICDM Committee are professionals and experts in displays or related fields, including display metrology, display engineering, physics, optics, color science, vision science, and electrical engineering. There are over 250 active members and over 50 in the ICDM interest group — from many countries around the world including Asia, the Americas, and Europe.
The ICDM develops and maintains display-related standards, and has produced the IDMS1 (Information Display Measurements Standard version 1), a standard that details evaluation methods for many display parameters. It describes measurement techniques, defines terminology, and explains what the results mean. Its purposes are:
- To help the display industry by providing a common language for display measurements.
- To provide metrics, methods, explanations, and tutorials to help obtain accurate and precise numeric values that define display performance.
- To identify for display users what display parameters are, what information they are meant to provide, and to help users be alert for tricks or misstatements (i.e. specsmanship) that are sometimes used when describing display performance.
- To provide detailed measurement procedures that other standards organizations can reference rather than expend the considerable effort to develop on their own.
As such, the IDMS standard aims to quantify the quality of our terminal devices, which is an upper bound for the overall user experience. It is therefore particularly important that manufacturer-specific metrics and measurement methods (sometimes driven by marketing purposes) are replaced by common ones provided by the IDMS.

Speaker’s biography

Péter Tamás Kovács has been working at Holografika since 2006. He has been involved in the research and development of novel 3D display prototypes and products as well as related technologies, many of which were world firsts (the first glasses-free 3D cinema, a real-time light field capture and rendering system, and a full-angle 180 degree light-field display). He has served as PC member and Local Organizer for numerous international IEEE conferences (most notably 3DTV-Con) and is a contributing member of the International Committee for Display Metrology (ICDM) and the MPEG FTV AhG.

OSC 6

Date: Jan. 12, 2016, 10–12h CET
Title: 3D perception
Speaker: Sanghoon Lee (YU)

Abstract

Unlike 2D content, 3D content requires ocular adjustment to binocular depth, which can induce neurological symptoms such as visual discomfort and headache, as well as 3D distortions that degrade quality of experience. Understanding these problems involves several intricate visual factors, which can only be probed by investigating reliable spatial and temporal features in 3D content and by using a reliable subjective testing methodology. In this talk, I will present the background knowledge of 3D content analysis in terms of human visual resolution according to foveation, accommodation and binocular fusion. In addition, from a physiological point of view, I will introduce the basic operation of perceptual recognition in the primary visual cortex and how its operation can be combined with visual discomfort estimation for images. As applications, I will introduce principles and statistical characteristics of 3D content and visual saliency prediction based on the human visual system, physiology and neural activity. To demonstrate the effect of noise on 3D quality, stereopsis, binocular rivalry and suppression are discussed.

Speaker’s biography

Sanghoon Lee is a full-time professor at Yonsei University, Associate Editor of IEEE Signal Processing Letters, Chair of the IEEE P3333.1 Quality Assessment Working Group and member of the IEEE IVMSP Technical Committee. He has held significant positions in many groups working on information networking and 3D technology and has served as an editor of several reputable journals. Prof. Lee has been awarded numerous domestic and international prizes for his contributions to scientific research. His major research interests focus on image/video quality assessment, medical image processing, cloud computing, sensors and sensor networks, wireless multimedia communications and wireless networks.

OSC 7

Date: Apr. 26, 2016, 14–16h CEST
Title: HDR Perception
Speakers: Dr. Tunc Aydin, Dr. Aljosa Smolic

Abstract

The fact that the target audience of visual material consists of human beings is extremely useful for visual computing research. After the visual information travels into the human eye in the form of light rays, it goes through a number of low-level, involuntary optical and neurological systems before it is relayed to other parts of the brain for higher-level processing, which we collectively call the “Human Visual System” (HVS). For our purposes, we can think of the HVS as a cascade of filters that either remove or transform visual information. Years of research have shown that insights into how the HVS works can be exploited to advance the state of the art in visual computing. In this talk we will present practical examples where using HVS models either improves upon existing methods or enables new applications that were not possible before. We will start with a high-level overview of modelling various components of the HVS, and discuss how those components can be applied to various visual significance, contrast detection and aesthetic judgment tasks. Finally, we will outline approaches and results of visual quality evaluation experiments in the context of HDR and WCG coding.

Speaker’s biography

Dr. Tunc Aydin is a Research Scientist at Disney Research. His research interests include developing image and video post-processing technologies that help creatives realize their artistic vision, and understanding and automatically evaluating what makes visual content high quality and aesthetically pleasing. He holds a PhD (summa cum laude) degree from the Computer Graphics Department of Max-Planck-Institut für Informatik (2010). His dissertation received the Eurographics PhD Award (2012). He obtained his M.S. degree from the College of Computing of Georgia Institute of Technology (2005), and his B.S. degree from the Civil Engineering Department of Istanbul Technical University (2003).

Dr. Aljosa Smolic joined Disney Research Zurich in 2009 as Senior Research Scientist and Head of the Advanced Video Technology group. Before that, he was Scientific Project Manager at the Fraunhofer Heinrich-Hertz-Institut (HHI), Berlin, also heading a research group. He has been involved in many national and international research projects, where he conducted research in various fields of video processing and visual computing, and has published more than 130 refereed papers in these fields. He received the Dipl.-Ing. degree in Electrical Engineering from the Technical University of Berlin, Germany, in 1996, and the Dr.-Ing. degree in Electrical Engineering and Information Technology from Aachen University of Technology (RWTH), Germany, in 2001. He is Associate Editor of IEEE Trans. on Image Processing and Area Editor for Signal Processing: Image Communication, and has served as Guest Editor for the Proceedings of the IEEE and other scientific journals. He has been involved in MPEG standardization for 3D video as group leader and one of the Editors of the Multi-view Video Coding (MVC) standard. Further, he serves as Adjunct Professor at ETH Zurich, teaching full lecture courses on Multimedia Communications, and as Adjunct Professor for Media Art at Karlsruhe University of Arts and Design.

OSC 8

Date: June 20, 2016, 16–18h CEST
Title: QoE in Games and Subjective Quality Assessment
Speaker: Sebastian Möller (TUB)

Abstract

The OSC sets out to establish some basics for the subjective assessment of Quality of Experience. Starting with definitions of quality, performance, Quality of Service and Quality of Experience, it first illustrates the concepts which commonly form the target in QoE assessment. Then, it provides a theoretical concept of psychophysical measurement processes, illustrating the requirements for measurements, the characteristics of test participants, as well as scaling procedures. Finally, the above-mentioned concepts are applied to QoE assessment of online gaming situations.

Speaker’s biography

Sebastian Möller studied electrical engineering at the universities of Bochum (Germany), Orléans (France) and Bologna (Italy). From 1994 to 2005, he held the position of scientific researcher at the Institute of Communication Acoustics (IKA), Ruhr-University Bochum, and worked on speech signal processing, speech technology, communication acoustics, as well as on speech communication quality aspects. Since June 2005, he has worked at Telekom Innovation Laboratories, TU Berlin. He was appointed Professor at TU Berlin for the subject “Quality and Usability” in April 2007, and heads the “Quality and Usability Lab” at Telekom Innovation Laboratories.

OSC 9

Date: December 19, 2016, 14–16h CET
Title: QoE-aware Future Internet architecture
Speaker: Luigi Atzori (UniCA)

Abstract

The OSC starts by presenting the current approaches to QoE-aware service management in the Internet, i.e., QoE-aware application management and QoE-aware network management, which are mostly characterized by the interests of, and tools available to, the stakeholder implementing the process. Then, the current limits and issues of these approaches are highlighted, together with the current directions of development in the field. These are mostly characterized by a strong evolution towards the virtualization of services through the introduction of the SDN and NFV paradigms. These allow for more flexible management of services with better control of the QoE, and for potentially stronger cooperation among the different actors involved in the service provisioning chain.

Speaker’s biography

Luigi Atzori is Associate Professor at the Department of Electrical and Electronic Engineering at the University of Cagliari (Italy) and Research Associate of CNIT (Consorzio Nazionale Interuniversitario per le Telecomunicazioni). His research interests are in multimedia communications and computer networking (wireless and wireline), with emphasis on multimedia QoE, multimedia streaming, NGN service management, service management in wireless sensor networks, and architecture and services in the Internet of Things. At the University of Cagliari he leads the Multimedia and Communications lab (http://mclab.diee.unica.it).

OSC 10

Date: February 21, 2017, 11–13h CET
Title: QoE adaptation and control mechanisms in multimedia applications
Speaker: Prof. Lingfen Sun (Plymouth University)

Abstract

This talk is about QoE adaptation and control mechanisms in multimedia applications, including VoIP and video streaming applications. It will start with an overview of the need for QoE control and management, the main challenges, and general questions on what to control and how to control it. The speaker will then talk through some research projects she has been involved in relating to QoE-aware control at the application and network/link layers, including EU FP7 ADAMANTIUM; bitrate adaptation and implementation for VoIP and video streaming applications; LTE resource allocation; QoE-aware, content-aware DASH; and the latest work on video quality management over SDN. A brief summary is provided at the end.

Speaker’s biography

Dr. Lingfen Sun has been an Associate Professor (Reader) in Multimedia Communications and Networks at the School of Computing, Electronics and Mathematics, Plymouth University, since September 2010. She received her PhD on “Speech Quality Prediction for Voice over Internet Protocol Networks” from the University of Plymouth, UK, in 2004. She holds a Postgraduate Certificate in Learning and Teaching in Higher Education, an M.Sc. in Communication and Electronics System and a BEng in Telecommunications Engineering. She is currently affiliated to the Signal Processing and Multimedia Communications Research Group (SPMC) and the Centre for Security, Communications and Network Research (CSCAN).

OSC 11

Date: April 04, 2017, 11–13h CET
Title: 3D Display Technologies
Speaker: Attila Barsi (HOL)

Abstract

The OSC’s main topic is 3D displays. Our discussions will include: Introduction to stereoscopic 3D displays. Anaglyph stereo, passive stereo and active stereo systems. Autostereoscopic displays. Differences between horizontal parallax and full parallax autostereoscopy. Multiview systems:
• parallax barrier
• lenticular lens based displays
• volumetric displays
• time multiplex systems
Introduction to light fields and light field displays.
Then, the technical background of current generation light field display technologies will be highlighted:
• Near-eye light field displays.
• HoloVizio technology.

Speaker’s biography

Attila Barsi worked on real-time graphics research and EU projects during his PhD studies, most notably on global illumination, environment lighting and game engines in the GAMETOOLS project. Since joining Holografika in 2006, he has participated in several EU projects on real-time and light field rendering. He is currently working as lead software developer. He was responsible for software development in the COHERENT, OSIRIS and HOLOVISION projects in FP6 and the 3DVIVANT, DIOMEDES and MUSCADE projects in FP7. He was also responsible for project management in the HOLONICS project in FP6, the 3DPHONE and SAFROS projects in FP7, and the Arri(va)2 project in Eureka. His expertise is in cutting-edge real-time graphics programming through various APIs and GPUs, developing audio-video related applications, writing distributed and parallel applications, and building and managing Windows and Linux based clusters.

OSC 12

Date: June 27, 2017, 14–16h CEST
Title: 3D Content Generation and Processing
Speaker: Prof Aljosa Smolic (University of Dublin)

Abstract

Stereoscopic 3D (S3D) is a sensitive illusion, which has to be carefully designed to create a compelling experience. For that, perceptual aspects have to be understood in detail as well as inherent shortcomings and limitations of the technology. Over decades, artists have mastered the skills to tell engaging stories in S3D, while not overstressing the audience’s perceptual limits. This course will outline perceptual and creative aspects of S3D content creation. Examples of related technology and research will be given.

Speaker’s biography

Prof Smolic is the SFI Research Professor of Creative Technologies at Trinity College Dublin. Before joining Trinity, Prof Smolic was with Disney Research Zurich as Senior Research Scientist and Head of the Advanced Video Technology group, and with the Fraunhofer Heinrich-Hertz-Institut (HHI), Berlin, also heading a research group as Scientific Project Manager. He received the Dr.-Ing. degree from Aachen University of Technology (RWTH) in 2001. Prof Smolic is Associate Editor of the IEEE Transactions on Image Processing and has served as Guest Editor for the Proceedings of the IEEE, IEEE Transactions on CSVT, IEEE Signal Processing Magazine, and other scientific journals. At TCD he leads a team of 20+ researchers called V-SENSE – Extending Visual Sensation through Image-Based Visual Computing, combining computer graphics, computer vision and media technology. He has published over 150 refereed papers in these fields and filed 35+ patents.