Ghent University – iMinds – Multimedia Lab Master thesis subjects 2015 - 2016 VIDEO Efficient transcoding from VP9 to HEVC Promotor: Peter Lambert and Rik Van de Walle Supervisors: Johan De Praeter, Glenn Van Wallendael Study Programme: Master Computer Science Engineering, Master Electrical Engineering Number of students: 1 Number of theses: 1 Contact: Johan De Praeter Keywords: VP9, HEVC, video compression, transcoding Location: home, Zuiderpoort Problem definition: The next generation of video compression standards offers better compression than its predecessors. However, two different standards are being pushed forward: VP9 and High Efficiency Video Coding (HEVC). These two standards will likely coexist in the future, which makes it important to provide compatibility between devices supporting the different standards. This compatibility can be achieved by transcoding a bitstream encoded with one standard to a bitstream compatible with the other. The simplest method for doing this consists of decoding the bitstream and re-encoding it with the other one standard. However, this process can be made more efficient, since there are some similarities between both standards. Goals: The goal of this master’s thesis is to reduce the complexity of transcoding from VP9 to HEVC while retaining good video quality. To achieve this, the student will have to get acquainted with the properties of both VP9 and HEVC to exploit similarities between both compression standards. The student gets to build on an existing transcoding framework written in Python or C++, but can also directly make changes to the reference software of both video standards (VP9: http://www.webmproject.org/code/; HEVC: http://hevc.kw.bbc.co.uk/). Classification and detection of screen content for visually lossless transcoding in video compositions for HEVC Promotor: Peter Lambert and Rik Van de Walle Supervisors: Johan De Praeter, Thijs Vermeir Study Programme: Master Computer Science Engineering, Master Electrical Engineering Number of students: 1 Number of theses: 1 Contact: Johan De Praeter Keywords: HEVC, lossless compression, transcoding, compositions, screen content Location: home, Zuiderpoort Problem definition: Video sequences can either contain natural content, which is content captured by cameras, or screen content such as a screen capture of a desktop. In many cases, such as screen sharing during a video call, it is important to preserve the detail of this screen content to assure the readability of data. To do this, more bits can be allocated to the entire video sequence. However, in a composition of both natural and screen content, this would result in bits being wasted on the natural content. Moreover, depending on the type of screen content, the amount of bitrate increase to prevent annoying artifacts might differ. Therefore, it should be possible to detect several types of screen content in a composition. Goals: In this master’s thesis, the goal is to detect and classify different types of screen content, and to identify the minimal bitrate that should be allocated to prevent annoying artifacts to appear. In a second phase, the student will implement this bitrate allocation by building on an existing transcoding framework for video compositions in Python or C++ and by making changes to the reference encoder of the HEVC video compression standard (http://hevc.kw.bbc.co.uk/). This thesis is in cooperation with Barco (www.barco.be). Content-adaptive bandwidth sharing for video transport over the network Promotor: Peter Lambert and Rik Van de Walle Supervisors: Johan De Praeter, Thijs Vermeir Study Programme: Master Computer Science Engineering, Master Electrical Engineering Number of students: 1 Number of theses: 1 Contact: Johan De Praeter Keywords: video compression, quality of service, network policy, adaptive streaming Location: home, Zuiderpoort Problem definition: In a family of five people, one child is browsing YouTube on his tablet, while another one is having a Skype call with a friend. Meanwhile, the third child is streaming a film in full HD on his laptop. In the living room, the parents are watching Netflix on their Ultra HD television. However, each family member experiences an unstable video quality, since all of the aforementioned devices are competing for their share of bandwidth. This same problem occurs whenever several streaming clients share the same bottleneck link. Goals: The goal of this thesis is to investigate different policies for sharing bandwidth between different devices on the same network during video streaming. This policy should take into account the capabilities of the devices and the type of content and should distribute bit rates appropriately to provide end users with an optimal quality of service. More specifically, the student will have to analyze the relation between bit rate, quality, and the type of video content. This thesis is in cooperation with Barco (www.barco.be). Consistent quality video streaming over the network Promotor: Peter Lambert and Rik Van de Walle Supervisors: Johan De Praeter, Thijs Vermeir Study Programme: Master Computer Science Engineering, Master Electrical Engineering Number of students: 1 Number of theses: 1 Contact: Johan De Praeter Keywords: video compression, video quality, adaptive streaming Location: home, Zuiderpoort Problem definition: When streaming video over the internet, the bit rate of video stream segments is usually kept constant. However, if the video content switches from a static scene to a fast motion scene, the quality of the latter will suffer since complex motions require more bits to encode than static content. Depending on the content, this sudden drop of quality will be noticeable or distracting to the viewer. To prevent such a scenario, a more intelligent distribution of the bit rate is necessary. Goals: The goal of this thesis is to investigate strategies to provide consistent quality during video streaming. As one such strategy, the student can ‘steal’ bits from low-motion scenes to use in highmotion scenes. The bit allocation algorithms can be implemented by making changes to the reference encoder of the HEVC video compression standard (http://hevc.kw.bbc.co.uk/). This thesis is in cooperation with Barco (www.barco.be). Tracking down the pirates: watermarking as countermeasure against illegal video distribution Promotor: Peter Lambert and Rik Van de Walle Supervisors: Johan De Praeter, Luong Pham Van Study Programme: Master Computer Science Engineering, Master Electrical Engineering Number of students: 1 Number of theses: 1 Contact: Johan De Praeter Keywords: HEVC, video watermarking, copyright protection, piracy Location: home, Zuiderpoort Problem definition: Copying an unprotected digital video file is an easy thing to do. Copyrights are violated and piracy runs rampant. In an effort to protect their properties, copyright holders encrypt their video in such a way that only authorized decoders can play their files, limiting the playback options of their honest users. The malicious users on the other hand focus their efforts to decrypt the secured videos and distribute them anyway. A much kinder approach to protect the copyright of video files is to attach a unique watermark to each digitally distributed video file. This watermark is hidden in the bit stream while keeping the file standard complaint. As such, honest users can use their favorite media player to watch the film, while malicious users won’t notice the watermark that will identify them as the uploader if they dare to illegally distribute the file. Goals: In a first phase, the student will identify elements in HEVC bit streams that can be modified to contain watermarks without introducing a visible loss of quality in the video. In a second phase, the robustness of the watermark will be tested when subjected to several attacks such as re-encoding of the video. The watermarking algorithms can be implemented by making changes to the reference encoder and decoder of the HEVC video compression standard (http://hevc.kw.bbc.co.uk/). Ultra-fast HEVC encoding Promotor: Peter Lambert and Rik Van de Walle Supervisors: Johan De Praeter, Glenn Van Wallendael Study Programme: Master Computer Science Engineering, Master Electrical Engineering Number of students: 1 Number of theses: 1 Contact: Johan De Praeter Keywords: HEVC, video compression, low complexity encoding, machine learning Location: home, Zuiderpoort Problem definition: HEVC is the successor of the H.264/AVC video compression standard. It doubles the compression ratio of video for the same quality as its predecessor. However, this increased compression efficiency comes with a price: the computational complexity has seen a huge increase as well. Since HEVC will be used to encode ultra HD resolution content which can contain up to 16 times the amount of pixels of current full HD content, it is a necessity to reduce the complexity as much as possible. Fortunately, some decisions in the encoder can be accelerated by using data from previously made decisions. Goals: The goal of this thesis is to reduce the encoding time of HEVC while keeping the compression efficiency as high as possible. In the first phase, the student is expected to make an analysis of techniques for fast HEVC encoding to become familiar with the encoding decisions that are viable options for acceleration. In a second phase, the student can try different strategies such as machine learning to skip these encoding decisions. For the machine learning aspects, the student gets to build on an existing transcoding framework either in Python or C++. Additionally, algorithms can also be implemented by making changes to the reference encoder of the HEVC video compression standard (http://hevc.kw.bbc.co.uk/). Robust low-latency video encoding for distribution over error-prone networks Promotor: Peter Lambert and Rik Van de Walle Supervisors: Johan De Praeter, Thijs Vermeir Study Programme: Master Computer Science Engineering, Master Electrical Engineering Number of students: 1 Number of theses: 1 Contact: Johan De Praeter Keywords: HEVC, video compression, low-latency, error-prone network Location: home, Zuiderpoort Problem definition: In current meeting rooms, a wireless connection is used between the presenter’s laptop and the decoder attached to the display. For a high quality of experience it is important that the stream is displayed in the meeting room with low-latency and high quality. The wireless connection is the bottleneck to achieve this requirement. This connection is error-prone (dropped packets), has lots of jitter and has limited bandwidth. Multiple solutions could be proposed to minimize the impact of dropped network packets (e.g. using a reliable network connection, using less bandwidth) but all have their own disadvantages. Goals: The goal of this thesis is to analyze the influence of wireless networks on the HEVC video stream while providing a high quality of experience. First, the student is expected to analyze the impact of the wireless connection on this video stream and study the state-of-the-art for robust video encoding over error-prone networks. Secondly, the student should try different strategies to improve the quality of experience. These strategies could be related to the streaming protocol or the HEVC codec configuration. The student can also investigate how some tools (FMO, ASO, RS) from a previous codec (H.264/AVC) may improve the quality of the video stream. This thesis is in cooperation with Barco (www.barco.be). Tracking of moving objects in the HEVC compressed domain for ultra-high resolution video Promotor: Peter Lambert and Rik Van de Walle Supervisors: Johan De Praeter, Niels Van Kets Study Programme: Master Computer Science Engineering, Master Electrical Engineering Number of students: 1 Number of theses: 1 Contact: Johan De Praeter Keywords: HEVC, object detection, video surveillance, video analysis, compressed domain Location: home, Zuiderpoort Problem definition: Video object detection and tracking is a useful technique for video surveillance and other video analysis applications. The most accurate way to perform this analysis is on the raw pixels in the video. However, these techniques may be computationally complex, which is not desirable for realtime analysis of ultra HD videos with resolutions of 3840x2160 and 7680x4320 pixels. Moreover, a lot of videos only exist in compressed form and need to be decoded first. A less computationally complex approach to track objects in these videos is to extract some information from the bit stream and perform object detection on this information. Goals: The goal of this thesis is to detect and track objects by using only a subset of information from an HEVC bit stream. The algorithms should have a low complexity, since they should be able to detect and track many different objects in an ultra-high resolution video. In a first phase, the student should investigate existing techniques for compressed domain object detection in previous video coding standards. In a second phase, these techniques can be adapted and expanded for HEVC by building on an existing information extraction framework. Going Beyond HD: optimized encoding of personalized views from Beyond HD content Promotor: Peter Lambert and Rik Van de Walle Supervisors: Johan De Praeter, Niels Van Kets Study Programme: Master Computer Science Engineering, Master Electrical Engineering Number of students: 1 Number of theses: 1 Contact: Johan De Praeter Keywords: HEVC, video compression, Beyond HD, ultra-high resolution Location: home, Zuiderpoort Problem definition: Advances in digital video capturing allow cameras to capture videos with increasingly high resolutions. Using stitching technology, the output of these cameras can be stitched together to create a video with a resolution far beyond HD. However, such an amount of data cannot easily be transported to viewers at home. Additionally, the increased resolution would be wasted on the limited capabilities of their displays. As a solution, the user can select a subset of the beyond HD picture and zoom in on interesting events. Since many users will be interacting with the video at the same time, it’s necessary to speed up the encoding of each personalized stream. To achieve this, the personalized streams can exploit the correlation with the encoding of the full beyond HD picture. Goals: The goal of this thesis is to reduce the encoding complexity of each personalized stream to make the system viable for a large amount of users. The student should investigate the correlation between personalized streams and the full beyond HD picture to skip encoding decisions with e.g. machine learning. For the machine learning aspects, the student gets to build on an existing transcoding framework either in Python or C++. Additionally, algorithms can also be implemented by making changes to the reference encoder of the HEVC video compression standard (http://hevc.kw.bbc.co.uk/). Parallel encoding and decoding of DNA sequences on multicore CPUs and/or GPUs Promotor: Wesley De Neve and Rik Van de Walle Supervisors: Tom Paridaens and Ruben Verhack Study Programme: Master Computer Science Engineering, Master Electrical Engineering, Master Mathematical Informatics Number of students: 1 Number of theses: 1 Contact: Tom Paridaens Keywords: Bio-informatics, compression, DNA, Parallel processing Location: home, Zuiderpoort Problem definition: The human genome consists of more than 3 billion base pairs. Assuming that each base pair is stored in one byte, the human genome has an uncompressed size of 2.8 gigabyte. Add to this that the cost of reading the human genome (i.e. genome sequencing) dropped significantly the past years and it is clear that the amount of sequenced genomes is rising very fast. In fact, it rises faster than the speed at which storage capacity (per $) rises. This limits the development of new DNAoriented applications, such as personal health care and biometric identification. Goals: Within Multimedia Lab, a framework has been built that compresses DNA sequences based on principles used in the domain of video compression. This master’s thesis has the goal to improve the efficiency (processing speed and memory usage) and effectiveness (compression ratio) of the framework. The focus is on parallelizing the framework in order to be able to use multi-core architectures for the compression of DNA sequences and (potentially) using the gained processing time to apply additional compression tools to improve the effectiveness too. Compression and streaming of DNA reads for faster exchange and processing of DNA information Promotor: Wesley De Neve and Rik Van de Walle Supervisors: Tom Paridaens Study Programme: Master Computer Science Engineering, Master Electrical Engineering, Master Mathematical Informatics Number of students: 1 Number of theses: 1 Contact: Tom Paridaens Keywords: Bio-informatics, compression, DNA, distributed processing, streaming Location: home, Zuiderpoort, Applied Maths Problem definition: As the cost of sequencing genomes has dropped significantly the past years, sequencing technology has been applied more and more to generate databases for researchers and public health institutes, e.g. to investigate the evolution and spreading of organisms such as salmonella. Unfortunately, the speed at which base pairs can be read is higher than the speed at which they can be transmitted. Often, researchers use hard drives to transmit the sequencing data. This generates delays for research and the treatment of epidemics. Goals: Within Multimedia Lab, a framework has been built that compresses DNA sequences based on principles used in the domain of video compression. This master’s thesis focuses on adding support for individual reads. These are short DNA sequences, often not longer than some 10s or 100s of base pairs. The second goal is to transmit these individually compressed reads fast and efficiently from the sequencing machine across a network to a central database or end-users, to allow for live processing and streaming of data in a distributed environment. This master’s thesis is in cooperation with Applied Maths (http://www.applied-maths.com). Applied Maths is a Ghent-based company that creates software for management and analysis of biological data. Encryption of DNA sequences with preservation of accessibility for genomic analysis Promotor: Wesley De Neve and Rik Van de Walle Supervisors: Tom Paridaens and Glenn Van Wallendael Study Programme: Master Computer Science Engineering, Master Electrical Engineering, Master Mathematical Informatics Number of students: 1 Number of theses: 1 Contact: Tom Paridaens Keywords: Bio-informatics, compression, DNA, experimental research, encryption, DRM Location: home, Zuiderpoort Problem definition: As the cost of sequencing genomes has dropped significantly the past years, sequencing technology has been applied more and more to generate databases for researchers, e.g. to investigate epidemics or diseases such as cancer. As this is sensitive data, protection of the privacy of patients is very important. Encryption and DRM are key tools to protect this data. Goals: Within Multimedia Lab, a framework has been designed that compresses DNA Sequences, based on principles from the video compression domain. This master’s thesis has the goal to expand the current framework with encryption capabilities. In the first phase, available encryption technologies and techniques will be studied. In the second phase, methods will be designed and implemented in such a way that the data is (partially) encrypted, but can still be used for medical research. Complexity Constrained Scalable Video Coding Promotor: Peter Lambert and Rik Van de Walle Supervisors: Thijs Vermeir and Johan De Praeter Study Programme: Master in Computer Science Engineering, Master Electrical Engineering, Master of Mathematical Informatics Number of students: 1 Number of theses: 1 Contact: Thijs Vermeir Keywords: Scalable Video, SHVC, Complexity Constrained Encoding Location: home, Zuiderpoort or Technicum or Technicum Problem definition: In a multi-site meeting, consumer electronics are used a lot for sharing screen content (presentations, documents...) between different participants. These participants could be connected with a high bandwidth connection (e.g., other meeting room in the same building) or a low bandwidth connection (e.g., home office, customer site). Typically these consumer electronics (laptops, tablets, smartphones etc.) use hardware (HW) accelerated encoding and have (limited) computational capabilities available. Encoding the screen content multiple times for every bandwidth constraint is not always possible as only one HW acceleration block is available. If the HW accelerated encoder is used as the base layer (low bandwidth) the available computational power could be used to create an enhancement layer (high bandwidth). Goals: The goal of this master’s thesis is to investigate strategies for a complexity constrained scalable HEVC encoder. As it is expected that the base layer is already generated with a HW encoder, the student should concentrate only on the extension layer. In the first phase the student should study the complexity distribution for the HEVC reference software and analyze the relation between complexity, rate and distortion. In the next phase the student should explore strategies to reduce the complexity taking rate and distortion into account. This thesis is in cooperation with Barco (www.barco.be). DNA compression using machine learning techniques Promotor: Wesley De Neve, Pieter Buteneers (Reservoir Lab) Supervisors: Ruben Verhack, Tom Paridaens, Lionel Pigou (Reservoir Lab) Study Programme: Master Computer Science Engineering, Master Electrical Engineering Number of students: 1 Number of theses: 2 Contact: Ruben Verhack Keywords: DNA, compression, machine learning, neural networks Location: home, Zuiderpoort or Technicum or Technicum Problem definition: The human genome consists of more than 3 billion base pairs. Assuming that each base pair is stored in one byte, the human genome has an uncompressed size of 2.8 gigabyte. Add to this that the cost of reading the human genome (i.e. genome sequencing) dropped significantly the past years and it is clear that the amount of sequenced genomes is rising very fast. In fact, it rises faster than the speed at which storage capacity (per $) rises. This limits the development of new DNAoriented applications, such as personal health care and biometric identification. Goals: Within Multimedia Lab, basic research has been performed to compress DNA sequences based on principles used in the domain of video compression. In these master theses, the goal is to explore new paths for compressing genomic data using machine learning. Machine learning algorithms are known to be well suited for exploiting characteristics of the underlying model, i.e. the grammar of the genomes. The goal of these two theses is to explore the usage of machine learning in DNA compression, with two different approaches: reference based and non-reference based. Reference based compresses genomes relative to a reference genome. Non-reference based methods compress genomic data without external references. Non-transform based content adaptive image coding Promotor: Peter Lambert and Rik Van de Walle Supervisors: Ruben Verhack Study Programme: Master Computer Science Engineering, Master Electrical Engineering Number of students: 1 Number of theses: 1 Contact: Ruben Verhack Keywords: Image coding, lossy compression, image processing Location: home, Zuiderpoort or Technicum or Technicum Problem definition: Due to the massive amount of visual materials that are captured and stored nowadays, image compression remains an active field of research. Most standard image coding schemes follow the transform-quantize-entropy coding scheme. JPEG 2000 relies on the wavelet transform, whereas JPEG relies on the DCT transform. This has been the main focus for the last decades, but we claim that spatial (non-transform) techniques are still not fully researched. In adaptive systems the parameters of the encoder are adapted to the changing statistics of the signal. There is a wide range of adaptive systems: they go from updating compression symbol statistics to really understanding the semantics within the image. But they all have the mutual goal of gaining compression efficiency through a better understanding of the content it is compressing. Goals: The goal in this master’s thesis is to develop a completely new strategy for exploiting image content for coding purposes. This includes finding an underlying model of the image as a description and coding it efficiently. This should be done while keeping the characteristics of the human visual system (HVS) in mind. This complete new way of image coding provides many opportunities for different approaches, e.g. adaptive sampling (why would you code pixels that you can predict perfectly?), machine learning approaches (learning correlations), kernel regression (function approximation), or dictionary learning. The implementation can be done in Matlab, Python or C/C++. If necessary, acceleration on the GPU using NVIDIA CUDA is possible. Lossy content aware image coding Promotor: Peter Lambert and Rik Van de Walle Supervisors: Ruben Verhack Study Programme: Master Computer Science Engineering, Master Electrical Engineering Number of students: 1 Number of theses: 1 Contact: Ruben Verhack Keywords: Image coding, lossy compression, DPCM, prediction methods, machine learning Location: home, Zuiderpoort or Technicum or Technicum Problem definition: Due to the massive amount of visual materials that are captured and stored nowadays, image compression stays an active field of research. During the last 20 years, powerful non-linear techniques evolved in the machine learning community around the non-linear support vector machine theory for classification and regression of signals. Non-linear regression is intimately related to the problem of non-linear prediction in signal theory, and thus highly relevant for predictive coding in general. Surprisingly, this path has only been recently started gaining attention. Our challenge is the exploration of non-linear statistical dependencies between amplitudes of image samples for reducing redundancy of the source prior to coding while modeling the signal. In simpler terms, using machine learning techniques, it is possible to extract the main structures in an image by learning the model of the image. The more understanding there is about the content of an image, the shorter it can be described, and thus coded. Goals: The goal in this master’s thesis is to develop a strategy for exploiting learned information from the images for coding purposes. This includes finding an underlying model of the image as a description and coding it efficiently. This should be done while keeping the characteristics of the human visual system (HVS) in mind. The implementation can be done in Matlab, Python or C/C++. If necessary, acceleration on the GPU using NVIDIA CUDA is possible. Video processing for interactive visualization of spherical or 360° video Promotor: Peter Lambert and Rik Van de Walle Supervisors: Glenn Van Wallendael, Niels Van Kets Study Programme: Master Computer Science Engineering, Master Electrical Engineering, Master Mathematical Informatics Number of students: 1 Number of theses: 1 Contact: Glenn Van Wallendael Keywords: 360° video, video coding Location: home, Zuiderpoort or Technicum or Technicum Problem definition: In the future, video will not be displayed on a flat surface like a TV. There will be 360° video in which you can look around freely. For this purpose, TVs are getting curved, virtual reality glasses like the Oculus Rift are being developed and videos get projected on spheres around viewers. Whatever the practical solution for display will be, numerous challenges exist about providing this user with the best possible visual experience. In an ideal world where there are no practical limitations, the entire 360° video can be provided to the end user device without any quality sacrifices. In the real world, on the other hand, every resource is limited, but optimizations can be found in the characteristics of this 360° video, the video content and his observer. For example, did you ever look at the eye movement of a person turning his head? The eyes will stay focused on an object and while the head turns different objects along the way will be fixed. Therefore, the importance of predicting the focus points of where the observer is going to watch becomes more important. Additionally, when the observer keeps his body stationary, he is only able to turn his head 180°. With a movie that triggers his will to look around, the observer could start looking around further and be more active. But more actively looking around blurs the observation capacity, so a trade-off can be made. Finally, there is the additional advantage that the left side of the movie is connected to the right side because it represents a sphere. This property is not being used in today regular video coders. Goals: In this work, the student tries to progress the scientific field of 360° video coding, transmission and interactive visualization. In video coding perspective, the regular video gets extended with 360° properties. To improve this 360° video experience, the restrictions of eye and head movement must be taken into account. Real time Ultra HD video compression using hardware/software codesign Promotor: Peter Lambert and Rik Van de Walle Supervisors: Glenn Van Wallendael Study Programme: Master Computer Science Engineering, Master Electrical Engineering, Master Mathematical Informatics Number of students: 1 Number of theses: 1 Contact: Glenn Van Wallendael Keywords: SoC development, Ultra HD, video compression Location: home, Zuiderpoort or Technicum or Technicum Problem definition: Because of the ever increasing resolution of video, software based solutions are not capable of encoding video in real time anymore. One solution would be to make the compression algorithm parallel and execute everything on GPUs or hardware. The problem with video compression and compression in general is that a compression algorithm only works optimally in a serial way. Serial processing enables the compressor to learn from previous decisions with a smaller size as a result. Still, some parts of the video encoder can be handled in parallel like the motion estimation and the intra prediction. Therefore, the ideal solution would be to have a software/hardware co-design in which the general structure is kept serial with a lot of parallel hardware assistance. For this purpose, specialized chips are developed which include an ARM processor combined with hardware programmable parts (FPGA). With both parts available on a single chip, the hardware and software processing can be combined efficiently. Goals: During this thesis, the state-of-the-art High Efficiency Video Coding (HEVC) standard will be implemented in a hardware software combined design. For this purpose, the student can start from the open-source reference software for HEVC video compression (https://hevc.hhi.fraunhofer.de/trac/hevc/browser/trunk ). This software can then be run on the internal processor of the specialized Xilinx Zynq-7000 System on a Chip. This SoC then gives the possibility to execute certain parts in hardware and doing this efficiently will be the main challenge in this thesis. Guaranteeing video quality Promotor: Peter Lambert and Rik Van de Walle Supervisors: Glenn Van Wallendael, Niels Van Kets Study Programme: Master Computer Science Engineering, Master Electrical Engineering, Master Mathematical Informatics Number of students: 1 Number of theses: 1 Contact: Glenn Van Wallendael Keywords: video quality metrics, large data analysis Location: home, Zuiderpoort or Technicum or Technicum Problem definition: Video distribution systems, like the ones from Telenet, Stevie or Netflix, have no idea about the delivered Quality of Experience to the end user. They know what they serve the network, but have no control on what comes out at the end-user. For this purpose, lightweight algorithms exist that analyze the received video and form an opinion about the quality of that video (good, average, or bad). In the past, these algorithms got developed using a limited set of settings in order to keep the ground truth generation process manageable. This ground truth would need to be generated through groups of people scoring all the distorted videos. In this master thesis, such a lightweight video quality algorithm will be developed using an entirely new approach. A large database of videos will be provided such that finally a general applicable video quality metric can be designed. Goals: In this thesis, the goal is to make a lightweight quality metric developed using a large data analysis approach. For this purpose, we have already started the large database generation process which can be extended during the thesis. Additionally, tools will be provided to extract a large part of the incoming video information and convert it into a machine readable format. Apart from extending this work, the main challenge is the analysis of this large dataset which will result in a good quality metric. Such a large dataset analysis is a perfect asset when looking to the job market, because such analysis skills can be applied on a lot of problems that occur in the field. Rate-distortion-complexity modeling for the HEVC decoder under energy constraints of receivers Promotor: Peter Lambert and Rik Van de Walle Supervisors: Luong Pham Van and Glenn Van Wallendael Study Programme: Master Computer Science Engineering, Master Electrical Engineering Number of students: 1 Number of theses: 1 Contact: Glenn Van Wallendael Keywords: HEVC, power-aware video coding, decoding complexity Location: home, Zuiderpoort or Technicum or Technicum Problem definition: In real-life applications, video streams might be decoded by portable devices such as mobile phones or tablets, where the devices have time-varying battery life or different processing capacities. Under such a scenario, the reduction in decoding complexity to adapt to the capacity of receivers becomes a critical issue. One possible solution is to allow the encoder to generate a decoderfriendly bit stream. The new High Efficiency Video Coding (HEVC) standard is expected to be widely used in many multimedia applications thanks to its high compression efficiency compared with H.264/AVC. To achieve this high coding performance, HEVC introduces several new coding techniques resulting in extremely high complexity of both the encoder and decoder. To incorporate this standard in real-life applications, power-aware hardware modules should be investigated and included. Goals: The purpose of this work is to study and propose a rate-distortion-complexity model using statistical and machine learning techniques for HEVC. The proposed model should be evaluated for its impact on the compression ratio of the bit stream. Ideally, this model is included in the encoder to select the optimal coding mode, such that a trade-off between rate-distortion and decoding complexity is achieved with a minimal impact on the compression ratio. Robust detection and reconstruction of damaged digital video files Promotors: Peter Lambert and Rik Van de Walle Supervisors: Patrick De Smet (NICC) and Glenn Van Wallendael Study Programme: Master in Computer Science Engineering, Master of Mathematical Informatics Number of students: 1 Number of theses: 1 Contact: Patrick De Smet (NICC) Keywords: video analysis and processing, video reconstruction Location: home, Zuiderpoort Problem definition: In professional and home environments, recording and storing of audiovisual data is nowadays almost exclusively performed using digital techniques. As a result of this evolution, there is an increasing demand towards new possibilities for analyzing digital information in the context of forensic investigations. A particular and common problem in these investigations is the analysis of damaged digital files due to the complete or partial erasure of files, or due to malfunctioning or damaged data carriers and/or hardware. As a result, police and forensic experts are facing the challenging task of tracking and partially recovering the remaining damaged information, in the context of investigations of retrieved or confiscated multimedia players and carriers, computer systems etc. Goals: The goal of this master’s thesis is to investigate and implement novel techniques in software that allow to recover damaged digital video files from certain ‘data dumps’ in a reliable way, and where possible, to reconstruct and decode the resulting files. In this thesis, you will collaborate with the National Institute of Criminalistics and Criminology (NICC, http://nicc.fgov.be/). The NICC is a scientific institute within the federal department of justice which performs forensic research in collaboration with governments and police departments. User interaction with personalized views extracted from Beyond HD content Promotor: Peter Lambert and Rik Van de Walle Supervisors: Niels Van Kets, Johan De Praeter Study Programme: Master Computer Science Engineering, Master Electrical Engineering Number of students: 1 Number of theses: 1 Contact: Niels Van Kets Keywords: Beyond HD, ultra-high resolution, user interaction Location: home, Zuiderpoort Problem definition: Advances in digital video capturing allow cameras to capture videos with increasingly high resolutions. Using stitching technology, the output of these cameras can be stitched together to create a video with a resolution far beyond HD. However, such an amount of data cannot be easily transported to viewers at home. Additionally, the increased resolution would be wasted on the limited capabilities of their displays. As a solution, the user can select a subset of the beyond HD picture and zoom in on interesting events. But how will the user interact with this beyond HD picture? Does a user prefer a certain type of input such as a tablet, jawbone or oculus rift? Or does a user prefer a more passive approach and simply select the region or events (such as tracking the ball during a game of football) which he/she would like to follow? Goals: Within Multimedia Lab, a framework has been built to extract personalized views from a very high resolution video (8600x2160 pixels). This framework allows to pan, tilt and zoom within the video. The goal of this thesis is to perform research towards existing or new ways of interacting with this framework. This could be active (swiping on tablets, moving around with an oculus rift, making gestures with a jawbone etc.) or passive (automatic feature tracking). The work of the thesis student will be focused on two main areas. First of all, a general and reusable interface or API should be created that allows input originating from different interaction devices. Second, different interaction devices and automatic tracking algorithms must be compared both in terms of complexity and usability. In a final stage, the student will subjectively compare the different interaction devices and automatic tracking systems such that a conclusion can be drawn concerning the usability of each solution. Optimized transmission of compressed genomic data sequences for visualization in Genomeview Promotor: Wesley De Neve, Yao-Cheng Lin (PSB-UGent), and Rik Van de Walle Supervisors: Tom Paridaens Study Programme: Master Computer Science Engineering, Master Electrical Engineering, Master Mathematical Informatics Number of students: 1 Number of theses: 1 Contact: Tom Paridaens Keywords: Bio-Informatics, Compression, DNA, Experimental Research, Visualization, Transmission Location: home, Zuiderpoort, Technologiepark Zwijnaarde Problem definition: As the cost of sequencing genomes has dropped significantly the past years, sequencing technology has been applied more and more to generate databases for researchers, e.g. to investigate the evolution and spreading of organisms such as salmonella. Unfortunately, the speed at which base pairs can be read is higher than the speed at which the sequences can be transmitted. Often, researchers use hard drives to transmit the sequencing data. This generates delays for research and the treatment of epidemics. Goals: Within Multimedia Lab, a framework has been designed that compresses DNA Sequences, based on principles from the video compression domain. Within the PSB, a genome browser has been developed (http://genomeview.org) that allows to study genomes by navigating through genomes, without transmitting them completely. Currently, transmissions from the server to the viewer do not use compression. In this master’s thesis, the goal is to integrate both frameworks to an efficient server-client system and to define different usage scenarios and to optimize the system in terms of compression, storage cost, server load, network load and Quality-of-Experience (QoE). Design and implement an automated platform to benchmark zapping times Promotor: Sebastiaan Van Leuven ([email protected]) and Peter Lambert Supervisors: Glenn Van Wallendael, Wim Baekelandt ([email protected]) Study Programme: Master Computer Science Engineering, Master Mathematical Informatics, Master Industrial Engineering: Elektronica - ICT Number of students: 1 Number of theses: 1 Contact: Sebastiaan Van Leuven ([email protected]) Keywords: Benchmarking, zapping Location: home, Zuiderpoort, TPVision Zwijnaarde Problem definition: Each year, TV sets undergo an extensive number of tests to evaluate the performance (benchmarks). One of those benchmarks is the time required for zapping between television channels within a Digital Video Broadcast (DVB) environment. Since zapping times are highly depending on a number of external factors such as the location of the TV channel within the DVB transport stream, the type of video codec used, and the temporal position within the new video stream it is hard to model and evaluate the theoretical zapping time for televisions. Currently, there is no test environment available to evaluate the zapping time with a statistical relevant procedure. An automated tool allows for an objective benchmark to assist in the research and development of improvement broadcast tuners and decoders. Goals: The purpose of this master thesis is to design, implement and evaluate an automated platform to benchmark zapping times in a Digital Video Broadcast environment (DVB C/T/S). The students should evaluate existing components on the applicability of their use in an automated platform. Based on this study, the student combines all the selected components in a working platform. In a second phase, the student implements a platform in which the previously selected components are combined. This platform should allow to benchmark commercially available television sets by applying a statistically relevant number of tests. The platform should be capable of repeating the same test with the same content such that different panels can be evaluated. Furthermore, to allow for new test cases, the platform should be able to dynamically interpret different test cases. The platform automatically evaluates the performance and visually report the results. Lastly, the student investigates different DVB broadcast use cases, and identifies and designs potentially synthetic corner cases in order to find the largest possible zapping time for a particular TV set. TP Vision is a dedicated TV player in the world of visual digital entertainment. TP Vision is part of TPV Technology, the #1 monitor manufacturer in the world. At its Innovation Site Europe, Ghent we design televisions of the future, for Philips and other brands. NetTV, Ambilight, Android TV and Cinema 21:9 have all been developed in this innovative and driven environment. Recognition of our activities is visible in numerous awards such as prestigious EISA-awards. This master’s thesis is within the Advanced Software Development department which drives the innovation chain from conceptualization to feasibility and prototyping, as well as leads technology and standardization roadmaps. Improving Chrome Cast by using Screen Content Coding tools for VP9 Promotor: Sebastiaan Van Leuven ([email protected]) and Peter Lambert Supervisors: Glenn Van Wallendael, Tabish Siddique ([email protected]) Study Programme: Master Computer Science Engineering, Master Mathematical Informatics, Master Industrial Engineering: Elektronica – ICT, Master Industrial Engineering: Informatica Number of students: 1 Number of theses: 1 Contact: Sebastiaan Van Leuven ([email protected]) Keywords: VP 9, Chrome Cast, Miracast, Screen Content Coding Location: home, Zuiderpoort, TPVision Zwijnaarde Problem definition: Graphical content on computer displays (i.e., screen content) is often transmitted towards televisions using Miracast, Chrome Cast or similar techniques. Nevertheless, screen content coding often yields visible and annoying artifacts around edges of textures and text within an image. To reduce these artifacts, screen content coding has been proposed as an extension to High Efficiency Video Coding. However, screen casting scenarios where mobile devices transmit the display towards an Android TV make use of the open source VP 9 video codec. Therefore, no screen content coding mechanisms can be exploited and the classical coding artifacts remain visible, reducing the quality of experience. Goals: This master thesis aims at investigating screen content coding mechanisms to improve the capabilities of the VP9 encoder for screen content. In a first phase an analysis is performed to identify the most effective screen content coding tools within HEVC. From this analysis, the most promising techniques are identified, taking into account the target use case of a mobile phone to television casting scenario. In a second phase, the identified techniques are adapted, taking into account specific VP9 encoding tools. Alternatively, the student can identify new coding algorithms and use these to show potential improvements. Finally, the student shows the improvement of a VP9 encoder using screen content coding tools for relevant content. Ultimately, the evaluated techniques can be committed towards the VP9 open source project. During the course of this master thesis, the student is also encouraged to identify conceptually ground breaking techniques which might not be implementable. TP Vision is a dedicated TV player in the world of visual digital entertainment. TP Vision is part of TPV Technology, the #1 monitor manufacturer in the world. At its Innovation Site Europe, Ghent we design televisions of the future, for Philips and other brands. NetTV, Ambilight, Android TV and Cinema 21:9 have all been developed in this innovative and driven environment. Recognition of our activities is visible in numerous awards such as prestigious EISA-awards. This master’s thesis is within the Advanced Software Development department which drives the innovation chain from conceptualization to feasibility and prototyping, as well as leads technology and standardization roadmaps. Low-complexity temporal transcoding for high frame rate HEVC Promotor: Sebastiaan Van Leuven ([email protected]) and Peter Lambert Supervisors: Glenn Van Wallendael, Bert Van Dam ([email protected]) Study Programme: Master Computer Science Engineering, Master Mathematical Informatics Number of students: 1 Number of theses: 1 Contact: Sebastiaan Van Leuven ([email protected]) Keywords: HEVC, temporal transcoding, next generation action cameras Location: home, Zuiderpoort, TPVision Zwijnaarde Problem definition: Capabilities of television sets have always been limited by the underlying hardware. As the current evolutions in media experience, such as 4K, high frame rate, and high dynamic range are rapidly evolving, existing hardware is not capable of providing playback for such new features. Therefore, older television sets are not able to play a large amount of newly generated (user) content. Currently, Go Pro action cameras are capable of capturing Full HD content at 120fps, while mainstream hardware decoders are only capable of decoding 60fps. Therefore, to allow a stable and future proof product, new techniques should be designed to efficiently adapt the input bit streams and allow for playback capabilities with a reduced experience. Goals: This master’s thesis aims at investigating novel techniques to increase the amount of media content a television set can play. This thesis will primarily focus on temporal transcoding for HEVC. To allow deployment of temporal transcoding techniques on existing television sets, the newly developed algorithms should have a low-complexity. In a first phase, the research can focus on evaluating algorithms with similar behavior as H.264/AVC. This can be used as a starting point for scientific evaluation of future algorithms. The student can define the second phase based on the outcome of the previous results. Either the algorithms of the first phase are optimized, or new algorithms are investigated which explicitly exploit HEVC specific features such as temporal stability of CU and PU structures. TP Vision is a dedicated TV player in the world of visual digital entertainment. TP Vision is part of TPV Technology, the #1 monitor manufacturer in the world. At its Innovation Site Europe, Ghent we design televisions of the future, for Philips and other brands. NetTV, Ambilight, Android TV and Cinema 21:9 have all been developed in this innovative and driven environment. Recognition of our activities is visible in numerous awards such as prestigious EISA-awards. This master’s thesis is within the Advanced Software Development department which drives the innovation chain from conceptualization to feasibility and prototyping, as well as leads technology and standardization roadmaps. Optimizing a simulcast encoder for Standard and High Dynamic Range content Promotors: Sebastiaan Van Leuven ([email protected]) and Peter Lambert Supervisors: Glenn Van Wallendael Study Programme: Master Computer Science Engineering, Master Mathematical Informatics Number of students: 1 Number of theses: 1 Contact: Sebastiaan Van Leuven ([email protected]) Keywords: HEVC, HDR, EOTF, Netflix Location: home, Zuiderpoort, TPVision Zwijnaarde Problem definition: Over-the-top (OTT) content providers such as Netflix create a plethora of versions of the same content. Currently, a different version of the content is transmitted to devices based on the capabilities in terms of resolution, frame rate, memory size, audio support, etc. As the current hype of 4K is fading out, a new technology has arrived. High Dynamic Range (HDR) allows to improve the perceptual quality and immersivity by introducing more luminosity in the picture. This requires televisions which can be 40 times as bright as current 4K televisions. However, encoding such content requires yet again processing power for another encoder. As a large amount of the visuals data is similar between Standard Dynamic Range (SDR) and HDR, optimizations between both encoding loops can reduce the complexity and power consumption significantly. Goals: In a first phase, the student investigates the benefits and properties of HDR content. The research focuses on HDR content captured and mastered using a new Electro Optical Transfer Function (EOTF) as defined by SMPTE ST 2084, which is basically applying a different Gamma correction to the decoded signal. This new EOTF changes the properties of the pixel values, such that different encoding decision are made. In a second phase an investigation of the different encoding decision between SDR and HDR content is evaluated and mapped, which can be done using machine learning tools. From this analysis a fast mode decision model is created and evaluated in an encoder. To demonstrate the scalability of the proposed model, a single loop encoder can be designed where the SDR and HDR content are encoded simultaneously using the proposed model. This can be implemented in a real-life HEVC encoder such as x265. TP Vision is a dedicated TV player in the world of visual digital entertainment. TP Vision is part of TPV Technology, the #1 monitor manufacturer in the world. At its Innovation Site Europe, Ghent we design televisions of the future, for Philips and other brands. NetTV, Ambilight, Android TV and Cinema 21:9 have all been developed in this innovative and driven environment. Recognition of our activities is visible in numerous awards such as prestigious EISA-awards. This master thesis is within the Advanced Software Development department which drives the innovation chain from conceptualization to feasibility to prototyping, as well as leads technology and standardization roadmaps.
© Copyright 2024