TIME LENS: EVENT-BASED VIDEO FRAME INTERPOLATION

HUAWEI TECHNOLOGIES, UNIVERSITY OF ZURICH, ETH ZURICH


Stepan Tulyakov, Daniel Gehrig, Stamatios Georgoulis, Julius Erbach, Mathias Gehrig, Yuanyou Li, Davide Scaramuzza

ABSTRACT

State-of-the-art frame interpolation methods generate intermediate frames by inferring object motions in the image from consecutive key-frames. In the absence of additional information, first-order approximations, i.e. optical flow, must be used, but this choice restricts the types of motions that can be modeled, leading to errors in highly dynamic scenarios. Event cameras are novel sensors that address this limitation by providing auxiliary visual information in the blind-time between frames. They asynchronously measure per-pixel brightness changes and do this with high temporal resolution and low latency. Event-based frame interpolation methods typically adopt a synthesis-based approach, where predicted frame residuals are directly applied to the key-frames. However, while these approaches can capture non-linear motions they suffer from ghosting and perform poorly in low-texture regions with few events. Thus, synthesis-based and flow-based approaches are complementary. In this work, we introduce Time Lens, a novel method that leverages the advantages of both. We extensively evaluate our method on three synthetic and two real benchmarks where we show an up to 5.21 dB improvement in terms of PSNR over state-of-the-art frame-based and event-based methods. Finally, we release a new large-scale dataset in highly dynamic scenarios, aimed at pushing the limits of existing methods.

 

INVENTORS COMMUNITY

Our fast-growing network of 12,000+ researchers and developers is revealing the invisible with Prophesee’s neuromorphic vision technologies. From giving sight back to the blind to touching cells or tracking space debris, they have shown incredible imagination and innovation by leveraging Event-Based Metavision.

Don’t miss a bit,

follow us to be the first to know

✉️ Join Our Newsletter