Abstract : In this minisymposium, we will discuss the current developments of implicit and explicit deep learning models for inverse problems. Explicit deep learning models are based on stacking several discrete layers to solve a given downstream tasks. Another interesting perspective is implicit models, where one can specify the conditions to satisfy. Within this context our session will cover these two paradigms through new developments in Hybrid Models, Neural ODES – PDEs and Beyond. Moreover, discussing interesting real-world applications for a wide range of inverse problems.
Organizer(s) : Angelica Aviles-Rivero, Raymond H. Chan
02349 (1/2) : 4C @E817 A715 [Chair: Raymond H Chan & Angelica Aviles-Rivero]
[03463] Learning pair-wise homeomorphic image registration in a conformal-invariant hyperelastic setting
Format : Talk at Waseda University
Author(s) :
Noémie DEBROUX (Université Clermont Auvergne)
Jing Zou (The Hong Kong Polytechnic University)
Lihao Liu (University of Cambridge)
Angelica Aviles-Rivero (University of Cambridge)
Jing Qin (The Hong Kong Polytechnic University)
Carola-Bibiane Schönlieb (University of Cambridge)
Abstract : Deformable image registration is a fundamental task in medical image analysis and plays a crucial role in a wide range of clinical applications. Recently, deep learning-based approaches have been widely studied for deformable medical image registration and achieved promising results. However, existing deep learning image registration techniques
do not theoretically guarantee physically-meaningful transformations and usually require a lot of training data. In order to overcome these drawbacks, we propose a novel framework for pair-wise deformable image registration in a deep-learning framework. Firstly, we introduce a novel regulariser in the loss function based on conformal-invariant properties in a nonlinear elasticity setting. It theoretically guarantees that the obtained deformations are homeomorphisms and therefore preserve topology. Secondly, we boost the performance of our regulariser through coordinate MLPs, where one can view the to-be-registered images as continuously differentiable entities. We evaluate our model through extensive numerical experiments.
[03252] Spherical Image Inpainting with Frame Transformation and Data-driven Prior Deep Networks
Format : Talk at Waseda University
Author(s) :
Tieyong Zeng (The Chinese University of Hong Kong)
Abstract : Spherical image processing has been widely applied in many important fields. In this talk, we focus on the challenging task of spherical image inpainting with deep learning-based regularizer. We employ a fast directional spherical Haar framelet transform and develop a novel optimization framework based on a sparsity assumption. Furthermore, by employing progressive encoder-decoder architecture, a new and better-performed deep CNN denoiser is carefully designed and works as an implicit regularizer. Finally, we use a plug-and-play method to handle the proposed optimization model, which can be implemented efficiently by training the CNN denoiser prior. Numerical experiments are conducted and show that the proposed algorithms can greatly recover damaged spherical images and achieve the best performance over purely using deep learning denoiser and plug-and-play model. This is a joint work with Jianfei Li, Chaoyan Huang, Raymond Chan, Han Feng, and Michael Ng.
[05441] Learning to solve inverse problems with unsupervised nonlinear models
Format : Talk at Waseda University
Author(s) :
Rihuan Ke (University of Bristol)
Carola-Bibiane Schönlieb (University of Cambridge)
Abstract : Deep learning methods have recently demonstrated remarkable achievements in solving inverse problems. At the core of these methods lies the learning tasks of finding effective inverse problem solvers from a parameterised operator space, which is typically high dimensional. In the context of supervised learning, these learning tasks can be effectively tackled with sufficient supervised data, consisting of paired measurements and ground truth solutions. However, when the ground truth solutions are unknown, these learning tasks can be as challenging as solving the inverse problems themselves. In this talk, we present a hybrid method that addresses the learning tasks in an unsupervised learning setting for denoising and inverse problems more generally, where access to high-quality supervised data is restrictive or unavailable. We highlight a class of nonlinear operators that can be learned from noisy data and offer close approximations to the optimal solutions. Based on these nonlinear operators, we introduce a learning algorithm for solving inverse problems with limited knowledge of the underlying ground truth solutions and noise distributions.
[04181] A learning framework for mapping problems via Quasiconformal geometry
Author(s) :
Ronald Lok Ming LUI (The Chinese University of Hong Kong)
Qiguang Chen (The Chinese University of Hong Kong)
Abstract : Many imaging problems can be formulated as a mapping problem. A general mapping problem aims to obtain an optimal mapping that minimizes an energy functional subject to the given constraints. Existing methods to solve the mapping problems are often inefficient and can sometimes get trapped in local minima. An extra challenge arises when the optimal mapping is required to be diffeomorphic. In this talk, we address the problem by proposing a deep-learning based framework based on the Quasiconformal (QC) Teichmüller theories. The main strategy is to learn the Beltrami coefficient (BC) that represents a mapping as the latent feature vector in the deep neural network. The BC measures the geometric distortions under the mapping. As such, the proposed network based on QC theories is explainable. Another crucial advantage of the proposed framework is that once the network is successfully trained, the optimized mapping corresponding to each input data information can be obtained in real time. In this talk, we will illustrate our framework by applying it to solve the diffeomorphic image registration problem. The developed network, called the quasiconformal registration network (QCRegNet), outperforms other state-of-the-art image registration models. This work is supported by HKRGC GRF (Project IDs: 14305919, 14306721,14307622).
02349 (2/2) : 4D @E817 A715 [Chair: Angelica Aviles Rivero & Raymond H Chan]
[05328] Physics Informed Graph Transformer for PDEs
Format : Online Talk on Zoom
Author(s) :
Andrey Bryutkin (University of Cambridge)
Angelica Aviles-Rivero (University of Cambridge)
Jiahao Huang (Imperial College London)
Abstract : In recent years, robust PDE solvers have become increasingly important, necessitating more input variety. The physics-informed graph transformer (PhysGTN) uses graphs to solve underlying problems described on an irregular grid and combines multiple parameter inputs of the PDE. It applies a transformer network to learn specific resemblances of data and additional inputs, which the PDE provides. The architecture is designed to be discretization invariant and flexible enough to handle irregular meshes. The PhysGTN offers several advantages over traditional numerical methods, including increased computational efficiency, reduced time needed for obtaining solutions, and increased robustness to additional noise. This can lead to various challenges and applications for the setup.
[03627] Continuous U-Net: Faster, Greater and Noiseless
Format : Talk at Waseda University
Author(s) :
Chun-Wun Cheng (City University of Hong Kong)
Christina Runkel (University of Cambridge)
Lihao Liu (University of Cambridge)
Raymond Honfu Chan (City University of Hong Kong)
Carola-Bibiane Schönlieb (University of Cambridge)
Angelica Aviles-Rivero (University of Cambridge)
Abstract : Image segmentation is a fundamental task in image analysis and clinical practice. The current state-of-the-art techniques are based on U-shape type encoder-decoder networks with skip connections called U-Net. Despite the powerful performance reported by existing U-Net type networks, they suffer from several major limitations. These issues include the hard coding of the receptive field size, compromising the performance and computational cost, as well as the fact that they do not account for inherent noise in the data. They have problems associated with discrete layers, and do not offer any theoretical underpinning. In this work we introduce continuous U-Net, a novel family of networks for image segmentation. Firstly, continuous U-Net is a continuous deep neural network that introduces new dynamic blocks modelled by second order ordinary differential equations. Secondly, we provide theoretical guarantees for our network demonstrating faster convergence, higher robustness and less sensitivity to noise. Thirdly, we derive qualitative measures to tailor-made segmentation tasks. We demonstrate, through extensive numerical and visual results, that our model outperforms existing U-Net blocks for several medical image segmentation benchmarking datasets.
[05633] Why Deep Surgical Models Fail?: Revisiting Surgical Action Triplet Recognition through the Lens of Robustness
Format : Talk at Waseda University
Author(s) :
Yanqi Cheng (University of Cambridge )
Abstract : Surgical action triplet recognition is of high relevance as it provides the surgeon with context-aware support and safety. The go-to strategy develops new network mechanisms. However, the performance of state-of-the-art techniques is substantially lower than other surgical tasks. Why is this happening? This is the question that we address in this work. We present the first study to understand the failure of existing deep learning models through the lens of robustness and explainability.
[05634] On Implicit Neural Representation
Format : Online Talk on Zoom
Author(s) :
Zhenda Shen (City University of Hong Kong)
Abstract : In this talk, we explore a novel approach to implicit neural representations, introducing a novel function that capitalizes on the advantages of Strong Spatial and Frequency attributes. Unlike conventional methods, our proposed technique exhibits remarkable performance improvements across a diverse range of downstream tasks. Through rigorous experimentation and validation, we demonstrate the superior capabilities of our new function in critical applications such as denoising, CT reconstruction, and 3D reconstruction.
Implicit neural representations have garnered significant interest in recent years due to their ability to model complex and high-dimensional data without requiring explicit parameterization. Our novel function leverages both spatial and frequency domains, enhancing its ability to capture intricate patterns and relationships within the data.
During the presentation, we delve into the technical details of our approach, providing insights into how we use spatial and frequency information effectively. We highlight the advantages of our technique over existing methods, emphasizing the superior performance and efficiency it offers.
Through extensive experimental evaluations, we showcase how our new function excels in denoising tasks, enabling high-fidelity reconstructions even in noisy environments. Additionally, we demonstrate its exceptional capabilities in CT reconstruction, where our approach delivers accurate and robust results with reduced artifacts. Furthermore, in the realm of 3D reconstruction, our method outperforms existing techniques, offering more precise and detailed representations.