FreeNeRF: Frequency-regularized NeRF

How does FreeNeRF work?

1. High-frequency signals cause catastrophic overfitting in few-shot neural rendering.

Neural rendering methods, such as NeRF, can learn 3D scene representations from a set of 2D images without explicit 3D geometry. Instead, the 3D geometry is learned implicitly by optimizing appearance in the 2D projected views. However, when given only very few input views, NeRF is prone to overfitting: it fits these 2D images with small loss without explaining the 3D geometry in a multi-view consistent way.

This overfitting in few-shot neural rendering is further exacerbated by the presence of high-frequency signals in the input positional encoding. A previous study shows that higher-frequency mappings enable faster convergence for high-frequency components. However, converging too quickly on high-frequency components leads to catastrophic overfitting in few-shot neural rendering.

To test this, we conducted an experiment in which we trained NeRF models with masked positional encodings by setting the high-frequency bits to zero:

pos_enc[int(L * x / 100):] = 0

where L is the length of the positional encoding and x is the visible ratio in percent.
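
The masking step can be sketched in plain Python as follows. This is a minimal illustration rather than the code used in the experiment: the helper names `positional_encoding` and `mask_high_frequencies` are hypothetical, and the encoding layout (sin/cos pairs ordered from lowest to highest frequency, scalar input) is an assumption made so that "high-frequency bits" correspond to the tail of the list.

```python
import math


def positional_encoding(x, num_freqs):
    """Encode a scalar as sin/cos pairs, ordered from low to high frequency.

    Assumed layout: [sin(2^0 x), cos(2^0 x), ..., sin(2^(n-1) x), cos(2^(n-1) x)].
    """
    enc = []
    for k in range(num_freqs):
        enc.append(math.sin((2.0 ** k) * x))
        enc.append(math.cos((2.0 ** k) * x))
    return enc


def mask_high_frequencies(pos_enc, visible_percent):
    """Keep the first visible_percent% of the entries and zero the rest."""
    cutoff = int(len(pos_enc) * visible_percent / 100)
    return pos_enc[:cutoff] + [0.0] * (len(pos_enc) - cutoff)


enc = positional_encoding(0.5, num_freqs=10)              # 20 entries
masked = mask_high_frequencies(enc, visible_percent=10)   # only the first 2 stay visible
```

A list comprehension or array slice assignment would work equally well; the only point is that everything past the cutoff index contributes nothing to the network input.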

The following videos show that high-frequency signals cause severe overfitting in few-shot neural rendering. Using only low-frequency inputs lets NeRF learn 3D scene representations, but the resulting models can still be over-smoothed. These results motivate addressing overfitting in the frequency domain, so that the 3D scene representations are accurate without being over-smoothed.

High-frequency inputs cause catastrophic failure in few-shot neural rendering.

2. Frequency regularization enjoys the benefits of both high-frequency and low-frequency signals.

We propose Frequency Regularization. Given a positional encoding, we apply a frequency mask that grows linearly with the training step, regularizing which part of the frequency spectrum is visible, as described in Equations 4 and 5 of the paper.

The following figure shows how the frequency mask changes over the training steps. We use a 50% schedule as an example, i.e., all inputs become visible at the midpoint of training. By gradually increasing the visibility of the high-frequency signals, Frequency Regularization reduces the risk of overfitting, which causes catastrophic failure at the beginning of training, and avoids underfitting, which causes over-smoothness at the end.

Frequency mask changes over the training step.
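
A linearly growing mask of this kind could look like the sketch below. It is an assumption-laden illustration, not the paper's Equations 4 and 5, whose exact form may differ in detail: the function name `frequency_mask`, the parameter `reg_steps` (the step at which every entry is visible), and the choice to ramp one entry at a time with a fractional weight are all hypothetical.

```python
def frequency_mask(length, step, reg_steps):
    """Per-entry weights in [0, 1]; the visible prefix grows linearly with step.

    Assumption: entries are ordered from low to high frequency, and the
    entry at the moving boundary receives a fractional weight.
    """
    if step >= reg_steps:
        return [1.0] * length               # regularization finished: all visible
    visible = length * step / reg_steps     # fractional count of visible entries
    full = int(visible)
    mask = [0.0] * length
    for i in range(full):
        mask[i] = 1.0                       # fully visible low-frequency entries
    mask[full] = visible - full             # partially visible boundary entry
    return mask


# 50% schedule: with 1000 training steps, everything is visible from step 500 on.
total_steps = 1000
reg_steps = total_steps // 2
early = frequency_mask(10, 125, reg_steps)      # 2.5 entries visible
late = frequency_mask(10, 750, reg_steps)       # all entries visible
```

The mask would be multiplied element-wise with the positional encoding before it enters the network, so masked entries behave like the zeroed entries in the experiment of Section 1.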

The following videos show two examples. NeRF models first learn smooth, coarse 3D scene representations from low-frequency signals only. As training progresses, more high-frequency signals become visible, and the model learns more accurate 3D scene representations from both high-frequency and low-frequency signals.

Training steps increase from left to right.

3. Occlusion regularization addresses the near-camera floaters.

Despite the use of Frequency Regularization, some characteristic artifacts may still appear in certain novel views due to the limited number of training views and the inherent ill-posedness of the problem. These artifacts often manifest as "walls" or "floaters" that are close to the camera and can significantly degrade the quality of the 3D scene representations.

To address this issue, we propose a new method called Occlusion Regularization, which penalizes the dense fields near the camera as described in Equation 6 of the paper. By reducing the influence of these dense fields, Occlusion Regularization helps to improve the accuracy and realism of the 3D scene representations, as shown in the visual comparisons below between models without (left) and with (right) occlusion regularization.
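
The idea of penalizing density near the camera can be sketched for a single ray as below. This is a minimal illustration under stated assumptions, not the paper's Equation 6: the function name `occlusion_loss` and the parameter `num_near` are hypothetical, samples are assumed to be ordered from nearest to farthest, and the penalty is assumed to be a masked mean of the predicted densities.

```python
def occlusion_loss(densities, num_near):
    """Mean density along one ray, counting only the num_near nearest samples.

    Assumption: densities[0] is the sample closest to the camera.
    """
    mask = [1.0 if k < num_near else 0.0 for k in range(len(densities))]
    return sum(s * m for s, m in zip(densities, mask)) / len(densities)


# Dense samples right in front of the camera are penalized ...
floater_ray = occlusion_loss([4.0, 2.0, 0.0, 10.0], num_near=2)
# ... while density farther along the ray is left untouched.
clean_ray = occlusion_loss([0.0, 0.0, 0.0, 10.0], num_near=2)
```

Added to the rendering loss with a small weight, a term like this discourages the network from placing opaque "walls" or "floaters" immediately in front of the training cameras.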

FreeNeRF: Improving Few-shot Neural Rendering with Free Frequency Regularization
CVPR 2023
