Does it support sound generation?

No, NUWA-Infinity focuses exclusively on visual synthesis (images and video).

NUWA-Infinity

NUWA-Infinity | Find AI List

Overview

NUWA-Infinity is a state-of-the-art generative model developed by Microsoft Research Asia, designed for the synthesis of high-quality images and videos from text, image, or video inputs. Unlike standard generative models that are limited by fixed resolutions, NUWA-Infinity employs an 'Autoregressive-over-Autoregressive' (AR-over-AR) architecture. This technical framework allows the model to generate visual content with essentially infinite resolution by modeling local and global context simultaneously. As of 2026, it remains a cornerstone in the evolution of visual AI, positioning itself as a superior alternative for tasks requiring extreme spatial extensions, such as outpainting and long-form video prediction. The architecture leverages a Vector Quantized Variational Autoencoder (VQ-VAE) to compress visual data into discrete tokens, which are then processed by a multi-modal transformer. Its primary market position is centered on high-fidelity creative automation and professional visual effects, providing a foundation for next-generation cinematic tools. While primarily a research-driven project, its open-source components and academic releases have heavily influenced commercial video generation platforms, setting the benchmark for temporal consistency and spatial resolution in synthetic media.

Common tasks

Infinite image outpainting Text-to-video synthesis Image-to-video animation Video prediction and extension High-resolution image synthesis

FAQ

View all

How does NUWA-Infinity achieve infinite resolution?

It uses an Autoregressive-over-Autoregressive mechanism that generates images patch-by-patch while maintaining global structural tokens.

Is NUWA-Infinity free to use?

As a research project, the code and model weights are generally released for free for non-commercial/academic use.

What hardware is required to run it?

A high-end NVIDIA GPU (A100 or H100 preferred) is recommended for video generation and high-resolution synthesis.

Can I use it for commercial projects?

You must check the specific licensing in the Microsoft Research repository; it is often restricted to research purposes unless otherwise stated.

FAQ+

How does NUWA-Infinity achieve infinite resolution?

It uses an Autoregressive-over-Autoregressive mechanism that generates images patch-by-patch while maintaining global structural tokens.

Is NUWA-Infinity free to use?

As a research project, the code and model weights are generally released for free for non-commercial/academic use.

What hardware is required to run it?

A high-end NVIDIA GPU (A100 or H100 preferred) is recommended for video generation and high-resolution synthesis.

Can I use it for commercial projects?

You must check the specific licensing in the Microsoft Research repository; it is often restricted to research purposes unless otherwise stated.

View all

Compare with top alternatives

Full compare

Tool	Pricing	Rating	Visits
NUWA-InfinityCurrent	Free	-	-
CapCut	Freemium	★ 0.0	-
Cava (Artflow.ai)	Freemium	★ 0.0	-
DeepBrain AI	Paid	★ 0.0	-

NUWA-Infinity

Current

Pricing: Free
Rating: -
Visits: -

CapCut

Pricing: Freemium
Rating: ★ 0.0
Visits: -

Cava (Artflow.ai)

Pricing: Freemium
Rating: ★ 0.0
Visits: -

DeepBrain AI

Pricing: Paid
Rating: ★ 0.0
Visits: -

NUWA-Infinity

Should you use NUWA-Infinity?

Overview

FAQ

Pricing

Pros & Cons

Compare with top alternatives

More tools from Nuwa-infinity

Reviews & Ratings