Scaling test-time compute
Boost LLM answers with flexible test-time search strategies
Inpaint images with custom prompts
Multimodal Image-to-Video
Remove backgrounds from images instantly
Generate 3D models and videos from images
Generate a 3D mesh from a single image
Generate images by blending foregrounds with custom backgrounds
Erase any object from an image with just a prompt
Generate spatial audio from images (and optionally text)
Media understanding
Edit image regions using a reference picture
Transcribe audio to text instantly using WebGPU
Generate animated video from two images and a prompt