Text to Image Generation with Semantic-Spatial Aware GAN Semantic-Spatial Aware GAN Explained

New Apple model combines vision understanding and image generation with impressive results

Manzano combines visual understanding and text-to-image generation, while significantly reducing performance or quality trade-offs.

Decrypt

China's Z.AI Releases First Major AI Image Generation Model Trained Without American Chips

Image, trained entirely on Huawei chips, as Beijing moves to block Nvidia H200 imports in a push for AI self-reliance.

EurekAlert!

Socially aware AI helps autonomous vehicles weave through crowds without collisions

Researchers from Tongji University and Shanghai Jiao Tong University have developed a socially aware prediction-to-control pipeline that lets autonomous vehicles safely navigate dense crowds by ...

GitHub

Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation

torchrun --nproc_per_node=8 --nnodes=1 \ main_cache.py \ --img_size 256 --vqgan_path tokenizers/vq_ds16_c2i.pt \ --data_path ${IMAGENET_PATH}--cached_path ${CACHED ...

PC Magazine

The Best AI Video Generators for 2026

AI video generators help you turn your prompts into believable videos, complete with audio. We've tested all the top services to help you choose the one that does the best job with the fewest tweaks.

TechCrunch

Fei-Fei Li’s World Labs speeds up the world model race with Marble, its first commercial product

World Labs, the startup founded by AI pioneer Fei-Fei Li, is launching its first commercial world model product. Marble is now available via freemium and paid tiers that let users turn text prompts, ...

GPS World

Building the future of localization: how GNSS+IMU and VPS work together

No audio available for this content. Accurate localization underpins modern mobility, powering everything from precise rideshare pickups and efficient deliveries to augmented reality and autonomous ...

blockchain

Discrete Diffusion Models for Text Generation: AI Paradigm Shift Explained by Karpathy

According to Andrej Karpathy, the application of discrete diffusion models to text generation offers a simple yet powerful alternative to traditional autoregressive methods, as illustrated in his ...

IEEE

Text-to-Image Activation for Open-Vocabulary Semantic Segmentation in Remote Sensing

Abstract: Open-vocabulary semantic segmentation in remote sensing aims to recognize arbitrary object categories from satellite imageries beyond a fixed label set, but its progress is constrained by ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results