Vision Language Model Architecture

New Apple model combines vision understanding and image generation with impressive results

Manzano combines visual understanding and text-to-image generation, while significantly reducing performance or quality trade-offs.

10d

Meta’s Vision-Language Shift VL-JEPA Beats Bulky LLMs

VL-JEPA predicts meaning in embeddings, not words, combining visual inputs with eight Llama 3.2 layers to give faster answers ...

Electronic Design

Vision-Language-Action Model Opens Level 4 Frontier for Autonomous Driving

Safely achieving end-to-end autonomous driving is the cornerstone of Level 4 autonomy and the primary reason it hasn’t been widely adopted. The main difference between Level 3 and Level 4 is the ...

CIO

Understanding transformers: What every leader should know about the architecture powering GenAI

GenAI isn’t magic — it’s transformers using attention to understand context at scale. Knowing how they work will help CIOs ...

Nvidia Doubles Down On Enterprise AI Infrastructure With Five Strategic Platform Launches

The announcements reflect a calculated shift from discrete chip sales to integrated systems that address enterprise ...

Z.ai's open source GLM-Image beats Google's Nano Banana Pro at complex text rendering, but not aesthetics

Furthermore, Nano Banana Pro still edged out GLM-Image in terms of pure aesthetics — using the OneIG benchmark, Nano Banana 2 ...

GlobalData on MSN

Visteon and TomTom launch ‘world first’ in-car local AI navigation system

Visteon has announced a strategic collaboration with TomTom, a specialist in mapping and location technology, to deliver what ...

Domus

Snøhetta, Jean Nouvel, and Zaha Hadid: Not a Hotel is architecture’s answer to Airbnb

From Japan comes a hospitality model in which architecture, technology and flexibility intertwine to redefine post-pandemic ...

15d

DeepSeek develops mHC AI architecture to boost model performance

DeepSeek researchers have developed a technology called Manifold-Constrained Hyper-Connections, or mHC, that can improve the performance of artificial intelligence models. The Chinese AI lab debuted ...

Computer Weekly

CES 2026: Sony Honda Mobility drives out Afeela SDV prototype

Japanese joint venture mobility tech company established by Sony Group Corporation and Honda Motor Co advances its vision of mobility as a creative entertainment space.

RoboChallenge's Top-Ranked Embodied AI Model Goes Open Source, Challenging Clean Data Collection Paradigm

Spirit AI, an embodied AI startup, today announced that its latest VLA model, Spirit v1.5, has ranked first overall on the ...

i-SCOOP

TongYi Fun-Audio-Chat speech to speech model

Discover the TongYi Fun-Audio-Chat speech-to-speech model by Alibaba Group. Explore how this Large Audio Language Model ...
