Python GUI Multiple Video Stream Box

Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation (NeurIPS 2025)

We propose MultiTalk, a novel framework for audio-driven multi-person conversational video generation. Given a multi-stream audio input, a reference image and a prompt, MultiTalk generates a video ...

IEEE

M2HF: Multi-Branch Multi-Modal Hybrid Fusion for Text-Video Retrieval

Abstract: Videos contain multimodal content, and exploring multi-branch cross-modal interactions with natural language queries can be of benefit to the text-video retrieval task (TVR). However, recent ...

GitHub

Zero-dependency Python linter for detecting and fixing misaligned ASCII art boxes in documentation.

AI-generated ASCII flowcharts and diagrams often have subtle formatting errors where box borders are misaligned by 1-2 characters. This breaks visual integrity and makes documentation harder to read.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation (NeurIPS 2025)

M2HF: Multi-Branch Multi-Modal Hybrid Fusion for Text-Video Retrieval

Zero-dependency Python linter for detecting and fixing misaligned ASCII art boxes in documentation.

Trending now