Learning JavaScript Video

Live: Learning Video LLM with Streaming Speech Transcription at Scale

Abstract: Recent video large language models (Video LLMs) often depend on costly human annotations or proprietary APIs (e.g., GPT-4o) to produce training data, which limits their training at scale. In ...

IEEE

Deep Learning-Based Object Tracking in Satellite Videos: A comprehensive survey with a new dataset

Abstract: As a fundamental task for research in satellite videos (SVs), object tracking is used to track the target of interest in traffic evaluation, military security, and so forth. The current ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Live: Learning Video LLM with Streaming Speech Transcription at Scale

Deep Learning-Based Object Tracking in Satellite Videos: A comprehensive survey with a new dataset

Trending now