Vision-Language Models Tutorial

Microsoft Builds A Compact AI Model That Decides When To Think

Microsoft's Phi-4-reasoning-vision-15B uses careful data curation and selective reasoning to compete with models trained on ...

Analytics Insight

Explore Data Science This Weekend: Best YouTube Channels to Follow

Overview: Free YouTube channels provide structured playlists covering AI, ML, and analytics fundamentals.Practical coding demonstrations help build real-world d ...

The Robot Report

Vision-language-action models are the next leap in autonomous robotics

Explore how vision-language-action models like Helix, GR00T N1, and RT-1 are enabling robots to understand instructions and act autonomously.

InfoQ

Moonshot AI Releases Open-Weight Kimi K2.5 Model with Vision and Agent Swarm Capabilities

A monthly overview of things you need to know as an architect or aspiring architect. Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with ...

IEEE

Remote Sensing Spatiotemporal Vision–Language Models: A comprehensive survey

Abstract: The interpretation of multitemporal remote sensing imagery is critical for monitoring Earth’s dynamic processes. However, previous change detection (CD) methods, which produce binary or ...

Interesting Engineering

AI creates artificial animals that over time develop functioning vision without instruction

Researchers created the virtual animals and released them into a synthetic world, giving them tasks on how to navigate, avoid obstacles and find food. (Representational image)Donald/Devrimb ...

IEEE

Vision-Language Model-Driven Human-Vehicle Interaction for Autonomous Driving: Status, Challenge, and Innovation

Abstract: This paper investigates the potential of Vision-Language Models (VLMs) to enhance Human-Vehicle Interaction (HVI) in Autonomous Driving (AD) scenarios, particularly in interactions between ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results