RealSense demos safer humanoid navigation at NVIDIA GTC, using 3D vision and simulation to enable reliable real-world robot movement.
cuVSLAM is the library by NVIDIA, providing various Visual Tracking Camera modes and Simultaneous Localization and Mapping (SLAM) capabilities. Leveraging CUDA acceleration and a rich set of features, ...
Abstract: Making multi-camera visual SLAM systems easier to set up and more robust to the environment is attractive for vision robots. Existing monocular and binocular vision SLAM systems have narrow ...
Abstract: Visual-inertial odometry (VIO) is widely used in autonomous driving, robots, and drones in global navigation satellite system (GNSS)-denied environments. Current VIO methods primarily rely ...
In this tutorial, we build an end-to-end visual document retrieval pipeline using ColPali. We focus on making the setup robust by resolving common dependency conflicts and ensuring the environment ...