Ant Group's Robbyant Unveils LingBot-Map

New Model Enhances Real-Time Spatial Understanding

On April 16, Robbyant, the AI arm of Ant Group, announced the launch of LingBot-Map, a groundbreaking open-source streaming 3D reconstruction model. This model empowers robots, autonomous vehicles, and AR devices to perceive and understand their environments in real-time using a standard RGB camera.

The new technology operates on a ‘see-as-you-go’ principle, continuously estimating the camera’s position and reconstructing the 3D structure of the scene as video is captured. Unlike traditional methods that process images offline, LingBot-Map provides immediate results, enhancing its utility in fast-paced environments.

LingBot-Map has set a new standard in accuracy. For instance, on the Oxford Spires dataset, known for its challenging lighting, the model achieved an Absolute Trajectory Error (ATE) of just 6.42 metres. This figure represents a near 2.8x improvement in trajectory accuracy over previous methods and outperforms offline models like DA3 and VIPE significantly.

LingBot-Map’s Technological Achievements

LingBot-Map excels across other benchmarks such as ETH3D, 7-Scenes, and Tanks and Temples. On the ETH3D benchmark, it achieved a reconstruction F1 score of 98.98, surpassing the second-best method by over 21%. This demonstrates the model’s leadership in both pose estimation and 3D reconstruction quality.

In terms of performance, LingBot-Map supports real-time applications with an inference speed of approximately 20 frames per second. It can handle long video sequences exceeding 10,000 frames without compromising accuracy, which is crucial for continuous spatial awareness applications such as robot navigation and obstacle avoidance.

The core of LingBot-Map’s innovation is its Geometric Context Attention (GCA) mechanism. This feature efficiently manages geometric information across frames, maintaining essential historical context while minimizing redundant computations. Inspired by classic SLAM systems, the architecture leverages a unified model to handle complex tasks typically requiring intricate designs.

Robbyant continues to expand its open-source suite with models like LingBot-Depth, LingBot-VLA, LingBot-World, and LingBot-VA, further enhancing its technology stack for real-time spatial understanding and mapping.

For more information about LingBot-Map, interested parties are encouraged to visit Robbyant’s GitHub page or access their technical report on arXiv. Further applications and details are also available on Robbyant’s website.

Last updated: 29 June 2026, 12:25 pm

In This Article

Page Contents

Ant Group’s Robbyant Unveils LingBot-Map

New Model Enhances Real-Time Spatial Understanding

LingBot-Map’s Technological Achievements

Melbourne’s biggest moments, straight to you.

Melbourne’s biggest moments, straight to you.

A Weekly Dumpling Disco Has Landed at The Espy’s Mya Tiger

Bar Privé, A New 25-Seat Cocktail Bar, Opens Beside Reine & La Rue

Review: Jazz and Cocktails at Mill Place Merchants

The Meat & Wine Co’s African-Sky Inspired Steakhouse Is Coming To Collins Street

The Cookie Box To Give Away 1,050+ Cookies In CBD

Review: Borgo Food & Wine Is Ascot Vale’s Italian Gem

Ant Group’s Robbyant Unveils LingBot-Map

New Model Enhances Real-Time Spatial Understanding

LingBot-Map’s Technological Achievements

Melbourne’s biggest moments, straight to you.

RELATED ARTICLES

Melbourne’s biggest moments, straight to you.

Welcome Back

Join Melbourne Insider

Reset Password

Business Portal