AI Engineer -- Robotics, Autonomous Systems & Multimodal AI
AI Engineer with 8+ years of post-PhD experience building production AI models and training pipelines for multimodal and autonomous systems. Expertise in vision-language models (VLMs), vision-language-action (VLA) models, 3D scene reconstruction, and large-scale ML infrastructure.
I currently work at Huawei Suomi Finland Research Center as a Senior Researcher in Multimodal AI.
My research spans:
Video understanding, generation, and long video analysis with memory models
VLMs, VLA models, and multimodal perception systems
4D LiDAR generation, sensor fusion, and 3D reconstruction
Building production-scale video generation models and co-developing ReWind (CVPR 2025) for long video understanding.
Deep learning pipelines for kinematic time series analysis and trajectory reconstruction using TCNs.
UNet-based architectures for visual perception and 3D modeling of ancient maps.
Production DNN pipelines for named entity extraction from historical documents (EURHISFIRM project).
Co-developed ReWind, a large language model for long video understanding with instructed learnable memory.
Apparatus to Generate Media Contents Conditioned to User Preferences Settings.
Interested in collaborating or learning more about my work?
Contact Me