PhD in Computer Science with 8+ years post-PhD experience building production AI models and training pipelines for multimodal and autonomous systems. Expertise in VLMs, video understanding, video generation, and robotics.
Years Experience
Tier-1 Conference
Publications
I am an AI Engineer with a PhD in Computer Science, specializing in building production AI models and training pipelines for multimodal and autonomous systems.
My experience spans VLMs, video understanding, video generation, and robotics.
I currently work at Huawei Suomi Finland Research Center as a Senior Researcher in Multimodal AI, where I build and ship production-scale video generation models. I also co-developed ReWind, a model for long video understanding (CVPR 2025).
Senior Researcher - Multimodal AI
Huawei Suomi Finland Research Center
2023 - Present | Helsinki, Finland
Long video analysis with instructed learnable memory (ReWind - CVPR 2025)
Production-scale video generation using VLMs and diffusion architectures
LLaVA, CLIP, QwenVL, LayoutVLM for multimodal perception
4D LiDAR generation, sensor fusion, and 3D scene reconstruction
Advanced generative AI for creating and manipulating visual content
Time series analysis for motion and trajectory reconstruction
Interested in collaboration or want to learn more about my research?