 |
Robust Bird’s Eye View Segmentation by Adapting DINOv2
Merve R. Barin ,
Gorkay Aydemir,
Fatma Guney
VCAD Workshop ECCV, 2024
This paper introduces an approach to enhance Bird’s Eye View (BEV) perception in autonomous driving by adapting the DINOv2 with Low Rank Adaptation (LoRA). The method improves robustness to environmental challenges like brightness changes, adverse weather, and camera failures.
|
 |
Can Visual Foundation Models Achieve Long-term Point Tracking?
Gorkay Aydemir,
Weidi Xie ,
Fatma Guney
EVAL-FoMo Workshop ECCV, 2024
We assess the geometric awareness of vision foundation models for long-term point tracking. Our results show that Stable Diffusion and DINOv2 excel in zero-shot settings, with DINOv2 matching supervised models after training in lighter setup, highlighting its potential for correspondence learning.
|
 |
Self-supervised Object-centric Learning for Videos
Gorkay Aydemir,
Weidi Xie ,
Fatma Guney
NeurIPS, 2023
This paper presents SOLV, the first fully unsupervised technique for segmenting multiple objects in real-world video sequences using an object-centric approach. Through a unique masking strategy and slot merging based on similarity, our method effectively segments varied object classes in YouTube videos.
|
 |
ADAPT: Efficient Multi-Agent Trajectory Prediction with Adaptation
Gorkay Aydemir,
Adil Kaan Akan,
Fatma Guney
ICCV, 2023
This paper presents ADAPT, a method for predicting trajectories of all agents in complex traffic scenarios, ensuring both efficiency and accuracy. By utilizing dynamic weight learning and an adaptive head, ADAPT offers superior performance over existing multi-agent methods on Interaction dataset, with reduced computational demands.
|
|