Research
|
Can Visual Foundation Models Achieve Long-term Point Tracking?
Gorkay Aydemir,
Weidi Xie ,
Fatma Guney
EVAL-FoMo Workshop ECCV, 2024
We assess the geometric awareness of vision foundation models for long-term point tracking. Our results show that Stable Diffusion and DINOv2 excel in zero-shot settings, with DINOv2 matching supervised models after training in lighter setup, highlighting its potential for correspondence learning.
|
|
Self-supervised Object-centric Learning for Videos
Gorkay Aydemir,
Weidi Xie ,
Fatma Guney
NeurIPS, 2023
This paper presents SOLV, the first fully unsupervised technique for segmenting multiple objects in real-world video sequences using an object-centric approach. Through a unique masking strategy and slot merging based on similarity, our method effectively segments varied object classes in YouTube videos.
|
|
ADAPT: Efficient Multi-Agent Trajectory Prediction with Adaptation
Gorkay Aydemir,
Adil Kaan Akan,
Fatma Guney
ICCV, 2023
This paper presents ADAPT, a method for predicting trajectories of all agents in complex traffic scenarios, ensuring both efficiency and accuracy. By utilizing dynamic weight learning and an adaptive head, ADAPT offers superior performance over existing multi-agent methods on Interaction dataset, with reduced computational demands.
|
|