Abdulkadir Gokce

I am a second-year Ph.D. student in Computer Science at the Swiss Federal Institute of Technology Lausanne (EPFL), where I am advised by Martin Schrimpf. My research focuses on building artificial models of perception.

Previously, I earned my master's degree at EPFL and my bachelor's in Electrical & Electronics Engineering and Mathematics at Bogazici University. I was fortunate to gain research experience at ETH Zurich, MIT, and Nanyang Technological University (NTU).

Email  /  Google Scholar  /  Github  /  Twitter  /  Bluesky

profile photo

Research

My research investigates the extent to which current deep learning models can capture neural and behavioral signals, and aims to develop new modeling strategies that expand these capabilities. I am particularly focused on building scalable, multimodal models of human perception that integrate diverse neurocognitive datasets.

First-Authored Papers

* indicates equal contribution.

2026_mirage_jpg MIRAGE: Adaptive Multimodal Gating for Whole-Brain fMRI Encoding
Abdulkadir Gokce*, Badr AlKhamissi*, Martin Schrimpf
Preprint, 2026  
[Project page] [arXiv] [Code] [Model]

We introduced MIRAGE, a whole-brain fMRI encoding framework that combines a native multimodal foundation model with adaptive layer gating, achieving state-of-the-art prediction of brain responses to naturalistic movies.

multimodal_brain_scaling_jpg Multimodal Scaling Laws for Task & Data-Optimized Models of Visual Cortex
Abdulkadir Gokce, Yingtian Tang, Martin Schrimpf
ICML, 2026  
[Project page] [OpenReview] [Code] [Artifacts]

We mapped the scaling laws of brain alignment across vision models and neural recordings, showing that bigger pretraining helps only up to a point, after which neural supervision and improved mappings become the main drivers of progress.

scaling_primate_vvs_jpg Scaling Laws for Task-Optimized Models of the Primate Visual Ventral Stream
Abdulkadir Gokce, Martin Schrimpf
ICML, 2025   [Spotlight, Top 3%]
[Project page] [OpenReview] [arXiv] [Code]

We systematically explored scaling laws for primate vision models and discovered that neural alignment stops improving beyond a certain scale, even though behavior keeps aligning better.

2024_babyllama_jpg Dreaming Out Loud: A Self-Synthesis Approach For Training Vision-Language Models With Developmentally Plausible Data
Badr AlKhamissi*, Yingtian Tang*, Abdulkadir Gokce*, Johannes Mehrer, Martin Schrimpf
BabyLM Challenge, at CoNLL 2024  
[OpenReview] [arXiv]

Inspired by human cognitive development, our BabyLLaMA model learns language and vision jointly through a self-synthesis loop, generating its own training data from unlabeled images.

Co-Authored Papers

microstimulation_jpg Model-Guided Microstimulation Steers Primate Visual Behavior
Johannes Mehrer, Ben Lonnqvist, Anna Mitola, Abdulkadir Gokce, Paolo Papale, Martin Schrimpf
ICLR, 2026  
[OpenReview] [arXiv]

We developed a model-guided framework that predicts and controls how microstimulation in primate visual cortex biases perceptual choices, showing strong correlation between model predictions and behavioral changes in macaque monkeys performing visual recognition tasks.

cadabra_jpg Large Language Models Align with the Human Brain during Creative Thinking
Mete Ismayilzada, Simone A. Luchini, Abdulkadir Gokce, Badr AlKhamissi, Antoine Bosselut, Antonio Laverghetta Jr., Lonneke van der Plas, Roger E. Beaty
Preprint, 2026
[arXiv]

We found that LLMs capture aspects of the neural geometry of human creative thought, and that creativity-focused post-training preserves alignment with highly creative neural responses while reasoning-focused training shifts it away.

fragmented_objects_jpg Contour Integration Underlies Human-Like Vision
Ben Lonnqvist, Elsa Scialom*, Abdulkadir Gokce*, Zehra Merchant, Michael Herzog, Martin Schrimpf
ICML, 2025  
[OpenReview] [arXiv]

We find that contour integration, a core feature of human object recognition, emerges in models only at large scales and correlates with improved shape bias.

dynamic_vision_jpg Dynamic Modelling of Visual Perception Is Governed by a Low-Dimensional Task Space
Yingtian Tang, Abdulkadir Gokce, K. J. Al-Karkari, Daniel Yamins, Martin Schrimpf
Preprint, 2025
[bioRxiv] [Code]

We found that the most brain-like video models capture human visual cortex through two core computations — object recognition and appearance-free motion recognition — which together organize visual processing across cortical streams.

endol2h_jpg EndoL2H: Deep Super-Resolution for Capsule Endoscopy
Yasin Almalioglu, Kutsev Bengisu Ozyoruk, Abdulkadir Gokce, Kagan Incetan, Guliz Irem Gokceler, Muhammed Ali Simsek, Kivanc Ararat, Richard J Chen, Nicholas J Durr, Faisal Mahmood, Mehmet Turan
IEEE Transactions on Medical Imaging  
[Paper] [arXiv] [Code]

EndoL2H enhances capsule endoscopy images up to 12x using a spatial attention-guided GAN, outperforming existing methods in perceptual quality and clinical relevance.


Design and source code from Jon Barron's website.