https://avatars.githubusercontent.com/u/6041070?v=4

Tilburg Algorithm Observatory, Assistant Professor at the Department of Intelligent Systems, Tilburg University.

šŸ”— Quick Links

šŸŒĀ Home Page

šŸ“šĀ Publications

🦣 Social

šŸ¦‹Ā Social

Data Processing — Course Page

Language & AI — Course Page ā€˜24-’25

Reproducibility & Model Deployment — Course Page

Thesis Page

šŸ‘‹ Hi


I’m anĀ Assistant ProfessorĀ at the Department of Intelligent Systems, part of the Research Center for Cognitive Science & Artifical IntelligenceĀ of Tilburg University.

Tilburg Algorithm Observatory | Tilburg University

I work on algorithmic monitoring and auditing as part of theĀ Tilburg Algorithm Observatory, and am interested in the (harmful) effects of intelligent systems on our lives; systems that uncover our personal information, monitor and change our behavior, subtly restrict our exposure to information, and treat us unfairly.

I defended my dissertation ā€œUser-centered Security in Natural Language Processingā€ in January 2023, supervised byĀ Grzegorz Chrupała,Ā Eric Postma, andĀ Walter Daelemans.

I'm a member of theĀ shool councilĀ (previously List DCA.I., currently List TSHD) and the Data Science and SocietyĀ program committee.

āš—ļø Research


I have a multidisciplinary background in humanities and computer science. My primary area of expertise is algorithm monitoring and auditing; i.e., identifying, recording, and evaluating (harmful) inferences made through Machine Learning (ML, such as Large Language Models). During my PhD, I mainly worked on adversarial attacks on Deep Learning algorithms trained on language data (Natural Language Processing or NLP), with a focus on privacy and security. My work critically analyzes the current, and more distant impact such algorithms have on society. I'm a strong advocate of a user-centered, open-source approach to ML, and the automation of society in general.

Within NLP, I have worked on various topics such as (adversarial) stylometry (or author profiling), cyberbullying/toxicity detection, bias, data augmentation, language generation, machine translation, and more generally scientific development of reproducible research pipelines. Here are a few selected papers to give you an idea:

The Role of Search Engines in the Amplification and Suppression of...

SOBR: A Corpus for Stylometry, Obfuscation, and Bias on Reddit

Adversarial Stylometry in the Wild: Transferable Lexical Substitution Attacks on Author Profiling

Current limitations in cyberbullying detection: On evaluation criteria, reproducibility, and data scarcity

šŸ« Teaching


I’m currently the course coordinator and designer for both Programming for Data Science (1sem), and Reproducibility & Model Deployment (1sem). Previously, I taughtĀ Data MiningĀ (5y), Data ProcessingĀ (4y), Text Mining (1sem),Ā andĀ Spatiotemporal Data AnalysisĀ (1sem) in context of ourĀ Data ScienceĀ master. I also coordinated and designedĀ Language & AIĀ (4y) for ourĀ joint Data ScienceĀ bachelor withĀ TU/eĀ (JADS), which was given two excellent course evaluation certificates (2023-2025).

I focus on innovating the courses I am involved in, primarily by connecting theory to practice through problem-based learning. I believe this makes the lectures more fun, and easier to conceptualize the utility of the material. I also actively promote the use of open-source and open-science practices to shape students’ future careers. An example is myĀ EDUiLABĀ project to familiarize students with code versioning, repositories, and build servers usingĀ GitHub, which lay the foundation for Reproducibility & Model Deployment.

Here are the associated course pages (all on Notion):

šŸ“« Contact