https://avatars.githubusercontent.com/u/6041070?v=4
Tilburg Algorithm Observatory, Assistant Professor at the Department of Intelligent Systems, Tilburg University.
šĀ Home Page
šĀ Publications
š¦£Ā Social
š¦Ā Social
Data Processing ā Course Page
Language & AI ā Course Page ā24-ā25
Reproducibility & Model Deployment ā Course Page
Iām anĀ Assistant ProfessorĀ at the Department of Intelligent Systems, part of the Research Center for Cognitive Science & Artifical IntelligenceĀ of Tilburg University.
Tilburg Algorithm Observatory | Tilburg University
I work on algorithmic monitoring and auditing as part of theĀ Tilburg Algorithm Observatory, and am interested in the (harmful) effects of intelligent systems on our lives; systems that uncover our personal information, monitor and change our behavior, subtly restrict our exposure to information, and treat us unfairly.
I defended my dissertation āUser-centered Security in Natural Language Processingā in January 2023, supervised byĀ Grzegorz ChrupaÅa,Ā Eric Postma, andĀ Walter Daelemans.
I'm a member of theĀ shool councilĀ (previously List DCA.I., currently List TSHD) and the Data Science and SocietyĀ program committee.
I have a multidisciplinary background in humanities and computer science. My primary area of expertise is algorithm monitoring and auditing; i.e., identifying, recording, and evaluating (harmful) inferences made through Machine Learning (ML, such as Large Language Models). During my PhD, I mainly worked on adversarial attacks on Deep Learning algorithms trained on language data (Natural Language Processing or NLP), with a focus on privacy and security. My work critically analyzes the current, and more distant impact such algorithms have on society. I'm a strong advocate of a user-centered, open-source approach to ML, and the automation of society in general.
Within NLP, I have worked on various topics such as (adversarial) stylometry (or author profiling), cyberbullying/toxicity detection, bias, data augmentation, language generation, machine translation, and more generally scientific development of reproducible research pipelines. Here are a few selected papers to give you an idea:
The Role of Search Engines in the Amplification and Suppression of...
SOBR: A Corpus for Stylometry, Obfuscation, and Bias on Reddit
Adversarial Stylometry in the Wild: Transferable Lexical Substitution Attacks on Author Profiling
Iām currently the course coordinator and designer for both Programming for Data Science (1sem), and Reproducibility & Model Deployment (1sem). Previously, I taughtĀ Data MiningĀ (5y), Data ProcessingĀ (4y), Text Mining (1sem),Ā andĀ Spatiotemporal Data AnalysisĀ (1sem) in context of ourĀ Data ScienceĀ master. I also coordinated and designedĀ Language & AIĀ (4y) for ourĀ joint Data ScienceĀ bachelor withĀ TU/eĀ (JADS), which was given two excellent course evaluation certificates (2023-2025).
I focus on innovating the courses I am involved in, primarily by connecting theory to practice through problem-based learning. I believe this makes the lectures more fun, and easier to conceptualize the utility of the material. I also actively promote the use of open-source and open-science practices to shape studentsā future careers. An example is myĀ EDUiLABĀ project to familiarize students with code versioning, repositories, and build servers usingĀ GitHub, which lay the foundation for Reproducibility & Model Deployment.
Here are the associated course pages (all on Notion):