Saber Zerhoudi - Information Retrieval Research Scientist

Hi — curious minds of the internet. I am Saber Zerhoudi, currently an Information Retrieval research scientist, based in Passau, Germany. My current research focuses on user simulation, user-centric evaluation of interactive search systems, and agentic AI.

Previously, I earned my Ph.D. at University of Passau, where I contributed to the OpenWebSearch project on large-scale, distributed web crawling and the creation of an open web index, and to the SINIR project on simulation-based evaluation of interactive search behavior in digital libraries.

I'm a user simulation and search researcher. I'm broadly interested in how we can evaluate and design information access systems from a user-centric perspective, especially now that generative models and agents are part of the picture. My work falls into three interconnected themes:

User Simulation: I build platforms and benchmarks for simulating realistic search behavior across classic, conversational, and agentic settings. My recent work centers on the IIRSim ecosystem, which includes SimEval-IR, a unified toolkit and benchmark suite that disentangles behavioral realism from tester reliability when evaluating simulators; IIRSim Studio, a dashboard for configuring and running simulation studies; and UXSim, a hybrid simulator that fuses cognitive layers with LLM-driven personas. Together, these resources provide a reproducible testbed for systematic experimentation across diverse user types, interfaces, and tasks.
Open Web Crawling & Indexing: I help design and operate large-scale crawling pipelines and infrastructure for the European Open Web Index, this involves the Open Web Search Crawler (OWLer) and its ecosystem for distributed, efficient crawling. Furthermore, I contribute to services exposing the Open Web Index for search, analytics, and generative AI applications.
User-Centric RAG & Agents: I study how retrieval-augmented generation and agentic systems can stay trustworthy, governable, and user-aware as content evolves and as the "user" itself becomes an LLM-driven agent. A central recent contribution is NuggetIndex, which stores atomic information units as governed records with evidence links, temporal validity, and lifecycle state, preventing outdated or contested facts from reaching the generator. Complementing this, AgentSim produces verifiable, stepwise reasoning traces over user-defined corpora, and PersonaRAG uses cooperating agents to maintain stable user personas that guide retrieval and generation. Together, these systems push RAG beyond passage retrieval toward maintainable, accountable, and user-aware information access.

Additionally, I founded SearchSim, a research initiative studying how information access changes when autonomous agents search, decide, and act alongside humans.

Program Committees & Reviewing

The Web Conference (WWW)–PC Member
Full Papers2026
SIGIR–PC Member
Full Papers2024, 2025, 2026Short Papers2023, 2024, 2026Resource Papers2025, 2026Reproducibility2026Perspectives Papers2026Demo Papers2026Tutorials2026
WSDM–PC Member
Research Papers2026Short Papers2026
CIKM–PC Member
Full Research Papers2021, 2023, 2026Short Papers2026Applied Research2026Resource Papers2025, 2026Demo Papers2025, 2026
ECIR–PC Member
Full Papers2021, 2026Short Papers2026Resource Papers2026Workshops2026
ICLR–PC Member
Full Papers2025
ECML–PC Member
Full Papers2021Research Track2026
CHIIR–PC Member
Full Papers2026Short Papers2026Resource Papers2026Demo Papers2026
JCDL–PC Member
Demo Papers2025Full or Short Research Papers2026
SIGIR-AP–PC Member
Research Papers2024

(Co-)organizer

Simulation of Interactive Information Retrieval: A Guided Tour–Tutorial @ CHIIR 2026
Search Futures–Workshop @ ECIR 2026
WOWS – Workshop on Open Web Search–Workshop @ ECIR 2024-2026
Theory and Toolkits for User Simulation in the Era of Generative AI–Tutorial @ SIGIR 2025

Sim4IA Workshop @ SIGIR 2024-2025–Cognitive-aware user simulation.

Slides '24 Slides '25

IR-RAG Workshop @ SIGIR 2024–Personalized and persona-aware RAG systems.

Slides

Search Futures Workshop @ ECIR 2025–User simulation and generative agents.

Slides

Open Search Symposium 2024-2025–Open web search and crawling.

Slides '24 Slides '25

At the University of Passau, I am involved in teaching and supporting courses in:

Data Science5779V/5779UE

Web and Data Engineering5430V/5430UE

Machine Learning5729OS

Advanced Topics in Data Science5945UE

ECIR 2026Excellence in Reviewing

Awarded by the European Conference on Information Retrieval for excellence in reviewing at ECIR 2026

SIGIR 2025Excellence in Reviewing

Awarded by ACM SIGIR for excellence in reviewing at SIGIR 2025

ICADL 2024Best Student Full Paper

"Generative Agents Navigating Digital Libraries." International Conference on Asian Digital Libraries. Singapore: Springer Nature Singapore, 2024.

CIKM 2022Best Resource Paper Award

"The SimIIR 2.0 framework: user types, Markov model-based interaction simulation, and advanced query generation." Proceedings of the 31st ACM CIKM, 2022.