Hi — curious minds of the internet. I am Saber Zerhoudi, currently an Information Retrieval research scientist, based in Passau, Germany. My current research focuses on user simulation, user-centric evaluation of interactive search systems, and agentic AI.
Previously, I earned my Ph.D. at University of Passau, where I contributed to the OpenWebSearch project on large-scale, distributed web crawling and the creation of an open web index, and to the SINIR project on simulation-based evaluation of interactive search behavior in digital libraries.
I'm a user simulation and search researcher. I'm broadly interested in how we can evaluate and design information access systems from a user-centric perspective, especially now that generative models and agents are part of the picture. My work falls into three interconnected themes:
User Simulation: I build platforms and benchmarks for simulating realistic search behavior across classic, conversational, and agentic settings. My recent work centers on the IIRSim ecosystem, which includes SimEval-IR, a unified toolkit and benchmark suite that disentangles behavioral realism from tester reliability when evaluating simulators; IIRSim Studio, a dashboard for configuring and running simulation studies; and UXSim, a hybrid simulator that fuses cognitive layers with LLM-driven personas. Together, these resources provide a reproducible testbed for systematic experimentation across diverse user types, interfaces, and tasks.
Open Web Crawling & Indexing: I help design and operate large-scale crawling pipelines and infrastructure for the European Open Web Index, this involves the Open Web Search Crawler (OWLer) and its ecosystem for distributed, efficient crawling. Furthermore, I contribute to services exposing the Open Web Index for search, analytics, and generative AI applications.
User-Centric RAG & Agents: I study how retrieval-augmented generation and agentic systems can stay trustworthy, governable, and user-aware as content evolves and as the "user" itself becomes an LLM-driven agent. A central recent contribution is NuggetIndex, which stores atomic information units as governed records with evidence links, temporal validity, and lifecycle state, preventing outdated or contested facts from reaching the generator. Complementing this, AgentSim produces verifiable, stepwise reasoning traces over user-defined corpora, and PersonaRAG uses cooperating agents to maintain stable user personas that guide retrieval and generation. Together, these systems push RAG beyond passage retrieval toward maintainable, accountable, and user-aware information access.
Additionally, I founded SearchSim, a research initiative studying how information access changes when autonomous agents search, decide, and act alongside humans.
Program Committees & Reviewing
- The Web Conference (WWW)–PC MemberFull Papers2026
- SIGIR–PC MemberFull Papers2024, 2025, 2026Short Papers2023, 2024, 2026Resource Papers2025, 2026Reproducibility2026Perspectives Papers2026Demo Papers2026Tutorials2026
- WSDM–PC MemberResearch Papers2026Short Papers2026
- CIKM–PC MemberFull Research Papers2021, 2023, 2026Short Papers2026Applied Research2026Resource Papers2025, 2026Demo Papers2025, 2026
- ECIR–PC MemberFull Papers2021, 2026Short Papers2026Resource Papers2026Workshops2026
- ICLR–PC MemberFull Papers2025
- ECML–PC MemberFull Papers2021Research Track2026
- CHIIR–PC MemberFull Papers2026Short Papers2026Resource Papers2026Demo Papers2026
- JCDL–PC MemberDemo Papers2025Full or Short Research Papers2026
- SIGIR-AP–PC MemberResearch Papers2024
(Co-)organizer
At the University of Passau, I am involved in teaching and supporting courses in:
Awarded by the European Conference on Information Retrieval for excellence in reviewing at ECIR 2026
Awarded by ACM SIGIR for excellence in reviewing at SIGIR 2025
"Generative Agents Navigating Digital Libraries." International Conference on Asian Digital Libraries. Singapore: Springer Nature Singapore, 2024.
"The SimIIR 2.0 framework: user types, Markov model-based interaction simulation, and advanced query generation." Proceedings of the 31st ACM CIKM, 2022.