markotesic.org
markotesic.org
Home
Tutorials
Publications
Work in progress
Projects
Human Behavioral XAI
CV
Contact
3
Robust evaluation of generative AI
A tutorial on evaluating the capabilities of LLMs presented at the European Association for Data Science
Summer School on Generative AI
Marko Tešić
Jun 20, 2024
Slides
Measurement layouts for capability-oriented AI evaluation
A tutorial presented at AAAI-24 on AI evaluation that focuses on estimating capabilities and creating capability profiles of AI systems (e.g., reinforcement learning agents and large language models) using a Bayesian framework.
John Burden
,
José Hernández-Orallo
,
Marko Tešić
,
Konstantinos Voudouris
Feb 20, 2024
Slides
(Un)interesting correlations: What are the chances that correlations lead to causation?
We use directed acyclic graphs (DAGs) to investigate the chances that two variables are causally connected, correlated, and that a covariate is inducing a correlation when controlled for.
Marko Tešić
,
Ulrike Hahn
,
Kirsty Phillips
,
Jason Burton
Confirmation by Explanation: A Bayesian Justification of IBE
A justification of the Inference to the Best Explanation (IBE) by finding conditions under which the best explanation of evidence can provide a confirmatory boost for the hypotheses under consideration.
Marko Tešić
,
Benjamin Eva
,
Stephan Hartmann
PDF
Cite
×