Measurement layouts for capability-oriented AI evaluation
John Burden, José Hernández-Orallo, Marko Tešić, Konstantinos Voudouris
Feb 20, 2024