IT Days 2025
Testing and evaluating AI systems: quality in the blur
Artificial intelligence impresses - and disappoints. Sometimes chatbots answer correctly, sometimes incorrectly, but even the most absurd nonsense (as well as the opposite) is presented with seemingly great certainty. And such systems are supposed to control business-critical processes? The central challenge is therefore: how can the quality of AI systems be measured and assured?