โ† Back to Blog
AIQualJanuary 2, 2026

What happens if you replace traditional open ended answers in a survey with an AI moderated "interview" (AIMI)?

Glaut publised a study by University of Mannheim that looked exactly into that. Commendable, with all this new ResTech coming on the market we need some proper Research-on-Research to understand what works, where, when and why. ๐˜ผ๐™ฃ๐™™ ๐™ฌ๐™๐™š๐™ฃ ๐™ฃ๐™ค๐™ฉ.

Based on this study (and in line with expectations...) using AI Moderated interviews outperform standard open answers due to:

โ€ข ๐—ฅ๐—ถ๐—ฐ๐—ต๐—ฒ๐—ฟ ๐—ฟ๐—ฒ๐˜€๐—ฝ๐—ผ๐—ป๐˜€๐—ฒ๐˜€: AIMI generated longer answers, more unique words, and higher lexical diversity
โ€ข ๐—•๐—ฟ๐—ผ๐—ฎ๐—ฑ๐—ฒ๐—ฟ ๐—ถ๐—ป๐˜€๐—ถ๐—ด๐—ต๐˜๐˜€: Participants mentioned 36% more unique themes
โ€ข ๐—–๐—น๐—ฒ๐—ฎ๐—ป๐—ฒ๐—ฟ ๐—ฑ๐—ฎ๐˜๐—ฎ: The static survey showed a 10% gibberish rate; AIMI had none (see my endnote...)
โ€ข ๐—•๐—ฒ๐˜๐˜๐—ฒ๐—ฟ ๐—ฒ๐˜…๐—ฝ๐—ฒ๐—ฟ๐—ถ๐—ฒ๐—ป๐—ฐ๐—ฒ: Respondents found the AI format more conversational, less repetitive, and more trustworthy.

The paper is an interesting read. One thing that got my attention though in the analysis section is that "๐˜๐˜ฏ ๐˜ต๐˜ฉ๐˜ฆ ๐˜ˆ๐˜-๐˜ฎ๐˜ฐ๐˜ฅ๐˜ฆ๐˜ณ๐˜ข๐˜ต๐˜ฆ๐˜ฅ ๐˜ช๐˜ฏ๐˜ต๐˜ฆ๐˜ณ๐˜ท๐˜ช๐˜ฆ๐˜ธ ๐˜ค๐˜ฐ๐˜ฏ๐˜ฅ๐˜ช๐˜ต๐˜ช๐˜ฐ๐˜ฏ, ๐˜จ๐˜ช๐˜ฃ๐˜ฃ๐˜ฆ๐˜ณ๐˜ช๐˜ด๐˜ฉ ๐˜ฆ๐˜ฏ๐˜ต๐˜ณ๐˜ช๐˜ฆ๐˜ด ๐˜ธ๐˜ฆ๐˜ณ๐˜ฆ ๐˜ฆ๐˜น๐˜ค๐˜ญ๐˜ถ๐˜ฅ๐˜ฆ๐˜ฅ, ๐˜ณ๐˜ฆ๐˜ด๐˜ถ๐˜ญ๐˜ต๐˜ช๐˜ฏ๐˜จ ๐˜ช๐˜ฏ ๐˜ฏ = 100 ๐˜ท๐˜ข๐˜ญ๐˜ช๐˜ฅ ๐˜ณ๐˜ฆ๐˜ด๐˜ฑ๐˜ฐ๐˜ฏ๐˜ด๐˜ฆ๐˜ด. ๐˜๐˜ฏ ๐˜ต๐˜ฉ๐˜ฆ ๐˜ด๐˜ต๐˜ข๐˜ต๐˜ช๐˜ค ๐˜ด๐˜ถ๐˜ณ๐˜ท๐˜ฆ๐˜บ ๐˜ค๐˜ฐ๐˜ฏ๐˜ฅ๐˜ช๐˜ต๐˜ช๐˜ฐ๐˜ฏ, ๐˜จ๐˜ช๐˜ฃ๐˜ฃ๐˜ฆ๐˜ณ๐˜ช๐˜ด๐˜ฉ ๐˜ณ๐˜ฆ๐˜ด๐˜ฑ๐˜ฐ๐˜ฏ๐˜ด๐˜ฆ๐˜ด ๐˜ธ๐˜ฆ๐˜ณ๐˜ฆ ๐˜ณ๐˜ฆ๐˜ต๐˜ข๐˜ช๐˜ฏ๐˜ฆ๐˜ฅ ๐˜ด๐˜ฐ ๐˜ต๐˜ฉ๐˜ข๐˜ต ๐˜ต๐˜ฉ๐˜ฆ ๐˜ด๐˜ข๐˜ฎ๐˜ฑ๐˜ญ๐˜ฆ ๐˜ด๐˜ช๐˜ป๐˜ฆ ๐˜ณ๐˜ฆ๐˜ฎ๐˜ข๐˜ช๐˜ฏ๐˜ฆ๐˜ฅ ๐˜ข๐˜ต ๐˜ฏ = 100".

Unless i interpret this incorrectly, looks like the comparison has been between a pre-cleaned dataset and a non-clean data set which might explain some of the differences...

Link to paper:
https://research.glaut.com/hubfs/Paper/Glaut%20Research/Glaut%20vs.%20Survey%2c%20University%20of%20Manneheim..pdf

Lastly, not mentioned in the paper, but would be interesting to know what the impact of the AIMI was on the overall Length of Interview as well as dropout rates, things that impact the overall economics of adopting a new approach.

Related Articles

Want to discuss further?

I'd love to hear your thoughts on this topic.

Get in Touch