Whithin AIMI evaluation
5 min read

Do AI-generated follow-up questions elicit more informative responses than static questions alone? Evidence by the University of Nottingham

AUTHOR
Prof. Charlotte Doidge
June 18, 2026
TABLE OF CONTENT
Read the full paper
SUMMARISE WITH AI

Stimulating disclosure in specialised, unfamiliar and sensitive topics: an AIMI application in veterinary medicine

Prof. Charlotte Doidge, Prof. Jasmeet Kaler and Abbie-Louise Smith.

Key findings at a glance

  • +75% longer responses when AI-generated follow-up questions were added.
  • +9% higher lexical diversity*, indicating respondents introduced new vocabulary and ideas rather than repeating themselves.
  • AI follow-ups surfaced new themes around animal wellbeing, nutrition, housing, and environmental conditions that were largely absent from initial responses.
  • The effectiveness of AI moderation depended heavily on question design. Broad questions benefited most, while highly specific questions generated fewer new insights.

A new study from the University of Nottingham School of Veterinary Medicine and Science, led by Prof. Charlotte Doidge, examines an increasingly important question in AI-moderated research: Do AI-generated follow-up questions simply generate more text, or can they uncover genuinely new insights?

The answer appears to be both.

The study design

The research examined public perceptions of dairy calf health in the UK, a topic unfamiliar to most respondents and often involving ethical considerations.

A total of 296 UK participants completed an AI-moderated survey on the Glaut platform. The survey included six researcher-designed open-ended questions, each followed by a single AI-generated follow-up question. Participants could respond via voice or text.

The researchers compared initial responses with the combined responses generated after AI probing using three measures:

  • Verbosity (response length)*
  • Lexical diversity*
  • Keyness analysis*, a linguistic technique used to identify new topics and concepts introduced after AI probing

Key Findings

1.  AI-generated follow-ups increased response depth

The addition of a single AI-generated follow-up question increased average response length from 41 words to 71 words. That represents a 75% increase in verbosity.

This finding aligns with previous AIMI studies showing that dynamic follow-up questions encourage respondents to expand on their answers. What makes this result notable is that it was replicated in a specialized domain where most participants had limited prior knowledge.

2. Longer responses do not automatically mean better responses.

To determine whether respondents were adding meaningful content, the researchers measured lexical diversity, which captures the richness and variety of vocabulary used. The addition of AI-generated follow-up questions increased lexical diversity by 9%.

In practical terms, respondents were introducing new concepts and language rather than restating what they had already said. This mirrors findings from previous AIMI research and provides further evidence that AI moderation can encourage deeper reflection rather than simple repetition.

3. The role of an unfamiliar, sensitive topic: AI follow-ups surfaced new themes

Using keyness analysis, the researchers examined whether AI-generated questions introduced topics largely absent from the original responses.

In several cases, they did.

I. Animal well-being and mental welfare

When respondents were asked what a good life for a dairy calf looks like, AI follow-ups generated significantly more discussion around:

  • Wellbeing
  • Mental welfare
  • Stress
  • Emotional wellbeing

Only 10 respondents mentioned these topics in their original responses. After AI probing, that number increased to 110 respondents.

II. Nutrition and environmental conditions

When discussing who is responsible for ensuring calf welfare, AI-generated follow-ups surfaced substantially more discussion around:

  • Feed and nutrition
  • Water access
  • Housing
  • Shelter
  • Environmental conditions

Nutrition-related themes increased from 48 respondents to 150 respondents, while environmental themes increased from 6 respondents to 81 respondents.

III. Comparative thinking

AI follow-ups also prompted respondents to think beyond the UK context. References to other countries increased as participants began comparing welfare standards across different regions. Taken together, these findings suggest that AI-generated follow-up questions can help respondents explore dimensions of a topic that were not immediately salient in their initial answers.

4. AI-generated follow-up questions did not always generate new insights.

When respondents were asked highly specific questions or had already explained their reasoning in the initial response, AI probing often reinforced existing answers rather than uncovering new themes. For example, questions about specific calf management practices elicited very little thematic expansion, as respondents had already articulated their reasoning in the original question. This finding reinforces a broader lesson emerging across AI-moderated research: AI moderation is only as effective as the combination of researcher-authored questions and AI instructions.

What researchers can learn from this

Based on their findings, the authors suggest that AI-generated follow-up questions work best when:

  • The initial question is broad or exploratory.
  • The topic is unfamiliar to respondents.
  • The AI is instructed to explore new areas rather than repeat existing themes.
  • Respondents have not yet explained their reasoning.

AI-generated follow-ups are less informative when:

  • The initial question is already highly specific.
  • The AI is instructed to ask questions that mirror the original question.
  • Respondents have already fully explained their reasoning.

Why this study matters

Previous AIMI studies have demonstrated that AI moderation can generate longer responses and richer language. This study extends that evidence by showing that AI-generated follow-up questions can also uncover new themes, particularly when respondents are discussing unfamiliar topics and when the research design gives the AI room to explore new directions.

The findings suggest that the future of AI-moderated research is not simply about replacing static questionnaires with dynamic conversations. It is about designing the right combination of researcher-authored questions and AI-generated probes to help respondents articulate ideas they might not otherwise express.

As the authors conclude: the success of AI moderation depends not only on the technology itself, but on how researchers design the conversation around it”.

Notes:

  • Verbosity: Analysis to understand how many extra words the response to the AI-followup question generates compared with the static question.
  • Lexical diversity: Analysis to understand the ratio of different words (types) to totalwords (tokens) the response to the AI-follow up question generates compared with thestatic question.
  • Keyness: Analysis to identify words that are disproportionately frequent in theresponses to the AI-generated questions compared to the responses to the staticquestions (Schweinberger, 2026).