research
LLMs are stuck in a groupthink groove. This startup is trying to get them out.
Builders using LLMs in production need to account for output homogeneity, which can degrade the quality and variety of generated content or decisions.

What happened
A new MIT Tech Review article highlights a recurring issue with large language models (LLMs): their tendency to converge on the same outputs for open-ended prompts, such as always returning the number 7 when asked for a random number between 1 and 10. This 'groupthink' phenomenon limits the diversity of responses and can undermine applications requiring creativity or variation. The article features a startup developing methods to increase output diversity, potentially through fine-tuning or prompt engineering techniques. For developers building AI-powered workflows, this limitation is critical: automated systems that rely on LLM outputs for tasks like content generation or data augmentation may produce repetitive results, reducing effectiveness. The startup's approach, if successful, could enable more robust and varied outputs from LLMs, improving their utility in production environments.
Key takeaways
- According to MIT Tech Review, LLMs like ChatGPT, Claude, and Gemini tend to produce the same answers (e.g., the number 7) when asked for random numbers.
- A startup is working on solutions to introduce more diversity into LLM outputs, potentially using specialized training or inference techniques.
- The article argues that this lack of output diversity is a systemic issue affecting many current models.
- For developers, this phenomenon can lead to predictable and less useful results in automated workflows.
Why it matters
Builders using LLMs in production need to account for output homogeneity, which can degrade the quality and variety of generated content or decisions.
This is an original editorial digest by AI Workflow Pro. Full reporting at the source:
Read the original on MIT Tech ReviewMore AI news
All news →





Join the AI Workflow Pro Community