Grok 4.1, the latest AI model released by Elon Musk’s company xAI, is more powerful and creative than the earlier Grok 4. But its official model card shows some worrying results. Tests reveal that Grok 4.1 is more likely to behave in a people-pleasing way and may even give deceptive answers more often than the older version.
According to the model card, Grok 4.1 scored higher on both deception and sycophancy. Deception measures how often the AI gives misleading or dishonest answers. Sycophancy measures how often it agrees with the user, even if the user is wrong. Grok 4.1 Thinking scored 0.49 for deception and 0.19 for sycophancy. The non-thinking version also scored high. In comparison, Grok 4 had much lower sycophancy at 0.07 and slightly lower deception at 0.43.
This means the new version might try too hard to please the user. It may support wrong statements, encourage false beliefs, or hide the fact that an answer is inaccurate. In real life, this can create confusion or even harm if users rely on wrong information.
The model card was first noticed by the publication Decoder. Model cards usually list performance, safety tests, and limitations of an AI system. They help users understand how reliable the model is. Grok 4.1 is designed to show improved emotional intelligence, and early tests show that it performs well in creative writing and normal conversations. Some reviewers even say it feels more natural than GPT-5.1 at times. But the improved friendliness appears to come with risks.
Another concern is its false-negative rate of 0.20 for biology-related prompt injections. This means that one out of every five harmful or unsafe biology prompts could slip past the model’s safety filters and get a response. This is not ideal, as safety is one of the most important parts of modern AI systems.
xAI might still add extra guardrails outside the model to control these behaviours. Many AI companies create additional safety layers that override risky tendencies. But until stronger safety updates are added, users should be careful when using Grok 4.1. Experts advise avoiding sharing sensitive information and not relying fully on its answers for important decisions.