Great news for xAI: Grok is now pretty good at answering questions about Baldur’s Gate
Different AI labs have different priorities. OpenAI has traditionally focused on consumer users, for instance, while its rival Anthropic tends to target enterprises. Elon Musk’s xAI, we discovered recently, has been placing particular emphasis on video-game walkthroughs.
Of course, you can imagine the frustration of any respected and experienced engineer who shows up to work thinking he’ll be tackling fundamental problems of knowledge and machine intelligence, only to be sidetracked into helping a 54-year-old man beat his video game. But the anecdote raises an even more pressing question: Did Musk end up getting the gaming skills he wanted? To answer that question, our resident RPG enthusiast Ram Iyer put together a set of five general questions about Baldur’s Gate, which we ran against xAI and the three major models in a kind of quasi-benchmark that I’ve decided to call “BaldurBench. ” In the interest of journalistic transparency, I’ve made all the chat transcripts public, so you can see them here: Grok, ChatGPT, Claude, and Gemini. First, the good news: Grok actually gives pretty good information. Its responses were a bit dense with gamer jargon — “save-scumming” instead of saving and “DPS” instead of damage — but the answers were both useful and well-informed, provided you knew what it was talking about.
Grok also really loves tables and theorycraft, which is about what you would expect.
ChatGPT prefers bulleted lists and sentence fragments, while Gemini loves to bold important words.
When I asked about good party compositions, it closed the guidance by saying, “Don’t stress too much and just play what sounds fun to you
Logic Quality Breakdown:
- Updated_At:
- Truth_Blocks:
- Analysis_Method: