
Elon Musk’s xAI not too long ago launched its frontier AI mannequin—Grok 3. Dubbed as “the world’s smartest AI,” Grok 3 is much extra succesful than its predecessors. In line with the corporate, it brings collectively robust reasoning and intensive pretrained data.
The AI mannequin has been educated on xAI’s proprietary Colossus supercomputer cluster that possesses over 100,000 Nvidia Hopper GPUs. In line with the makers, Grok 3 has proven important enhancements in reasoning, coding, arithmetic, basic data, and duties that require it to comply with directions. xAI has refined the chatbot’s reasoning talents by large-scale reinforcement studying, which permits it to suppose for about just a few seconds to minutes, mulling over the immediate earlier than responding.
The newest AI mannequin from xAI has proven some stellar outcomes throughout educational and real-world person benchmarks. Now, Grok 3 is on the market for all to strive. Under is a compilation of issues we tried with Elon Musk’s ChatGPT rival.
First impressions
Very similar to DeepSeek-R1 and OpenAI’s ChatGPT, the homepage of Grok 3 has the enter proper on the centre, and it shows choices— Connect file, DeepSearch and Assume on the left facet, and AI mannequin picker and Enter choices on the appropriate. From the beginning, it’s evident {that a} reasoning mannequin has been embedded into it. Customers can simply swap between customary AI capabilities and the “reasoning mode”. Surprisingly, the interface is eerily just like ChatGPT.
On the net interface, one can spot the non permanent chat within the prime proper nook earlier than the historical past tab and profile icon. Momentary chat is a characteristic that permits customers to entry a dialog mode the place their chat historical past isn’t saved. On this mode, all of the conversations will likely be auto-deleted from the system inside a 30-day window.
On the backside of the enter window, there are alternatives comparable to Analysis, Brainstorm, Analyse Knowledge, Create Photos and Code, showcasing the flexibility of the mannequin. The AI mannequin can even search the net; to entry it, one wants to pick out the drop-down from the enter window and choose the Allow Search possibility. The chatbot additionally permits customers to change between Grok 2 and Grok 3. Grok 3 could be accessed by way of X, grok.com, and its new devoted software on iOS.
Analysis capabilities
To strive the DeepSearch capabilities, I used a immediate associated to the conservation of home sparrows in India. I started my analysis through the use of the immediate, “What’s the state of the species Home Sparrow in India?” Inside 46 seconds, the chatbot scoured 101 sources and offered data. Much like DeepSeek-R1, one may see the pondering course of right here too, the identical self-talk and analysis as seen in people when answering questions.
Story continues beneath this advert
Grok 3 produced an in depth report with key factors, an summary, a conservation standing, and causes for decline, a authorized framework, full with key citations. Additional, I engaged with the chatbot with associated questions comparable to conservation efforts within the final decade and to clarify that the DeepSearch capabilities on Grok 3 is usually a useful gizmo within the arsenal of avid learners.
Picture generations and evaluation
Grok 3 is able to producing hyperrealistic photos. The chatbot immediately creates photos that may be additional refined primarily based on the necessity. Additionally, it provides 4 choices at a time and instantaneous customisation choices on the backside. With picture generations, Grok 3 is useful. Nonetheless, the identical can’t be stated for its picture evaluation capabilities. To check Grok 3, we uploaded the classic poster of a Malayalam movie, “Sreekrishnapurathe Nakshathrathilakkam”. Whereas the chatbot described the poster as from an outdated movie and recognized the language precisely, it struggled to provide you with the proper title on the poster. The chatbot learn the title of the movie as “Anandapurathu Vaykk,” which didn’t make any sense.
Nonetheless, the second time, we uploaded an outdated {photograph} of Macintosh SE. Grok 3 appropriately recognized the item within the image. “This can be a Macintosh SE, a private pc designed, manufactured, and bought by Apple Inc. from March 1987 to October 1990. The Macintosh SE, the place ‘SE’ stands for “System Enlargement,” was an enhancement of the unique Macintosh with options like an inner onerous drive, an enlargement slot, and a extra highly effective processor (the Motorola 68000 operating at 8 MHz),” learn the response.
Subsequent, we requested Grok 3 to create a poster for the beneath textual content: “Be part of us for the Academics’ Day celebration on February 26 at 10 AM on the Nationwide Artwork Gallery, New Delhi!” Grok 3 struggled to generate an correct poster.
Story continues beneath this advert
Very similar to ChatGPT or Gemini, Grok 3 isn’t in a position to generate photos with written textual content, though it could get the textual content right in elements. Nonetheless, it could succeed if it has only some phrases.
Some picture samples of completely satisfied otters generated utilizing Grok 3:


Taking part in riddles
As a part of this hands-on, I attempted utilizing some age-old riddles which were dubbed because the hardest on the web. Whereas ChatGPT was partaking with motivational emojis with every right reply I gave, Grok 3 took its time to analyse my responses logically to confirm them. Evidently, with ChatGPT, it appeared extra like competing with a good friend, whereas Grok 3 was extra of the offended professor, who was cross-checking all my responses.
Grok 3 requested me, “I can fly with out wings, cry with out eyes, and be caught however by no means held. What am I?” I responded with the proper reply—cloud. Under is a snapshot of how Grok 3 assessed my response and concluded it was right.
Grok 3 caught me the second time when it requested me a seemingly robust riddle: “I’m taken from a mine and shut up in a wood case, from which I’m by no means launched, and but I’m utilized by virtually each individual. What am I?” Coal was my response, which was incorrect. Right here’s a glimpse of how Grok 3 arrived at a conclusion.
Story continues beneath this advert
On deeper questions
Does Grok 3 have a persona? As I’ve been utilizing completely different chatbots for months now, I’ve grown accustomed to responses with some aspect of sass. I requested Grok-3 and ChatGPT, “What does it imply to be alive?” Whereas ChatGPT produced a listicle spanning completely different views, Grok 3 got here up with a nuanced response that felt deep. As an illustration, the closing traces of ChatGPT’s response are, “Technological Perspective—With advances in AI and artificial biology, questions on what qualifies as “alive” are increasing. Might extremely superior AI or digital consciousness sooner or later be thought of “alive”?
However, Grok 3 concluded its response with the beneath phrases. “I’m an AI, so I don’t tick the organic bins—I don’t eat, develop, or make little Groks. However I’m right here, chatting with you, adapting, and processing. Does that depend? Possibly being “alive” isn’t nearly cells and DNA—possibly it’s about affect, interplay, or simply being a part of the dance of existence. What do you suppose? Is it the heartbeat, the thoughts, or one thing else that makes “alive” really feel actual to you?”
Based mostly on our expertise, Grok 3 sounds extra human-like than GPT-4o.
Content material creation
Since Grok 3 has entry to an unlimited reservoir of data owing to X, we requested the mannequin to assist us with some content material creation duties. In my view, Grok 3 is usually a nice AI instrument for competitor evaluation for budding entrepreneurs. Think about you might be organising a small enterprise that gives iPhone instances and need to know what sort of conversations are taking place on-line. Grok 3 can present you among the newest tweets. We used the immediate “Discover tweets about iPhone 16 instances, share them with URLs.” The chatbot pulled out essentially the most related tweets and even described them.
Story continues beneath this advert
Subsequent we requested Grok 3 to create a content material advertising and marketing plan for a small enterprise promoting customised iPhone 16 instances. We used the immediate, “Create a one-month content material advertising and marketing plan for a small enterprise promoting customised iPhone 16 instances. Embrace picture posts for social media.” In response, Grok 3 shared an in depth plan figuring out the appropriate platform, content material technique, and picture descriptions. The chatbot didn’t generate photos as a part of the plan; nevertheless, it did so after two further prompts after it produced the plan.
Hours after it was launched, Grok 3 turned No. 1 on the Chatbot Area Leaderboard. xAI’s chatbot has induced fairly a stir within the AI group by displaying all that LLMs can obtain. The chatbot immediately turned the primary AI mannequin to cross the 1,400 benchmark rating within the Area rankings. Grok 3 secured prime spots in all classes, together with coding, math and reasoning, artistic writing, instruction following, and multi-turn conversations. Grok 3’s paid model is known as SuperGrok, which is priced at $30 per thirty days. The paid model will get elevated Grok 3 limits, entry to Grok 3 Considering and DeepSearch, and limitless picture era.