Grok 3 Tested: Does It Live Up to the Hype?

BY q8bhj
July 7, 2025

Overview of Grok 3

Grok 3 was recently released with significant fanfare and promoted as the leading large language model. Initially available only through a costly subscription on Twitter, it was unexpectedly opened to the general public, including advanced versions such as the Deep Research and Thinking models.

Testing Grok 3

Initial Tests

  • Sanity Test: The model answered basic arithmetic and logic questions correctly, such as the number of Alice's sisters (4) given specific family details.
  • Hourglass Problem: Grok 2 previously failed this test, but Grok 3 provided a logically correct, albeit slightly inefficient, solution for measuring 15 minutes with two hourglasses.
  • Unsolved Hourglass Variation: Grok 3 attempted an unsolved variation of the hourglass problem but, like the other models tested, did not find a correct answer.
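
To make the hourglass test concrete, here is a minimal sketch of how such a puzzle can be checked by brute force. The note does not state the hourglass sizes used in the video, so the 7-minute and 11-minute glasses below are an assumption taken from the common textbook version of the puzzle, and the function name `measure` is hypothetical. The search only considers flipping glasses at time zero or at moments when a glass runs out.

```python
from collections import deque
from itertools import product


def measure(size_a, size_b, target):
    """Breadth-first search for a flip plan that ends exactly at `target` minutes.

    Returns a list of (minute, flip_a, flip_b) steps, or None if no plan exists.
    Sizes are ASSUMED inputs; the source note does not give them.
    """
    start = (0, 0, 0)  # minutes of sand in top of A, top of B, elapsed minutes
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        (top_a, top_b, elapsed), path = queue.popleft()
        if elapsed == target:
            return path
        # Try every combination of flipping or not flipping each glass.
        for flip_a, flip_b in product((False, True), repeat=2):
            a = size_a - top_a if flip_a else top_a
            b = size_b - top_b if flip_b else top_b
            running = [t for t in (a, b) if t > 0]
            if not running:
                continue  # no sand is falling, so no time can be measured
            dt = min(running)  # wait until the next glass runs out
            state = (max(a - dt, 0), max(b - dt, 0), elapsed + dt)
            if state[2] > target or state in seen:
                continue
            seen.add(state)
            queue.append((state, path + [(elapsed, flip_a, flip_b)]))
    return None


plan = measure(7, 11, 15)
for minute, flip_a, flip_b in plan or []:
    print(f"at {minute} min: flip A={flip_a}, flip B={flip_b}")
```

Because the search is breadth-first, the plan it returns uses the fewest flip moments, which is one way to judge whether a model's answer is "slightly inefficient". Under the assumed sizes, the well-known efficient plan is to start both glasses, then flip the 7-minute glass when it empties and again when the 11-minute glass empties.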

Language and Coding Challenges

  • Word Reversal Task: Grok 3 successfully completed a language task that involved selecting an unusual word, finding a synonym, and reversing it, a task Grok 2 had previously failed.
  • Complex Programming Task: Grok 3 attempted to write a complex chess engine in C. The initial code had compiler errors, which were fixed after some adjustments, but the engine still underperformed when tested in a chess environment.

Advanced Model Uses

  • Deep Research Task: Grok 3 produced a detailed report on Android tablets using its deep internet search capabilities, including a comprehensive comparison of specifications and prices.
  • Best F1 Driver Report: Grok 3 used deep research to analyze and rank F1 drivers, concluding that Lewis Hamilton is the best and supporting this with detailed statistics and comparisons with other top drivers.

Conclusion

Grok 3 performs better than Grok 2, and the addition of specialized models extends its capabilities. It still struggles with certain complex problems, but its ability to combine deep research with logical reasoning shows promise.

Availability: It is unclear whether Grok 3 will remain free or return to a subscription model.


Video by Gary Sims, host of "Gary Explains". For more content like this, subscribe and stay informed on advancements in language models.