Elon Musk teases next-gen AI chatbot Grok-1.5 with superior coding and math skills

Elon Musk introduced that an upgraded iteration of his synthetic intelligence agency xAI’s chatbot Grok could also be launched subsequent week.

This revelation got here through Musk’s social media post on March 29, following xAI’s announcement of Grok-1.5 in a weblog submit. The improved AI chatbot will initially be accessible to “early testers and current Grok customers on the social media platform.”

Moreover, Musk hinted on the ongoing improvement of Grok 2, which he anticipates will surpass present AI requirements in all elements.

Grok-1.5

Grok-1.5 is a sophisticated model of the Grok-1 AI mannequin and comes with improved reasoning and a context size of 128,000 tokens.

xAI’s evaluation signifies vital enhancements within the efficiency of its superior chatbot, significantly in coding and math-related duties. Nevertheless, it falls brief in comparison with Google’s Gemini Professional 1.5 and OpenAI’s GPT-4.

xAI Grok-1.5 — AI Chatbot’s evaluation. (Supply: xAI)

In keeping with the agency:

“Grok-1.5 achieved a 50.6% rating on the MATH benchmark and a 90% rating on the GSM8K benchmark, two math benchmarks protecting a variety of grade faculty to highschool competitors issues. Moreover, it scored 74.1% on the HumanEval benchmark, which evaluates code technology and problem-solving skills.”

Furthermore, Grok-1.5 can make the most of info from considerably longer paperwork, and the mannequin can deal with longer and extra complicated prompts whereas sustaining its instruction-following functionality as its context window expands.

The agency added:

“Grok-1.5 is constructed on a customized distributed coaching framework based mostly on JAX, Rust, and Kubernetes. This coaching stack permits our staff to prototype concepts and prepare new architectures at scale with minimal effort.”

Grok is Open-source

Earlier this month, xAI took a major step by open-sourcing the bottom code of Grok-1.

This choice arose as a response to a authorized motion initiated by Musk towards OpenAI, the group he as soon as co-founded. Musk alleged that OpenAI has deviated from its unique dedication to prioritize open-source mannequin improvement over shareholder pursuits.

In the meantime, xAI stated the launched code was “the uncooked base mannequin checkpoint from the Grok-1 pre-training section, which concluded in October 2023. Because of this the mannequin is just not fine-tuned for any particular software, akin to dialogue.” It added that the mannequin was licensed beneath Apache License 2.0.