X’s Grok chatbot will soon get an upgraded model, Grok-1.5

Elon Musk’s AI startup, X.ai, has revealed its latest generative AI model, Grok-1.5. Set to power social network X’s Grok chatbot in the not-so-distant future (“in the coming days,” per a blog post), Grok-1.5 appears to be a measurable upgrade over its predecessor, Grok-1 — at least judging by the published benchmark results and specs.

Grok-1.5 benefits from “improved reasoning,” according to X.ai, particularly where it concerns coding and math-related tasks. The model more than doubled Grok-1’s score on a popular mathematics benchmark, MATH, and scored over 10 percentage points higher on the HumanEval test of programming language generation and problem-solving abilities.

It’s difficult to predict how those results will translate in actual usage. As we recently wrote, commonly-used AI benchmarks, which measure things as esoteric as performance on graduate-level chemistry exam questions, do a poor job of capturing how the average person interacts with models today.

One improvement that should lead to observable gains is the amount of context Grok-1.5 can understand compared to Grok-1.

Grok-1.5 can process contexts of up to 128,000 tokens. Here, “tokens” refers to bits of raw text (e.g., the word “fantastic” split into “fan,” “tas” and “tic”). Context, or context window, refers to input data (in this case, text) that a model considers before generating output (more text). Models with small context windows tend to forget the contents of even very recent conversations, while models with larger contexts avoid this pitfall — and, as an added benefit, better grasp the flow of data they take in.

“[Grok-1.5 can] utilize information from substantially longer documents,” X.ai writes in the blog post. “Furthermore, the model can handle longer and more complex prompts while still maintaining its instruction-following capability as its context window expands.”

What’s historically set X.ai’s Grok models apart from other generative AI models is that they respond to questions about topics that are typically off-limits to other models, like conspiracies and more controversial political ideas. The models also answer questions with “a rebellious streak,” as Musk has described it, and outright rude language if requested to do so.

It’s unclear what changes, if any, Grok-1.5 brings in these areas. X.ai doesn’t allude to this in the blog post.

Grok-1.5 will soon be available to early testers on X, accompanied by “several new features.” Musk has previously hinted at summarizing threads and replies, and suggesting content for posts; we’ll see if those arrive soon enough.

The announcement comes after X.ai open sourced Grok-1, albeit without the code necessary to fine-tune or further train it. More recently, Musk said that more users on X — specifically those paying for X’s $8-per-month Premium plan — would gain access to the Grok chatbot, which was previously only available to X Premium+ customers (who pay $16 per month).

source

Hot this week

Banking as a Service: Meaning, Examples, Benefits and Future

The push for open banking has led to a...

Best fintech blogs and websites

Fintech (financial technology) has been an interesting part of...

What is Fintech?

Fintech: A term used to refer to innovations in...

How to buy shares online

Buying shares online in India has come a long...

Is it worth investing in life insurance over 60?

Is it worth investing in life insurance over 60? As...

TBC Bank Uzbekistan Raises $37 Million in Equity Investment

Subheading TBC Bank Uzbekistan secures $37 million from TBC Group,...

XTransfer and OCBC Form Comprehensive Partnership

Subheading XTransfer and OCBC collaborate to provide innovative cross-border financial...

Brazil Greenlights PayRetailers’ Acquisition of Transfeera

Subheading PayRetailers expands its presence in Brazil by acquiring Transfeera,...

Delio Appoints Felicia Meyerowitz-Singh as New Chair

Subheading Felicia Meyerowitz-Singh brings extensive financial services experience to drive...

Işbank Expands Partnership with Alipay+, Enhancing Cross-Border Payments

Subheading Işbank, Turkey's largest private bank, partners with Alipay+ to...

Former UBS Investment Analyst Unveils Voice-Cloned AI Education Tool

Subheading Geoff Robinson's new app uses his digitally cloned voice...

New Zealand Reduces Merchant Service Fees for Card Payments to Benefit Businesses

Subheading The Commerce Commission's draft decision aims to lower Visa...

Adyen and Affirm Extend Partnership to Canada, Enhancing Payment Options

Subheading Adyen and Affirm expand their collaboration to bring flexible...
Exit mobile version