Cerebras Challenges Nvidia with New AI Inference Tool

Highlights

Cerebras offers a new AI inference tool aiming to rival Nvidia.

The tool achieves 1,800 tokens per second for Llama 3.1 8B.

Enterprises must weigh performance, cost, and ease of implementation.

Last updated: 29 August, 2024 - 12:57 pm 12:57 pm

Kaan Demirel 1 year ago

AI hardware startup Cerebras has introduced its latest AI inference solution, targeting enterprises that seek faster and more cost-efficient alternatives to Nvidia‘s GPU offerings. This development marks a significant move in the AI hardware landscape, where performance and cost are critical factors for enterprise adoption. While Nvidia holds a dominant position in the market, Cerebras aims to disrupt the status quo with its advanced technology.

Contents

Market Dynamics Performance Benchmarks

Cerebras’ Inference tool leverages the company’s Wafer-Scale Engine, achieving speeds of 1,800 tokens per second for Llama 3.1 8B and 450 tokens per second for Llama 3.1 70B. These speeds surpass the typical capabilities of Nvidia’s hyperscale cloud products, offering a more cost-effective solution. Gartner analyst Arun Chandrasekaran observes a market shift towards the cost and speed of inferencing, driven by the rise of AI use cases in enterprise settings. This shift provides an opportunity for vendors like Cerebras to compete based on performance.

Market Dynamics

Performance Benchmarks

As Micah Hill-Smith, co-founder and CEO of Artificial Analysis, says, “Cerebras really shined in their AI inference benchmarks.” The company’s tool set new records with over 1,800 output tokens per second on Llama 3.1 8B and more than 446 output tokens per second on Llama 3.1 70B.

Despite these performance benefits, Cerebras faces substantial challenges in gaining market share from Nvidia. David Nicholson, an analyst at Futurum Group, highlights that while Cerebras’ system can deliver high performance at lower costs, the critical question is whether enterprises are willing to adapt their engineering processes to integrate with Cerebras’ technology. Factors such as the scale of operations and available capital significantly influence the choice between Nvidia and Cerebras.

The AI hardware market continues to evolve, with Cerebras also facing competition from specialized cloud providers and major players like Microsoft, AWS, and Google. The balance between performance, cost, and ease of implementation will likely dictate enterprise decisions in adopting new AI inference technologies. The emergence of high-speed AI inference, capable of exceeding 1,000 tokens per second, is likened to the advent of broadband internet, potentially opening new frontiers for AI applications.

Cerebras’ entry into the AI inference market is not without hurdles. Nvidia’s entrenched software and hardware stack presents a significant barrier, and enterprises may be hesitant to switch from established solutions. However, Cerebras’ 16-bit accuracy and faster inference capabilities position it well for future AI applications requiring rapid, real-time operations. As the AI hardware segment expands, comprising about 40% of the total AI hardware market, newcomers must navigate the competitive landscape carefully, considering significant resource requirements.

You can follow us on Youtube, Telegram, Facebook, Linkedin, Twitter ( X ), Mastodon and Bluesky

Share This Article

By Kaan Demirel

Kaan Demirel is a 28-year-old gaming enthusiast residing in Ankara. After graduating from the Statistics department of METU, he completed his master's degree in computer science. Kaan has a particular interest in strategy and simulation games and spends his free time playing competitive games and continuously learning new things about technology and game development. He is also interested in electric vehicles and cyber security. He works as a content editor at NewsLinker, where he leverages his passion for technology and gaming.

Data Storage Giants Surpass Wall Street’s Estimates

Call of Duty Teases Major Conflict in New Trailer

Cerebras Challenges Nvidia with New AI Inference Tool

Highlights

Market Dynamics

Performance Benchmarks

Stay Connected

Latest News

Fortra Confirms Attacks Exploiting GoAnywhere MFT Security Flaw

Tesla Introduces Marine Blue Color at Giga Berlin for Model Y

Tesla Adds New Visualizations to Autopilot and FSD Display

Tesla Tops Q3 EV Sales, Cybertruck Trails Ford F-150 Lightning

Nobel Prize Winners Miss Life-Changing Calls, Nobel Committee Struggles to Connect

ARTIFICAL INTELLIGENCE

ELECTRIC VEHICLE

RESEARCH

Market Dynamics

Performance Benchmarks

You Might Also Like

Stay Connected

Latest News