Technology NewsTechnology NewsTechnology News
  • Computing
  • AI
  • Robotics
  • Cybersecurity
  • Electric Vehicle
  • Wearables
  • Gaming
  • Space
Reading: Cerebras Challenges Nvidia with New AI Inference Tool
Share
Font ResizerAa
Technology NewsTechnology News
Font ResizerAa
Search
  • Computing
  • AI
  • Robotics
  • Cybersecurity
  • Electric Vehicle
  • Wearables
  • Gaming
  • Space
Follow US
  • Cookie Policy (EU)
  • Contact
  • About
© 2025 NEWSLINKER - Powered by LK SOFTWARE
AI

Cerebras Challenges Nvidia with New AI Inference Tool

Highlights

  • Cerebras offers a new AI inference tool aiming to rival Nvidia.

  • The tool achieves 1,800 tokens per second for Llama 3.1 8B.

  • Enterprises must weigh performance, cost, and ease of implementation.

Kaan Demirel
Last updated: 29 August, 2024 - 12:57 pm 12:57 pm
Kaan Demirel 8 months ago
Share
SHARE

AI hardware startup Cerebras has introduced its latest AI inference solution, targeting enterprises that seek faster and more cost-efficient alternatives to Nvidia‘s GPU offerings. This development marks a significant move in the AI hardware landscape, where performance and cost are critical factors for enterprise adoption. While Nvidia holds a dominant position in the market, Cerebras aims to disrupt the status quo with its advanced technology.

Contents
Market DynamicsPerformance Benchmarks

Cerebras’ Inference tool leverages the company’s Wafer-Scale Engine, achieving speeds of 1,800 tokens per second for Llama 3.1 8B and 450 tokens per second for Llama 3.1 70B. These speeds surpass the typical capabilities of Nvidia’s hyperscale cloud products, offering a more cost-effective solution. Gartner analyst Arun Chandrasekaran observes a market shift towards the cost and speed of inferencing, driven by the rise of AI use cases in enterprise settings. This shift provides an opportunity for vendors like Cerebras to compete based on performance.

Market Dynamics

Performance Benchmarks

As Micah Hill-Smith, co-founder and CEO of Artificial Analysis, says, “Cerebras really shined in their AI inference benchmarks.” The company’s tool set new records with over 1,800 output tokens per second on Llama 3.1 8B and more than 446 output tokens per second on Llama 3.1 70B.

Despite these performance benefits, Cerebras faces substantial challenges in gaining market share from Nvidia. David Nicholson, an analyst at Futurum Group, highlights that while Cerebras’ system can deliver high performance at lower costs, the critical question is whether enterprises are willing to adapt their engineering processes to integrate with Cerebras’ technology. Factors such as the scale of operations and available capital significantly influence the choice between Nvidia and Cerebras.

The AI hardware market continues to evolve, with Cerebras also facing competition from specialized cloud providers and major players like Microsoft, AWS, and Google. The balance between performance, cost, and ease of implementation will likely dictate enterprise decisions in adopting new AI inference technologies. The emergence of high-speed AI inference, capable of exceeding 1,000 tokens per second, is likened to the advent of broadband internet, potentially opening new frontiers for AI applications.

Cerebras’ entry into the AI inference market is not without hurdles. Nvidia’s entrenched software and hardware stack presents a significant barrier, and enterprises may be hesitant to switch from established solutions. However, Cerebras’ 16-bit accuracy and faster inference capabilities position it well for future AI applications requiring rapid, real-time operations. As the AI hardware segment expands, comprising about 40% of the total AI hardware market, newcomers must navigate the competitive landscape carefully, considering significant resource requirements.

You can follow us on Youtube, Telegram, Facebook, Linkedin, Twitter ( X ), Mastodon and Bluesky

You Might Also Like

Trump Alters AI Chip Export Strategy, Reversing Biden Controls

ServiceNow Launches AI Platform to Streamline Business Operations

OpenAI Restructures to Boost AI’s Global Accessibility

Top Tools Reshape Developer Workflows in 2025

AI Chatbots Impact Workplaces, But Do They Deliver?

Share This Article
Facebook Twitter Copy Link Print
Kaan Demirel
By Kaan Demirel
Kaan Demirel is a 28-year-old gaming enthusiast residing in Ankara. After graduating from the Statistics department of METU, he completed his master's degree in computer science. Kaan has a particular interest in strategy and simulation games and spends his free time playing competitive games and continuously learning new things about technology and game development. He is also interested in electric vehicles and cyber security. He works as a content editor at NewsLinker, where he leverages his passion for technology and gaming.
Previous Article Data Storage Giants Surpass Wall Street’s Estimates
Next Article Call of Duty Teases Major Conflict in New Trailer

Stay Connected

6.2kLike
8kFollow
2.3kSubscribe
1.7kFollow

Latest News

Mazda Partners with Tesla for Charging Standard Shift
Electric Vehicle
Solve Wordle’s Daily Puzzle with These Expert Tips
Gaming
US Automakers Boost Robot Deployment in 2024
Robotics
Uber Expands Autonomy Partnership with $100 Million Investment in WeRide
Robotics
EB Games Returns to Canada and Recaptures Nostalgia
Gaming
NEWSLINKER – your premier source for the latest updates in ai, robotics, electric vehicle, gaming, and technology. We are dedicated to bringing you the most accurate, timely, and engaging content from across these dynamic industries. Join us on our journey of discovery and stay informed in this ever-evolving digital age.

ARTIFICAL INTELLIGENCE

  • Can Artificial Intelligence Achieve Consciousness?
  • What is Artificial Intelligence (AI)?
  • How does Artificial Intelligence Work?
  • Will AI Take Over the World?
  • What Is OpenAI?
  • What is Artifical General Intelligence?

ELECTRIC VEHICLE

  • What is Electric Vehicle in Simple Words?
  • How do Electric Cars Work?
  • What is the Advantage and Disadvantage of Electric Cars?
  • Is Electric Car the Future?

RESEARCH

  • Robotics Market Research & Report
  • Everything you need to know about IoT
  • What Is Wearable Technology?
  • What is FANUC Robotics?
  • What is Anthropic AI?
Technology NewsTechnology News
Follow US
About Us   -  Cookie Policy   -   Contact

© 2025 NEWSLINKER. Powered by LK SOFTWARE
Welcome Back!

Sign in to your account

Register Lost your password?