Technology NewsTechnology NewsTechnology News
  • Computing
  • AI
  • Robotics
  • Cybersecurity
  • Electric Vehicle
  • Wearables
  • Gaming
  • Space
Reading: Cerebras Challenges Nvidia with New AI Inference Tool
Share
Font ResizerAa
Technology NewsTechnology News
Font ResizerAa
Search
  • Computing
  • AI
  • Robotics
  • Cybersecurity
  • Electric Vehicle
  • Wearables
  • Gaming
  • Space
Follow US
  • Cookie Policy (EU)
  • Contact
  • About
© 2025 NEWSLINKER - Powered by LK SOFTWARE
AI

Cerebras Challenges Nvidia with New AI Inference Tool

Highlights

  • Cerebras offers a new AI inference tool aiming to rival Nvidia.

  • The tool achieves 1,800 tokens per second for Llama 3.1 8B.

  • Enterprises must weigh performance, cost, and ease of implementation.

Kaan Demirel
Last updated: 29 August, 2024 - 12:57 pm 12:57 pm
Kaan Demirel 10 months ago
Share
SHARE

AI hardware startup Cerebras has introduced its latest AI inference solution, targeting enterprises that seek faster and more cost-efficient alternatives to Nvidia‘s GPU offerings. This development marks a significant move in the AI hardware landscape, where performance and cost are critical factors for enterprise adoption. While Nvidia holds a dominant position in the market, Cerebras aims to disrupt the status quo with its advanced technology.

Contents
Market DynamicsPerformance Benchmarks

Cerebras’ Inference tool leverages the company’s Wafer-Scale Engine, achieving speeds of 1,800 tokens per second for Llama 3.1 8B and 450 tokens per second for Llama 3.1 70B. These speeds surpass the typical capabilities of Nvidia’s hyperscale cloud products, offering a more cost-effective solution. Gartner analyst Arun Chandrasekaran observes a market shift towards the cost and speed of inferencing, driven by the rise of AI use cases in enterprise settings. This shift provides an opportunity for vendors like Cerebras to compete based on performance.

Market Dynamics

Performance Benchmarks

As Micah Hill-Smith, co-founder and CEO of Artificial Analysis, says, “Cerebras really shined in their AI inference benchmarks.” The company’s tool set new records with over 1,800 output tokens per second on Llama 3.1 8B and more than 446 output tokens per second on Llama 3.1 70B.

Despite these performance benefits, Cerebras faces substantial challenges in gaining market share from Nvidia. David Nicholson, an analyst at Futurum Group, highlights that while Cerebras’ system can deliver high performance at lower costs, the critical question is whether enterprises are willing to adapt their engineering processes to integrate with Cerebras’ technology. Factors such as the scale of operations and available capital significantly influence the choice between Nvidia and Cerebras.

The AI hardware market continues to evolve, with Cerebras also facing competition from specialized cloud providers and major players like Microsoft, AWS, and Google. The balance between performance, cost, and ease of implementation will likely dictate enterprise decisions in adopting new AI inference technologies. The emergence of high-speed AI inference, capable of exceeding 1,000 tokens per second, is likened to the advent of broadband internet, potentially opening new frontiers for AI applications.

Cerebras’ entry into the AI inference market is not without hurdles. Nvidia’s entrenched software and hardware stack presents a significant barrier, and enterprises may be hesitant to switch from established solutions. However, Cerebras’ 16-bit accuracy and faster inference capabilities position it well for future AI applications requiring rapid, real-time operations. As the AI hardware segment expands, comprising about 40% of the total AI hardware market, newcomers must navigate the competitive landscape carefully, considering significant resource requirements.

You can follow us on Youtube, Telegram, Facebook, Linkedin, Twitter ( X ), Mastodon and Bluesky

You Might Also Like

Nvidia Drives Growth With Physical AI Ambitions

Meta Attracts Top AI Talent as Zuckerberg Intensifies Recruitment

Nvidia Surpasses Microsoft to Regain Lead in Market Value

AI Chatbots Reflect CCP Propaganda in Sensitive Topics, Study Finds

Lawmakers Target Online Speech with NO FAKES Act Expansion

Share This Article
Facebook Twitter Copy Link Print
Kaan Demirel
By Kaan Demirel
Kaan Demirel is a 28-year-old gaming enthusiast residing in Ankara. After graduating from the Statistics department of METU, he completed his master's degree in computer science. Kaan has a particular interest in strategy and simulation games and spends his free time playing competitive games and continuously learning new things about technology and game development. He is also interested in electric vehicles and cyber security. He works as a content editor at NewsLinker, where he leverages his passion for technology and gaming.
Previous Article Data Storage Giants Surpass Wall Street’s Estimates
Next Article Call of Duty Teases Major Conflict in New Trailer

Stay Connected

6.2kLike
8kFollow
2.3kSubscribe
1.7kFollow

Latest News

Tesla Introduces Virtual Queuing to Address Supercharger Line-Cutting
Electric Vehicle
Badger Technologies Launches Digital Teammate to Support Retail Staff
Robotics
Tesla Targets Affordable Models and Self-Delivery Milestones This Quarter
Electric Vehicle
Samsung 990 Pro SSD Hits Record Low Price on Amazon Today
Computing
OnePlus Releases Compact Watch 3 43mm to Expand Smartwatch Options
Wearables
NEWSLINKER – your premier source for the latest updates in ai, robotics, electric vehicle, gaming, and technology. We are dedicated to bringing you the most accurate, timely, and engaging content from across these dynamic industries. Join us on our journey of discovery and stay informed in this ever-evolving digital age.

ARTIFICAL INTELLIGENCE

  • Can Artificial Intelligence Achieve Consciousness?
  • What is Artificial Intelligence (AI)?
  • How does Artificial Intelligence Work?
  • Will AI Take Over the World?
  • What Is OpenAI?
  • What is Artifical General Intelligence?

ELECTRIC VEHICLE

  • What is Electric Vehicle in Simple Words?
  • How do Electric Cars Work?
  • What is the Advantage and Disadvantage of Electric Cars?
  • Is Electric Car the Future?

RESEARCH

  • Robotics Market Research & Report
  • Everything you need to know about IoT
  • What Is Wearable Technology?
  • What is FANUC Robotics?
  • What is Anthropic AI?
Technology NewsTechnology News
Follow US
About Us   -  Cookie Policy   -   Contact

© 2025 NEWSLINKER. Powered by LK SOFTWARE
Welcome Back!

Sign in to your account

Register Lost your password?