Technology NewsTechnology NewsTechnology News
  • Computing
  • AI
  • Robotics
  • Cybersecurity
  • Electric Vehicle
  • Wearables
  • Gaming
  • Space
Reading: Baidu Blocks Google and Bing from Accessing Baike Content
Share
Font ResizerAa
Technology NewsTechnology News
Font ResizerAa
Search
  • Computing
  • AI
  • Robotics
  • Cybersecurity
  • Electric Vehicle
  • Wearables
  • Gaming
  • Space
Follow US
  • Cookie Policy (EU)
  • Contact
  • About
© 2025 NEWSLINKER - Powered by LK SOFTWARE
AI

Baidu Blocks Google and Bing from Accessing Baike Content

Highlights

  • Baidu blocks Google and Bing from accessing Baike content.

  • Industry trend shows tech companies protecting proprietary data.

  • AI developers seek high-quality data for training models.

Ethan Moreno
Last updated: 28 August, 2024 - 5:28 pm 5:28 pm
Ethan Moreno 9 months ago
Share
SHARE

Baidu, a leading Chinese internet search provider, has implemented a significant update to its Baike service, akin to Wikipedia, to block Google and Microsoft Bing from scraping its content. This strategic move underlines the increasing value of large datasets essential for training artificial intelligence (AI) models and applications. The decision is a part of a broader trend where technology companies are reevaluating their data-sharing policies to safeguard their valuable digital resources. Industry observers have noted similar actions from other companies aiming to control how their information is accessed and utilized by third-party platforms.

Contents
Updated Robots.txt FileImplications for AI Development

In 2019, Microsoft considered similar restrictions on its internet-search data to limit access by rival search engine operators, particularly those developing chatbots and generative AI services. This evolving trend reflects the growing emphasis on data security and proprietary content management among tech giants. Parallelly, Reddit took steps to block multiple search engines from indexing its content, except for Google, which had a financial agreement for data access. Such measures indicate a shift towards monetizing data access and controlling its distribution across various platforms.

Updated Robots.txt File

The latest update to Baidu Baike’s robots.txt file now denies access to both Googlebot and Bingbot crawlers, effective from August 8, as noted by the Wayback Machine. Previously, these search engines were permitted to index Baidu Baike’s extensive repository, encompassing nearly 30 million entries, though some subdomains were already restricted. This change signifies Baidu’s proactive approach in managing access to its comprehensive online content.

Implications for AI Development

Baidu’s decision aligns with an industry-wide trend where AI developers seek high-quality content for training their models. Companies like OpenAI have established agreements with content publishers, such as Time magazine and the Financial Times, to gain access to extensive archives for their AI projects. This practice highlights the competitive landscape for securing valuable datasets critical for advancing AI capabilities.

Currently, the Chinese Wikipedia, with 1.43 million entries, continues to be available to search engine crawlers. Despite Baidu’s restrictions, entries from Baidu Baike still appear in Google and Bing searches, likely due to older cached content. This situation underscores the persistent demand for comprehensive data by AI developers and search engines.

Baidu’s restrictions on its Baike content reflect a broader industry trend towards controlling and monetizing valuable online data. As AI technology advances, access to extensive, curated datasets becomes increasingly critical. This move may prompt other companies to reassess their data-sharing practices, leading to more restricted access or commercial arrangements for data usage.

You can follow us on Youtube, Telegram, Facebook, Linkedin, Twitter ( X ), Mastodon and Bluesky

You Might Also Like

Trump Alters AI Chip Export Strategy, Reversing Biden Controls

ServiceNow Launches AI Platform to Streamline Business Operations

OpenAI Restructures to Boost AI’s Global Accessibility

Top Tools Reshape Developer Workflows in 2025

AI Chatbots Impact Workplaces, But Do They Deliver?

Share This Article
Facebook Twitter Copy Link Print
Ethan Moreno
By Ethan Moreno
Ethan Moreno, a 35-year-old California resident, is a media graduate. Recognized for his extensive media knowledge and sharp editing skills, Ethan is a passionate professional dedicated to improving the accuracy and quality of news. Specializing in digital media, Moreno keeps abreast of technology, science and new media trends to shape content strategies.
Previous Article Chinese Entities Exploit Cloud Loophole for US AI Chips
Next Article VMware Faces Ecosystem Challenges Amidst Growing Demands

Stay Connected

6.2kLike
8kFollow
2.3kSubscribe
1.7kFollow

Latest News

Nvidia Faces Price Uncertainty Despite Tariff Agreement
Computing
Orbbec Debuts Gemini 435Le for Enhanced Industrial 3D Vision
Robotics
Tesla Drives Toward $1 Trillion Valuation With Tariff Rollback
Electric Vehicle
China and Tesla Compete in Humanoid Robot Development
Electric Vehicle
FTC Delays Enforcement of Subscription Cancellation Rule
Gaming
NEWSLINKER – your premier source for the latest updates in ai, robotics, electric vehicle, gaming, and technology. We are dedicated to bringing you the most accurate, timely, and engaging content from across these dynamic industries. Join us on our journey of discovery and stay informed in this ever-evolving digital age.

ARTIFICAL INTELLIGENCE

  • Can Artificial Intelligence Achieve Consciousness?
  • What is Artificial Intelligence (AI)?
  • How does Artificial Intelligence Work?
  • Will AI Take Over the World?
  • What Is OpenAI?
  • What is Artifical General Intelligence?

ELECTRIC VEHICLE

  • What is Electric Vehicle in Simple Words?
  • How do Electric Cars Work?
  • What is the Advantage and Disadvantage of Electric Cars?
  • Is Electric Car the Future?

RESEARCH

  • Robotics Market Research & Report
  • Everything you need to know about IoT
  • What Is Wearable Technology?
  • What is FANUC Robotics?
  • What is Anthropic AI?
Technology NewsTechnology News
Follow US
About Us   -  Cookie Policy   -   Contact

© 2025 NEWSLINKER. Powered by LK SOFTWARE
Welcome Back!

Sign in to your account

Register Lost your password?