The Battle for Internet Freedom

Why a Decentralized Data Layer is More Important than Ever

By

Jeremy Huff

The Corporate Stranglehold is Tightening

While everyone's celebrating AI breakthroughs, the infrastructure powering these models is quietly becoming the most dangerous monopoly in human history. Big Tech has weaponized data scarcity to create walled gardens so impenetrable that they make medieval fortresses look like picket fences.

Google didn't start with a mission to close off the internet—it promised to "Do no evil." Facebook claimed it would make the web more social and open. But here we are, watching these platforms guide users within specific digital realms, like gardens enclosed by walls, where access to content and services is tightly controlled and motivated by corporate bottom lines.

The vast majority of users access information on the internet through search engines, like Google, or curated platforms, like Facebook, and what you're seeing is only what they want you to see. When you conduct a search on Google, you're not searching the web, you're searching Google's (curated) index of the web and the search results, surprise surprise, those are curated as well.

The Mainstream Narrative vs. Reality

The Mainstream Take: AI democratizes information by making search more intelligent and personalized. These platforms provide better user experiences through curation.

The Reality: Reddit provides free API access to Google through exclusive agreements, but third-party users cannot access it. Twitter, Meta, Medium, and other Web2 platforms have also restricted data scraping. The rise of zero-click experiences, where search engines provide answers directly on the results page, reduces the likelihood that users will leave the platform.

The Implication: We're witnessing the systematic enclosure of human knowledge. When companies control the narrative, it stifles creativity and affects potential advancements. As algorithms dictate which content is visible, users become trapped in echo chambers where their beliefs are continuously validated without exposure to challenging ideas.

We're sleepwalking into an AI-controlled internet where access to information will be determined by whatever serves corporate algorithms, not human curiosity.

The Internet's Broken Promise

The original promise of the internet was decentralized collaboration. It was a DoD project to establish a computer data communications network that could withstand disasters like war, requiring decentralization so that if one part failed, the rest could still function using peer-to-peer interconnectivity.

What Robert Taylor and his team envisioned when they created the foundations of the modern internet was an open and decentralized network as opposed to a closed network managed from one central location. The goal was extending collaboration across scientific communities.

The rise of cloud computing and large platforms that served Web 2.0 led to a recentralization of the internet around those services. We traded the internet's collaborative spirit for convenience and got digital serfdom in return.

The shift happened gradually—first through ISP consolidation, then platform monopolization, and finally algorithmic control. Since it is centralized, there is not much choice for users. A user's access to the internet is at the mercy of their provider and the platform. These single points of failure and centralized controls on information were not the original plan or promise of the internet.

The Walled Garden Takeover

On the internet, a walled garden is an environment that controls the user's access to network-based content and services, directing navigation within particular areas to enable access to a selection of material or prevent access to other material.

Looking at the past decade, Google dominates the search engine market, Facebook controls social media through Instagram and WhatsApp acquisitions, and Apple's App Store restricts access to over 2.2 million apps based on their stringent standards. This represents the systematic capture of information flow.

Facebook pages are basically tombstones now. Publishers with massive followings were suddenly out of business when Facebook changed its algorithm to emphasize friends and family over news content. The platform that promised to connect the world became a content cemetery controlled by algorithmic gravediggers.

But here's what most people missed: This was always the plan. Walled gardens require users to remain on platforms rather than being diverted elsewhere, maximizing marketing and advertising reach while controlling the type of content users can access.

The Real Battle is For Data

AI only makes the problem worse, and the technical reality is staggering. Large language models are trained on billions of words and phrases from the internet. The more data they have, the smarter they become. But platforms are systematically restricting access to this data. While surface-level talking points like "AI democratization" may sound nice, the increasing restrictions on data access infrastructure tells a different story.

If Big Tech companies control both the data inputs to the AI models and the algorithms that create the outputs, internet search simply becomes an echo chamber of corporate-approved information. The conventional wisdom says platform curation improves user experience and safety, but at what cost to freedom of information?

Enter Grass Protocol—a blockchain project launched in the depth of the 2022-23 crypto winter as everyone was licking their wounds from the FTX collapse. With a stated mission to redefine internet incentive structures, Grass quietly built the world’s first Layer 2 data rollup that rewards users for sharing their unused internet bandwidth to scrape data. The genius of Grass’ solution is that while large platforms can systematically block one another, they can’t block mass groups of decentralized individuals, because these are their target users.

Grass is leading the charge for data democratization, expanding its user base from 200,000 to over 3 million between Q4 2024 and early 2025 and achieving daily data collection rates that rival those of major tech companies.

Current search engines are optimized for clicks rather than information quality, due to their advertising-based business models. In contrast, Grass's Live Context Retrieval aims to provide unbiased, information-focused results.

Democratized data collection and open, transparent search is the only path to preventing AI systems from becoming corporate propaganda machines. While everyone is focused on competition among Big Tech companies for AI model capabilities, the real battle is for the data that trains them.

Two Futures: Walled Gardens vs. Democratized Search

Fast forward a few years into the future:

You wake up and ask your AI assistant about renewable energy trends. It responds with information exclusively from sources that paid for algorithmic placement—Big Oil's "clean natural gas" initiative ranks higher than actual solar innovations because they bought premium data access.

Your research for a college paper hits content paywalls after three sources. The AI summarizes information from the same five corporate-approved databases everyone else uses. Independent researchers, citizen journalists, and innovative startups remain invisible.

You try to fact-check a news story, but search results are filtered through engagement algorithms designed to keep you scrolling, not inform you. Contradictory information exists but lives behind platform walls you can't access.

Alternatively,

You wake up and query real-time, unfiltered internet data sourced from a decentralized network of web-scraping nodes. Your question about renewable energy accesses live data from research institutions, startup labs, and citizen science projects worldwide—not just sources that paid for visibility.

Your research draws from a genuinely diverse dataset collected by millions of distributed nodes. You access specific relevant data from unexpected sources because the data layer doesn't discriminate based on corporate budgets.

When fact-checking, you access real-time context that includes dissenting voices, primary sources, and transparent provenance trails. The protocol utilizes zero-knowledge proofs to record data provenance history on the blockchain, creating a secure system where all participants benefit.

The Bottom Line

The internet is becoming a corporate-controlled information prison, and projects like Grass are building the escape tunnel.

The next five years will determine whether artificial intelligence amplifies human knowledge or corporate control. Grass leverages the power of the collective to search for necessary information, process collected data, and record its provenance history on the blockchain—creating a transparent data marketplace that rewards all participants and provides AI models the most comprehensive possible data set.

If successful, this model could influence how other projects approach the intersection of AI, data, and decentralized technologies, potentially setting new standards for the evolving internet landscape.

What's your take? Are you building on corporate data foundations or helping construct the infrastructure for digital freedom? Drop your thoughts below—especially if you think I'm wrong about the severity of data centralization. The fight for internet freedom needs builders, not just believers.

Disclaimer: I am a Founding Partner at No Limit Holdings, an investor in Grass Protocol, and views shared in this article may be influenced by our investment. This article is for informational purposes only and should not be considered financial advice. Please consult with a financial advisor before making any investment decisions.

© 2024 No Limit Holdings. All rights reserved.