Independent Web Indexes (10)
Their own crawl of the web. Not Google, not Bing — independent search indexes you can query via API.
Brave Search API
Editor's PickProgrammatic access to the only independent Western search index at scale — 40B+ pages, adding 100M+ new pages daily.
Common Crawl
Most PopularNonprofit open web archive with 9.5 PB of data — the foundational dataset behind 64% of major LLMs including GPT-3.
Mojeek
UK privacy-first search engine with its own independently built ~3.6B page index — the first search engine to pledge no tracking.
Webz.io
Machine-readable web data feeds covering news, blogs, forums, and the dark web — pre-indexed and structured for enterprise consumption.
Yandex Search API
API access to Yandex's own search index — the dominant search engine in Russia and parts of Eastern Europe.
Stract
Open-source, non-profit search engine building its own index with a focus on user-customizable ranking and transparency.
Marginalia Search
Independent search engine focused on non-commercial, text-heavy content — surfaces the 'old web' that Google buries.
Qwant
French privacy-focused search engine with its own partial index, GDPR-native, and backed by European institutional investors.
Alexandria
Open, decentralized search index project aiming to build a community-owned alternative to Google and Bing.
Gigablast
Open-source search engine with its own index, built by a single developer — one of the oldest independent web crawlers still operating.