Hacker News new | past | comments | ask | show | jobs | submit
I paid for Kagi for a bit, but got a weird vibe when I realized they were working pretty hard to paper over the fact that they pay a third party to scrape Google search results for them. The public-facing side of that coin is Kagi's position that Google should make their index available to competitors (see https://blog.kagi.com/waiting-dawn-search).

All that's to say: when I paid for Kagi, I thought I was investing in additional search infrastructure, and didn't realize Kagi had no aspirations to build their own general purpose index, and instead primarily aggregate results from other indexes, either adversarily (Google, Bing) or not (Yandex, Mojeek, Brave, Apple, etc.) I understand they do maintain their own small-web index, but I thought their aspirations were higher when I first jumped on that train.

> didn't realize Kagi had no aspirations to build their own general purpose index

Kagi employee here. We're actively working on building our own indexes beyond the limited ones we have now, not just a general index but also purpose built indexes for things like programming, etc.

I did not intend to spread misinformation here, and would like to hear more about the general-purpose index Kagi is working on. I had based my comment on several Kagi pages, but mostly https://help.kagi.com/kagi/search-details/search-sources.htm..., which mentions Teclis as Kagi's own index, but https://teclis.com/ makes it pretty clear that it's a "small web"-focused tool:

> Teclis is an attempt to surface the less known web, the web of creativity and self expression, the more humane web.

> Teclis includes its own crawl as well as results from Kagi Small Web index and results with permission from Marginalia Search.

> Teclis works best with broad queries such as 'machine learning', 'vegan diet', 'religion' etc..

Is there another crawler doing the general-purpose stuff?

How broad will they be? Do you aim to ever have large scale indexing of the web?
Hey do you guys have posts or sharing about it? It would be awesome to see what you are trying to accomplish, maybe it's time to post on HN ;)
How do you build a search index in the days of Anubis pages everywhere?
What are the challenges of doing that when so much of the internet has turned itself into SEO slop to fit Google's algorithms?

I imagine there is still a whole load of stuff out there on the internet that Google would never surface because it doesn't have enough adsense or whatever. Are you finding that?

So you will stop buying Yandex data at some point?
Hurry. Google might give up the ghost on its search product and maintaining indices on anything not geared for LLMs.

I'm not sure antitrust will help you.

[flagged]
loading story #48267647
loading story #48267475
loading story #48267879
loading story #48267776
loading story #48267492
loading story #48267928
> We're actively working on building our own indexes

Lip service. You'll have some token index of Wikipedia or something so you can say your results are "a blend of our own index and other sources".

loading story #48268735
Nobody wants to pay for anything, so the services that figured out how to profit from people not paying will win.

There was this idea born in the late '90's/early 00's that everything digital should be free. The internet was dominated by teenagers with no job and no credit card, so it made sense.

But the result of that has been a whole generation with an allergy to compensation, and the inability for anyone to compete with "free" services, even if everyone hates that service.

Prior to eternal September, the internet was dominated by college students and staff. Everything was free by virtue of there being no secure payment mechanism. That spirit continued as it opened to the broader public.
That's a really curious perspective. There are a few different angles of attack here, but let's start with this: it WAS free because people were making free content. Before the Internet we were hosting free BBSes (look those up), we then hosted websites which we made ourselves when the Internet was commercialized, and we paid for services like games where it made sense. You'd buy software you'd own forever (like Photoshop), you'd buy music you owned (like CDs), and there weren't 30 subscriptions randomly renewing on your credit card.

Google won because it was a single text box. Yahoo lost because it full of ads and pretended to be a phone book. Linux won in the server world because it was free and superior, Windows lost because it's shite and expensive.

I could go on, but before I do that I'd have to be convinced I'm not replying to a 27 year-old who just graduated business school.

loading story #48268400
loading story #48269638
Counter point: https://kagi.com/stats

About 70k people are paying at least $5 a month. I've been using the $25 a month plan for nearly 3 years now. I imagine Kagi is doing alright.

loading story #48268602
Most people will gladly throw large pile of money for everything that they feel convinced serve them well, provided they are not living by some ridiculously low wage that turn them into monthly paycheck serfs.

When large portion of moneyless teenagers grown up into indebted to death adults, there is no wonder they stick to lure at free services rather than unaffordable services.

> Nobody wants to pay for anything

Congratulations, this might be the single most trivially-disprovable statement I've ever seen on this site

loading story #48267808
loading story #48267775
I think the main value proposition of Kagi is that you're the customer not the product. As far as I know they are delivering on that.

The search infrastructure you're talking about is a natural part of that, but, like any infrastructure, it scales the organization it's supporting. Kagi is tiny so their "original infrastructure" contributions are tiny.

Put another way, you essentially were investing in infrastructure, but you were hoping for major infrastructure and what is happening is small infrastructure. Kagi would probably need to get much bigger to be able to do the infrastructure you're talking about. (And if they were much bigger, it should be natural -- at a certain scale it will make more sense to do your own than work with someone else's.)

They are building their own search index, and they should be allowed to scrape Google in the exact same way Google scraped everybody else.
I don't think they papered over this? They've been transparent about paying to scrape other indices while they work on their own.
If anything, this makes me want to pay them twice. Once for search and once for exploiting google.
Qwant and Ecosia try to build their own index: https://techcrunch.com/2025/08/06/qwant-and-ecosia-debut-sta...
I do not believe that Qwant can produce something good, they always were a company to extract money from the french taxpayer to wrap bing results.
I use and enjoy Ecosia, it works pretty well for most use cases. Unfortunately it has the same limitation as Duck and basically all of the other non-enormous-players in the search engine market: Location aware search is garbage.
loading story #48268264
I am the same, but at the same time I don't want to make assumptions about how viable it is to run a useful index for a small company. I assume they looked into it and deemed it non viable, but would like to know more.
Yes, their argument is essentially that Microsoft spent $100 billion over 20 years trying to compete and still essentially failed.
loading story #48268281
They also had a browser called Orion and till date that gave me anxiety because YouTube videos won't play the first time you load them, you need to refresh the page (randomly) and similar other weird quirks. It's state hasn't changed much over the last year either, so I switched back to Brave now.
I don’t think that’s Orion specific, I have the exact same issue with Safari and Firefox
loading story #48267359
> they were working pretty hard to paper over the fact that they pay a third party to scrape Google

Not the least bit surprising to me. I had the misfortune of talking to Kagi's CEO several years ago. Every word out of his mouth was a lie.

Kagi's the one search company I trust less than Google.

I have found few CEOs capable of telling nothing but the truth. Based on that, I am nearly certain that lying is part of the job description.
Extraordinary claims require extraordinary evidence
loading story #48268531