Hacker News new | past | comments | ask | show | jobs | submit
P40 is Tesla architecture which is no longer receiving driver or CUDA updates. And only available as used hardware. Fine for hobbyists, startups, and home labs, but there is likely a growing market of businesses too large to depend on used gear from ebay, but too small for a full rack solution from Nvidia. Seems like that's who they're targeting.
99% of interest is in inference. If you want to fine-tune a model, just rent the best gpu in the cloud. It's often cheaper and faster.
Great option if you don't mind sharing your data with the cloud. Some businesses want to own the hardware their data resides on.
How many businesses have the capabilities and expertise to train their own models?
No idea. Probably more every day.
renting GPU, how is that sharing data with the cloud? you can rent GPU from GCP or AWS
I suppose if I rent a cloud GPU and just let it sit there dark and do nothing then I wouldn't have to move any data to it. Otherwise, I'm uploading some kind of work for it to do. And that usually involves some data to operate on. Even if it's just prompts.
So you also believe when you rent a server you are sharing your data with the cloud? AWS and GCP are copying all private data on servers? Give me a break. There's a big difference between renting a server and using an API.
> So you also believe when you rent a server you are sharing your data with the cloud [hosting provider]?

Only if you upload your data to that cloud server you rented. Then, by definition, you are.

> AWS and GCP are copying all private data on servers?

Every computer copies data when moving it. Several times, in fact. Through network card buffers, switches, system memory, disk caches, and finally to some form of semi-permanent storage.

I don't have to think Amazon is stealing my data to be aware that Amazon S3 buckets containing privileged information are routinely found open. I don't have to think that Google is spying on me to know that operating equipment my business owns on prem and does not share requires me to trust fewer people and less complex systems than doing the same work from the cloud.

You are very quick to make foolish assumptions and assign them to others.