Playing with Vision Embeddings

https://prestonbjensen.com/posts/playing-with-vision-embeddings

135 | prestoj | 3 days ago | 11 | HN

Beautiful illustrations. I find that 'playing' is just the free and motivated version of 'exploration'.

One thought on your nicely illustrated "key observation [is] that neural networks tend to place features along directions": my guess is that the neural net was TOLD to behave that way by the choice of training objective, e.g. a cosine loss?
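To make that guess concrete, here is a minimal sketch of why a cosine-based objective only "sees" direction. The names `cosine_similarity` and `cosine_loss` are illustrative helpers, not taken from the article, and nothing here claims this is the loss the article's model was actually trained with.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two vectors given as lists of floats."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

def cosine_loss(pred, target):
    """Hypothetical loss: 0 when the vectors point the same way, up to 2 when opposite."""
    return 1.0 - cosine_similarity(pred, target)

target = [1.0, 2.0, 3.0]
pred = [2.0, 1.0, 3.0]

# Rescaling the prediction leaves the loss unchanged: only direction matters.
loss_original = cosine_loss(pred, target)
loss_rescaled = cosine_loss([10.0 * x for x in pred], target)

# Any positive multiple of the target is a perfect match under this loss.
loss_aligned = cosine_loss([5.0 * x for x in target], target)

print(loss_original, loss_rescaled, loss_aligned)
```

Since the magnitude of the embedding is invisible to such a loss, the only thing training can shape is which way the vector points, which would be consistent with features ending up as directions.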

RealityVoid | 7 hours ago

For some reason, the uncanniness of the feature pictures is deeply unsettling to me. It just stirs intense unease. A bit amusing, to be honest.

jcattle | 10 hours ago

Very nice visualizations, thanks for that!

One thing I still struggle with in my head is how these vision embeddings can then be used to give LLMs eyes.

Because you somehow need a giant training set which describes images in natural language, no? Is that actually how it works, or is there some smart trick so you don't need to pay labellers a bunch of money to look at pictures and describe them?
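One widely used approach, which this thread does not spell out, is to train on image and caption pairs that already exist on the web (alt text, captions) instead of paying labellers, using a contrastive objective as in CLIP-style models. The sketch below is an assumption-laden toy: `contrastive_loss`, the tiny 2-D embeddings, and the temperature value are all illustrative, not the article's code.

```python
import math

def normalize(v):
    """Scale a vector to unit length so dot products become cosine similarities."""
    n = math.sqrt(sum(x * x for x in v))
    return [x / n for x in v]

def log_softmax(row):
    """Numerically stable log-softmax over a list of logits."""
    m = max(row)
    lse = m + math.log(sum(math.exp(x - m) for x in row))
    return [x - lse for x in row]

def contrastive_loss(image_embs, text_embs, temperature=0.07):
    """Symmetric cross-entropy: image i should match caption i, and vice versa."""
    imgs = [normalize(v) for v in image_embs]
    txts = [normalize(v) for v in text_embs]
    n = len(imgs)
    # logits[i][j]: temperature-scaled similarity of image i and caption j
    logits = [[sum(a * b for a, b in zip(imgs[i], txts[j])) / temperature
               for j in range(n)] for i in range(n)]
    columns = [[logits[i][j] for i in range(n)] for j in range(n)]
    image_to_text = -sum(log_softmax(logits[i])[i] for i in range(n)) / n
    text_to_image = -sum(log_softmax(columns[j])[j] for j in range(n)) / n
    return 0.5 * (image_to_text + text_to_image)

# Toy batch: two images and their captions, already embedded (made-up numbers).
image_embs = [[1.0, 0.1], [0.1, 1.0]]
matched_text = [[0.9, 0.2], [0.2, 0.9]]
swapped_text = [matched_text[1], matched_text[0]]

loss_matched = contrastive_loss(image_embs, matched_text)
loss_swapped = contrastive_loss(image_embs, swapped_text)
print(loss_matched, loss_swapped)
```

Correctly paired captions give a lower loss than swapped ones, so minimizing it pulls each image embedding toward its own caption. The supervision comes from the pairing itself, not from hand-written descriptions.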
