Hacker News new | past | comments | ask | show | jobs | submit
Good time to focus on more memory efficient means of training and inference.

SeedLM from Apple is an interesting approach for inference memory efficiency. I'd like to see someone try and build that into training so that it's not a post training compression step.