Story Detail of id 47476949 | Liveview Hacker News

logicallee8 hours ago | on: Flash-MoE: Running a 397B Parameter Model on a Laptop

can you post the final result (or as far as you got before you killed it) to show us how cohesive and good it is? I'd like to see an example of the output of this.

frwickst8 hours ago | parent

Since the output is quite long, here is a link: https://pastebin.com/k76wiVGP

hrimfaxi8 hours ago | root | parent

Why does this G character appear to prefix most of the output? ("Ġlike")

frwickst7 hours ago | root | parent | next

It is a tokenizer artifact most likely (https://github.com/huggingface/transformers/issues/4786). So the output is not properly decoded in this case, it should just be a space.

kgeist7 hours ago | root | parent

The original tokens have Ġ instead of space. I had this issue too when writing an inference engine for Qwen. You have to "normalize" those special characters.

#visit	13,229,589
#session	74,665
#live-session	0