Hacker News new | past | comments | ask | show | jobs | submit

  vocabulary*

  *In the code above, we collect all unique characters across the dataset