Metrics
    perplexity (lower is better)

Categories
    char-level LM
    word-level LM

Research directions
    devising novel architectures
    improving regularization and optimization algorithms
    speeding up the Softmax computation
    enriching the output distribution family
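
As an illustration of the perplexity metric listed above, here is a minimal sketch that assumes the model has already assigned a probability to each token of a held-out sequence. The function name `perplexity` and the probability values are hypothetical, chosen only to show why lower is better: perplexity is the exponential of the average negative log-probability per token, so a model that puts more mass on the correct tokens scores lower.

```python
import math


def perplexity(token_probs):
    """Return exp of the mean negative log-probability per token.

    token_probs: probabilities the model assigned to the tokens that
    actually occurred (hypothetical inputs, each in (0, 1]).
    """
    if not token_probs:
        raise ValueError("need at least one token probability")
    mean_nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(mean_nll)


# A model that spreads its mass evenly over 4 choices at every step
# is as uncertain as a fair 4-way guess.
uniform = perplexity([0.25] * 8)

# A model that gives each correct token probability 0.5 is less "perplexed".
sharper = perplexity([0.5] * 8)

print(uniform, sharper)
```

The same definition applies to both char-level and word-level LMs; only the unit counted as a token changes, so perplexities are comparable only between models that use the same tokenization.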