参考文献/References:
[1] Shannon C E. Prediction and entropy of printed English[J]. Bell System Technical Journal, 1951, 30(1):50-64.
[2] Vaswani A, Zhao Y, Fossum V, et al. Decoding with large-scale neural language models improves translation[C]//Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. Seattle, America: Association for Computational Linguistics,2013:1387-1392.
[3] Mnih A, Teh Y W. A fast and simple algorithm for training neural probabilistic language models[C]//Proceedings of the 29th International Conference on Machine Learning. Edinburgh: International Machine Learning Society,2012:1751-1758.
[4] Kneser R, Ney H. Improved clustering techniques for class-based statistical language modelling[C]//Eurospeech’93. Berlin, Germany: International Speech Communication Association,1993:973-976.
[5] Och F J, Ney H. A systematic comparison of various statistical alignment models//[J]. Computational Linguistics, 2003, 29(1):19-51.
[6] Koehn P, Hoang H, Birch A, et al. Moses: Open source toolkit for statistical machine translation[C]//Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions. Prague, Czech: Association for Computational Linguistics, 2007.177-180.
[7] Koehn P, Och F J, Marcu D. Statistical phrase-based translation[C]// Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology. Edmonton, Canada: Association for Computational Linguistics, 2003.127-133.
[8] Och F J. Statistical machine translation : from single word models to alignment templates[J]. Rwth Aachen, 2002, 10(2):65-70.
[9] Papineni K, Roukos S, Ward T, et al. BLEU: a method for automatic evaluation of machine translation[C]// Proceedings of the 40th Annual Meeting on Association for Computational Linguistics. Philadelphia, America: Association for Computational Linguistics, 2002.311-318.