×

Research on statistical machine translation model based on deep neural network. (English) Zbl 1459.68219

Summary: With the increase of translation demand, the advancement of information technology, the development of linguistic theories and the progress of natural language understanding models in artificial intelligence research, machine translation has gradually gained worldwide attention. However, at present, machine translation research still has problems such as insufficient bilingual data and lack of effective feature representation, which affects the further improvement of key modules of machine translation such as word alignment, sequence adjustment and translation modelling. The effect of machine translation is still unsatisfactory. As a new machine learning method, deep neural network can automatically learn abstract feature representation and establish a complex mapping relationship between input and output signals, which provides a new idea for statistical machine translation research. Firstly, the multi-layer neural network and the undirected probability graph model are combined, and the similarity and context information of vocabulary are effectively utilized to model the word alignment more fully, and the word alignment model named NNWAM is constructed. Secondly, the low dimension will be used. The feature representation is combined with other features into a linearly ordered pre-ordering model to construct the pre-ordering model named NNPR. Finally, the word alignment model and the pre-ordering model are combined in the same deep neural network framework to form DNNAPM, a statistical machine translation model based on deep neural networks. The experimental results show that the statistical machine translation model based on deep neural network has better effect, faster convergence and better reliability than the comparison model algorithm.

MSC:

68T50 Natural language processing
68T07 Artificial neural networks and deep learning

Software:

STEWord
Full Text: DOI

References:

[1] Claycomb, J.; Abreu-Goodger, C.; Buck, AH, RNA-mediated communication between helminths and their hosts: the missing links, RNA Biol, 14, 4, 436-441 (2017) · doi:10.1080/15476286.2016.1274852
[2] Su, J.; Zeng, J.; Xiong, D., A hierarchy-to-sequence attentional neural machine translation model, IEEE/ACM Trans Audio Speech Lang Process, 26, 3, 623-632 (2018) · doi:10.1109/TASLP.2018.2789721
[3] Lo, B.; Zettler, P.; Cedars, MI, A new era in the ethics of human embryonic stem cell research, Stem Cells, 23, 10, 1454-1459 (2010) · doi:10.1634/stemcells.2005-0324
[4] Curtmola, R.; Garay, J.; Kamara, S., Searchable symmetric encryption: improved definitions and efficient constructions, J Comput Secur, 19, 5, 895-934 (2011) · doi:10.3233/JCS-2011-0426
[5] Sun, Y.; Xu, J.; Qiang, H., Adaptive sliding mode control of maglev system based on RBF neural network minimum parameter learning method, Measurement, 141, 217-226 (2019) · doi:10.1016/j.measurement.2019.03.006
[6] Chakraborty, S.; Khasidashvili, Z.; Seger, CJH, Symbolic trajectory evaluation for word-level verification: theory and implementation, Form Methods Syst Des, 50, 2-3, 1-36 (2017)
[7] Kim, K.; Park, EJ; Shin, JH, Divergence-based fine pruning of phrase-based statistical translation model, Comput Speech Lang, 41, C, 146-160 (2017) · doi:10.1016/j.csl.2016.06.006
[8] Dai, X-G; Wang, P., A new classification of large-scale climate regimes around the Tibetan Plateau based on seasonal circulation patterns, Adv Clim Change Res, 8, 1, 26-36 (2017) · doi:10.1016/j.accre.2017.01.001
[9] Ashraf, N.; Ahmad, M., Machine translation techniques and their comparative study, Int J Comput Appl, 125, 7, 25-31 (2015)
[10] Gao, S.; Yang, X.; Yu, Z., Chinese-Naxi machine translation method based on Naxi dependency language model, Int J Mach Learn Cybern, 8, 1, 333-342 (2017) · doi:10.1007/s13042-014-0325-2
[11] Dong, W.; Chi, M., Long short-term memory with quadratic connections in recursive neural networks for representing compositional semantics, IEEE Access, 5, 16077-16083 (2017) · doi:10.1109/ACCESS.2016.2647384
[12] Zhang, X.; Liang, Y.; Chen, L., Recursive autoencoders-based unsupervised feature learning for hyperspectral image classification, IEEE Geosci Remote Sens Lett, 14, 11, 1928-1932 (2017) · doi:10.1109/LGRS.2017.2737823
[13] Wang, S.; Cong, Y.; Cao, J., Scalable gastroscopic video summarization via similar-inhibition dictionary selection, Artif Intell Med, 66, 1-13 (2016) · doi:10.1016/j.artmed.2015.08.006
[14] Guzmán, F.; Joty, S.; Màrquez, L., Machine translation evaluation with neural networks, Comput Speech Lang, 45, 180-200 (2017) · doi:10.1016/j.csl.2016.12.005
[15] Ding, C.; Sakanushi, K.; Touji, H., Inter-, intra-, and extra-chunk pre-ordering for statistical Japanese-to-English machine translation, ACM Trans Asian Low Resour Lang Inf Process, 15, 3, 1-28 (2016) · doi:10.1145/2818381
[16] Chong, CC; Lim, TY; Soon, LK, Meaning preservation in example-based machine translation with structural semantics, Expert Syst Appl, 78, 242-258 (2017) · doi:10.1016/j.eswa.2017.02.021
[17] Hasler, E.; Gispert, AD; Stahlberg, F., Source sentence simplification for statistical machine translation, Comput Speech Lang, 45, C, 221-235 (2017) · doi:10.1016/j.csl.2016.12.001
[18] Song, Z., The research on key technologies of chinese heavy-lift launch vehicle control system, Aerosp China, 18, 2, 13-22 (2017)
[19] Marmanis, D.; Datcu, M.; Esch, T., Deep learning earth observation classification using imagenet pretrained networks, IEEE Geosci Remote Sens Lett, 13, 1, 105-109 (2016) · doi:10.1109/LGRS.2015.2499239
[20] Weber, M.; Fackeldey, K.; Schütte, C., Set-free Markov state model building, J Chem Phys, 146, 12, 124133 (2017) · doi:10.1063/1.4978501
[21] Shen, M.; Dan, Y., A finite frequency approach to control of Markov jump linear systems with incomplete transition probabilities, Appl Math Comput, 295, 53-64 (2017) · Zbl 1411.93186
[22] Kang, L.; Xu, L.; Zhao, J., Co-extracting opinion targets and opinion words from online reviews based on the word alignment model, IEEE Trans Knowl Data Eng, 27, 3, 636-650 (2018)
[23] Wu, ZG; Ju, HP; Su, H., Passivity analysis of Markov jump neural networks with mixed time-delays and piecewise-constant transition rates, Nonlinear Anal Real World Appl, 13, 5, 2423-2431 (2012) · Zbl 1260.60172 · doi:10.1016/j.nonrwa.2012.02.009
[24] Mo, YY; Guo, JY; Yu, ZT, A bilingual word alignment algorithm of Vietnamese-Chinese based on feature constraint, Int J Mach Learn Cybern, 6, 4, 537-543 (2015) · doi:10.1007/s13042-014-0293-6
[25] Liu, Y., Digital image recognition based on improved cognitive neural network, Transl Neurosci, 10, 1, 125-128 (2019) · doi:10.1515/tnsci-2019-0021
[26] He, W., Computational neuroscience applied in surface roughness fiber optic sensor, Transl Neurosci, 10, 1, 70-75 (2019) · doi:10.1515/tnsci-2019-0012
[27] Chen, MC; Lu, SQ; Liu, QL, Global regularity for a 2D model of electro-kinetic fluid in a bounded domain, Acta Mathematicae Applicatae Sinica, Engl Ser, 34, 2, 398-403 (2018) · Zbl 1392.35204 · doi:10.1007/s10255-018-0740-3
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.