Using Collaborative Training Method to Build Vietnamese Dependency Treebank

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 10035))

Included in the following conference series:

1727 Accesses

Abstract

For the difficulty of marking Vietnamese dependency tree, this paper proposed the method which combined MST algorithm and improved Nivre algorithm to build Vietnamese dependency treebank. The method took full advantage of the characteristics of collaborative training. Firstly, we built a bit samples. Secondly, we used the samples to build two weak learners with two fully redundant views. Then, we marked a large number of unmarked samples mutually. Next, we selected the samples of high trust to relearn and built a dependency parsing system. Finally, we used 5000 Vietnamese sentences marked manually to do tenfold cross-test and obtained the accuracy of 76.33 %. Experimental results showed that the proposed method in this paper could take full advantage of unmarked corpus to effectively improve the quality of dependency treebank.

This work was supported in part by the National Natural Science Foundation of China (Grant Nos. 61262041, 61363044 and 61472168) and the key project of National Natural Science Foundation of Yunnan province (Grant No. 2013FA030).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Adapting Cross-Lingual Model to Improve Vietnamese Dependency Parsing

Iterative Integration of Unsupervised Features for Chinese Dependency Parsing

BERT-Based Sentence Recommendation for Building Vietnamese Universal Dependency Treebank

References

Le-Hong, P., Nguyen, T.M.H.: Part-of-speech induction for Vietnamese. In: Huynh, V.N., Denoeux, T., Tran, D.H., Le, A.C., Pham, B.S. (eds.) KSE 2013, Part II. AISC, vol. 245, pp. 273–286. Springer, Heidelberg (2014)
Chapter Google Scholar
Le-Hong, P., Nguyen, T.M.H., Rossignol, M., Roussanaly, A.: An empirical study of maximum entropy approach for part-of-speech tagging of Vietnamese texts. In: Actes du Traitement Automatique des Langues Naturelles (TALN-2010), Montreal, Canada (2010)
Google Scholar
Dinh, Q.T., Nguyen, T.M H., Vu, X.L., Rossignol, M., Le-Hong, P., Nguyen, C.T.: Word segmentation of Vietnamese texts: a comparison of approaches. In: Proceedings of the Sixth International Conference on Language Resources and Evaluation, Marrakech, Morocco (2008)
Google Scholar
Lai, T.B.Y., Huang, C.N., Zhou, M., Miao, J.B., Siu, K.C.: Span-based statistical dependency parsing of Chinese. In: Proceedings of NLPRS, pp. 677–684 (2001)
Google Scholar
Yamada, H., Matsumoto, Y.: Statistical dependency analysis with support vector machines. In: Proceedings of the 8th International Workshop on Parsing Technologies (IWPT), pp. 195–206 (2003)
Google Scholar
Ma, J.S., Zhang, Y., Liu, T., Li, S.: A statistical dependency parser of Chinese under small training data. In: Workshop: Beyond Shallow Analyses-Formalisms and Statistical Modeling for Deep Analyses, IJCNLP-2004, San Ya, pp. 113–118 (2004)
Google Scholar
Thi, L.N., Vietnam, H.N., Minh, H.N.T., Le Hong, P.: Building a treebank for Vietnamese dependency parsing. In: IEEE RIVF International Conference on Computing and Communication Technologies - Research, Innovation, and Vision for the Future (RIVF), 10–13 November 2013
Google Scholar
McDonald, R.: Non-projective dependency parsing using spanning tree algorithms, pp. 523–530. Association for Computational Linguistics (2005)
Google Scholar
Eisner, J.: Three new probabilistic models for dependency parsing: an exploration. In: Proceedings of the COLING (1996)
Google Scholar
Chu, Y.J., Liu, T.H.: On the shortest arborescence of a directed graph. Sci. Sinica 14, 1396–1400 (1965)
MathSciNet MATH Google Scholar
Edmonds, J.: Optimum branchings. J. Res. Natl. Bur. Stand. 71B, 233–240 (1967)
Article MathSciNet MATH Google Scholar
Beyer, K., Ramakrishnan, R.: Bottom-up computation of sparse and iceberg cubes. In: Proceedings of the 1999 ACM SIGMOD International Conference on Management of Data, Philadelphia, pp. 359–370 (1999)
Google Scholar
Findlater, L., Hamilton, H.J.: Iceberg-cube algorithms: an empirical evaluation on synthetic and real data. Intell. Data Anal. 7(2), 77–97 (2003)
MATH Google Scholar
Nivre, J., Scholz, M.: Deterministic dependency parsing of English text. In: Proceedings of the 20th International Conference on Computational Linguistics (COLING), pp. 64–70 (2004)
Google Scholar
Nivre, J., McDonald, R.: Integrating graphbased and transition-based dependency parsers. In: Proceedings of ACL, pp. 950–958 (2008)
Google Scholar

Download references

Author information

Authors and Affiliations

The School of Information Engineering and Automation, Kunming University of Science and Technology, Kunming, 650500, Yunnan, China
Guoke Qiu & Jianyi Guo
The Key Laboratory of Intelligent Information Processing, Kunming University of Science and Technology, Kunming, 650500, Yunnan, China
Zhengtao Yu, Yantuan Xian & Cunli Mao

Authors

Guoke Qiu
View author publications
You can also search for this author in PubMed Google Scholar
Jianyi Guo
View author publications
You can also search for this author in PubMed Google Scholar
Zhengtao Yu
View author publications
You can also search for this author in PubMed Google Scholar
Yantuan Xian
View author publications
You can also search for this author in PubMed Google Scholar
Cunli Mao
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jianyi Guo .

Editor information

Editors and Affiliations

Tsinghua University , Beijing, China
Maosong Sun
Fudan University , Shanghai, China
Xuanjing Huang
Dalian University of Technology , Dalian, China
Hongfei Lin
Tsinghua University , Beijing, China
Zhiyuan Liu
Tsinghua University , Beijing, China
Yang Liu

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Qiu, G., Guo, J., Yu, Z., Xian, Y., Mao, C. (2016). Using Collaborative Training Method to Build Vietnamese Dependency Treebank. In: Sun, M., Huang, X., Lin, H., Liu, Z., Liu, Y. (eds) Chinese Computational Linguistics and Natural Language Processing Based on Naturally Annotated Big Data. NLP-NABD CCL 2016 2016. Lecture Notes in Computer Science(), vol 10035. Springer, Cham. https://doi.org/10.1007/978-3-319-47674-2_8

Download citation

DOI: https://doi.org/10.1007/978-3-319-47674-2_8
Published: 10 October 2016
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-47673-5
Online ISBN: 978-3-319-47674-2
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Using Collaborative Training Method to Build Vietnamese Dependency Treebank

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Adapting Cross-Lingual Model to Improve Vietnamese Dependency Parsing

Iterative Integration of Unsupervised Features for Chinese Dependency Parsing

BERT-Based Sentence Recommendation for Building Vietnamese Universal Dependency Treebank

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Subscribe and save

Buy Now

Navigation

Using Collaborative Training Method to Build Vietnamese Dependency Treebank

Abstract

Access this chapter

Subscribe and save

Buy Now

Similar content being viewed by others

Adapting Cross-Lingual Model to Improve Vietnamese Dependency Parsing

Iterative Integration of Unsupervised Features for Chinese Dependency Parsing

BERT-Based Sentence Recommendation for Building Vietnamese Universal Dependency Treebank

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation