A Feature-Rich Vietnamese Named-Entity Recognition Model - ArXiv
Có thể bạn quan tâm
Abstract:In this paper, we present a feature-based named-entity recognition (NER) model that achieves the start-of-the-art accuracy for Vietnamese language. We combine word, word-shape features, PoS, chunk, Brown-cluster-based features, and word-embedding-based features in the Conditional Random Fields (CRF) model. We also explore the effects of word segmentation, PoS tagging, and chunking results of many popular Vietnamese NLP toolkits on the accuracy of the proposed feature-based NER model. Up to now, our work is the first work that systematically performs an extrinsic evaluation of basic Vietnamese NLP toolkits on the downstream NER task. Experimental results show that while automatically-generated word segmentation is useful, PoS and chunking information generated by Vietnamese NLP tools does not show their benefits for the proposed feature-based NER model.
| Comments: | 12 pages, pre-print version of CICLing 2018 paper |
| Subjects: | Computation and Language (cs.CL) |
| Cite as: | arXiv:1803.04375 [cs.CL] |
| (or arXiv:1803.04375v1 [cs.CL] for this version) | |
| https://doi.org/10.48550/arXiv.1803.04375 Focus to learn more arXiv-issued DOI via DataCite |
Submission history
From: Quang Nhat Minh Pham Mr [view email] [v1] Mon, 12 Mar 2018 17:07:40 UTC (26 KB) Full-text links:Access Paper:
- View a PDF of the paper titled A Feature-Rich Vietnamese Named-Entity Recognition Model, by Pham Quang Nhat Minh
- View PDF
- TeX Source
References & Citations
- NASA ADS
- Google Scholar
- Semantic Scholar
DBLP - CS Bibliography
listing | bibtex Pham Quang Nhat Minh export BibTeX citation Loading...BibTeX formatted citation
× loading... Data provided by:Bookmark
- Author
- Venue
- Institution
- Topic
arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.
Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.
Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.
Which authors of this paper are endorsers? | Disable MathJax (What is MathJax?)Từ khóa » Phạm Quang Nhật Minh
-
Pham Quang Nhat MINH | Ph.D | Aimesoft Multimodal AI (AIMAI) Lab
-
Aimesoft - Tiến Sĩ Phạm Quang Nhật Minh Là Chuyên Gia Xử...
-
Pham Quang Nhat Minh - Director Of Aimesoft Multimodal AI Lab ...
-
TS. Phạm Quang Nhật Minh - FSB
-
Pham Quang Nhat Minh | Papers With Code
-
Pham Quang Nhat Minh - DBLP
-
Home - Bs Phạm Quang Nhật - Phó Trưởng Khoa Bv Từ Dũ
-
Pham Quang Nhat Minh Minhpqn - GitHub
-
Giảng Viên (Copy) - Red Cat Academy
-
Phạm Quang Nhật Minh (@baymax_bigboiii1423) • Instagram ...
-
Tin Tức Bài Viết Mới Nhất Về: PHẠM QUANG NHẬT MINH