This paper is published in Volume-3, Issue-2, 2017
Area
Computer Engineering
Author
Gunjal Sonali Vishram, Prof. N. B. Kadu
Org/Univ
Pravara Rural Engineering College, Loni, Maharashtra, India
Pub. Date
14 April, 2017
Paper ID
V3I2-1504
Publisher
Keywords
Named Entity Recognition, SimConcept, Composite Mention, Genes, Disease.

Citations

IEEE
Gunjal Sonali Vishram, Prof. N. B. Kadu. A Hybrid System for Chemical Named Entity Simplification, International Journal of Advance Research, Ideas and Innovations in Technology, www.IJARIIT.com.

APA
Gunjal Sonali Vishram, Prof. N. B. Kadu (2017). A Hybrid System for Chemical Named Entity Simplification. International Journal of Advance Research, Ideas and Innovations in Technology, 3(2) www.IJARIIT.com.

MLA
Gunjal Sonali Vishram, Prof. N. B. Kadu. "A Hybrid System for Chemical Named Entity Simplification." International Journal of Advance Research, Ideas and Innovations in Technology 3.2 (2017). www.IJARIIT.com.

Abstract

One specific challenge in biomedical named entity recognition (NER) and normalization is the identification and resolution of composite named entities, where a single span refers to more than one concept (e.g., BRCA1/2). Previous NER and normalization studies have either neglected composite mentions, used simple rules, or handled only coordination ellipsis, so a robust approach for handling composite mentions is greatly needed. To this end, we propose a hybrid method that integrates a machine-learning model with a pattern identification strategy to identify the individual components of each composite mention. Our method, which we have named SimConcept, is the first to systematically handle many types of composite mentions. The method achieves high performance in identifying and resolving composite mentions for three key biological entities: genes (90.42% F-measure), diseases (86.47% F-measure), and chemicals (86.05% F-measure). The proposed SimConcept method can subsequently improve the performance of gene, disease, and chemical concept recognition and normalization. We observe that in our datasets approximately 10% of gene, disease, and chemical mentions are composite mentions; hence, it is important to handle them properly. This study presents a new method for simplifying bio-concept mentions in a systematic fashion.
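
To make the notion of a composite mention concrete, the following is a minimal sketch of the pattern-based side of such a system, covering only the slash-style case given in the abstract (BRCA1/2). It is not the SimConcept implementation, which combines a machine-learning model with pattern identification; the function name `expand_composite` and the regular expression are illustrative assumptions.

```python
import re
from typing import List

# Hypothetical pattern (an assumption, not taken from the paper): an
# alphabetic stem, a first numeric suffix, then one or more "/suffix"
# alternatives, e.g. "BRCA1/2" or "SMAD2/3/4".
_COMPOSITE = re.compile(r"^([A-Za-z]+)(\d+)((?:/\d+)+)$")


def expand_composite(mention: str) -> List[str]:
    """Return the individual mentions covered by a slash-style composite.

    Mentions that do not match the pattern are returned unchanged as a
    single-element list.
    """
    match = _COMPOSITE.match(mention)
    if match is None:
        return [mention]
    stem, first, rest = match.groups()
    # "rest" looks like "/2" or "/3/4"; drop the leading slash and split.
    suffixes = [first] + rest.strip("/").split("/")
    return [stem + suffix for suffix in suffixes]


if __name__ == "__main__":
    for text in ("BRCA1/2", "SMAD2/3/4", "TP53"):
        print(text, "->", expand_composite(text))
```

A single rule of this kind handles only one surface form. Composite mentions that rely on coordination or range expressions would need further patterns or a learned model, which is the motivation the abstract gives for a hybrid approach.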