Learex: learning relationship extraction patterns from text based on typed dependencies

Patel, Chatali

2017

Abstract

An immense number of articles containing important information are published every day. We have developed a generalized text mining system that automatically extracts relationships between concepts from free text and presents them in a user-specified format. To train the system, the user supplies example sentences in which the entities of interest are annotated. The system uses the SPARQL query language as an interface to identify grammatical patterns in a sentence, which helps in extracting relationships. A curatorial system can be used to verify the extracted relationships. To improve performance, an additional module was developed that generates SPARQL query patterns from expert feedback collected by the curatorial system and adds them to the set of extraction patterns. Similar patterns are combined to reduce the overall number of distinct patterns and speed up the extraction process. Additionally, the module improves system accuracy over time.
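The idea of matching a grammatical pattern against a sentence's typed dependencies can be illustrated with a minimal sketch. This is not the thesis implementation, which the abstract and keywords associate with SPARQL and Jena; it is a plain-Python stand-in that mimics how a SPARQL basic graph pattern binds variables. The example sentence, the dependency labels (`nsubj`, `dobj`), and all names (`DEPENDENCIES`, `PATTERN`, `unify`, `match`) are assumptions made for illustration.

```python
from typing import Dict, List, Optional, Tuple

# A typed dependency as a triple: (governor, relation, dependent).
Triple = Tuple[str, str, str]
Binding = Dict[str, str]

# Hypothetical typed dependencies for the sentence "Aspirin inhibits COX-1."
DEPENDENCIES: List[Triple] = [
    ("inhibits", "nsubj", "Aspirin"),
    ("inhibits", "dobj", "COX-1"),
]

# Hypothetical extraction pattern. Terms starting with "?" are variables,
# loosely corresponding to a SPARQL query such as:
#   SELECT ?agent ?verb ?target
#   WHERE { ?verb :nsubj ?agent . ?verb :dobj ?target }
PATTERN: List[Triple] = [
    ("?verb", "nsubj", "?agent"),
    ("?verb", "dobj", "?target"),
]


def unify(pattern: Triple, fact: Triple, binding: Binding) -> Optional[Binding]:
    """Extend `binding` so that `pattern` equals `fact`, or return None."""
    extended = dict(binding)
    for term, value in zip(pattern, fact):
        if term.startswith("?"):
            # A variable must keep the value it was bound to earlier.
            if extended.setdefault(term, value) != value:
                return None
        elif term != value:
            # A constant must match the fact exactly.
            return None
    return extended


def match(pattern: List[Triple], facts: List[Triple]) -> List[Binding]:
    """Return every variable binding that satisfies all triples in `pattern`."""
    bindings: List[Binding] = [{}]
    for triple_pattern in pattern:
        next_bindings: List[Binding] = []
        for binding in bindings:
            for fact in facts:
                extended = unify(triple_pattern, fact, binding)
                if extended is not None:
                    next_bindings.append(extended)
        bindings = next_bindings
    return bindings


if __name__ == "__main__":
    for result in match(PATTERN, DEPENDENCIES):
        print(result["?agent"], result["?verb"], result["?target"])
```

Each returned binding corresponds to one candidate relationship (agent, verb, target), which a curator could then accept or reject. A real system would obtain the dependencies from a parser and would likely run the query through a SPARQL engine instead of this hand-written matcher.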

Details

Record ID

17149

Record Created

2024-12-05

Title

Learex: learning relationship extraction patterns from text based on typed dependencies

Author

Patel, Chatali

College or School

Computer Sciences

Date

2017

Publisher

University of Georgia

Content Type

Thesis

Language

English

Dissertation/Thesis Note

Graduate

Degree Type

Master of Science (MS)

Name of Granting Institution

University of Georgia, Summer 2017

Year Degree Granted

2017

Keywords

Text Mining, Natural Language Processing, Relationship Extraction, Parsing, Pattern Recognition, Ontology, Jena, SPARQL

System Control Number

9949333957702959
