Developing a Keyword Extractor and Document Classifier: Emerging Research and Opportunities

Dimple Valayil Paul

Indexed In: SCOPUS

DOI: 10.4018/978-1-7998-3772-5

ISBN13: 9781799837725 | ISBN10: 1799837726 | EISBN13: 9781799837732

Hardcover:

Available

$215.00

Benefits & Incentives

Benefits

Printed-On-Demand (POD)
Usually ships one day from order

E-Book:

Available

$215.00

Benefits & Incentives

Benefits

Multi-user license (no added fee)
Immediate access after purchase
No DRM
PDF download

E-Book:

Available

$215.00

Benefits & Incentives

Benefits

Immediate access after purchase
No DRM
PDF download
Receive a 10% Discount on eBooks

Hardcover + E-Book:

Available

$260.00

Benefits & Incentives

Benefits

Printed-On-Demand (POD)
Usually ships one day from order
Multi-user license (no added fee)
Immediate access after purchase
No DRM
PDF download

Hardcover + E-Book:

Available

$260.00

Benefits & Incentives

Benefits

Printed-On-Demand (POD)
Usually ships one day from order
Immediate access after purchase
No DRM
PDF download

OnDemand:

(Individual Chapters)

Available

$37.50

Benefits & Incentives

Benefits

Purchase individual chapters from this book
Immediate PDF download after purchase or access through your personal library

Effective immediately, IGI Global has discontinued softcover book production. The softcover option is no longer available for direct purchase.

Description & Coverage

Description:

The main problems that prevent fast and high-quality document processing in electronic document management systems are insufficient and unstructured information, information redundancy, and the presence of large amounts of undesirable user information. The human factor also has a significant impact on the efficiency of document search: an average user is not aware of the advanced options of a query language and relies on typical queries. A specialized software toolkit intended for information systems and electronic document management systems can be an effective solution to the problems listed above. Such toolkits should be based on methods of automatic keyword extraction and text classification. The categorization (or classification) of texts into predefined categories has attracted booming interest in the last 10 years due to the increased availability of documents in digital form and the ensuing need to organize them. Thus, research on keyword extraction, advancements in the field, and possible future solutions is of great importance today.

Developing a Keyword Extractor and Document Classifier: Emerging Research and Opportunities presents an information extraction mechanism that can process many kinds of input, recognize the type of text, and determine the percentage of keywords that has to be stored. This mechanism supports both information extraction and information categorization. It also supports a text summarization mechanism, which, with the help of the keyword extraction module, leads to text categorization. It employs lexical and information retrieval techniques to extract phrases from the document text that are likely to characterize it, and it determines the category of the retrieved text in order to present a summary to users. This book is ideal for practitioners, stakeholders, researchers, academicians, and students who are interested in the development of a new keyword extractor and document classifier method.
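The description does not state which algorithms the book uses, so the following is only a minimal sketch of the general pipeline it outlines (extract characteristic terms, then assign a category). It assumes TF-IDF as the information retrieval technique and a simple keyword-overlap rule as the classifier; the function names, the stop-word list, the sample corpus, and the category term sets are all hypothetical and chosen for illustration.

```python
import math
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption; real systems use fuller lists).
STOP_WORDS = {"the", "a", "an", "of", "and", "to", "in", "is",
              "for", "on", "with", "that", "are"}


def tokenize(text):
    """Lowercase the text and keep alphabetic tokens that are not stop words."""
    return [t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOP_WORDS]


def extract_keywords(doc, corpus, top_k=3):
    """Rank the terms of `doc` by TF-IDF against `corpus`; return the top_k terms."""
    tokens = tokenize(doc)
    term_counts = Counter(tokens)
    doc_sets = [set(tokenize(d)) for d in corpus]
    n_docs = len(corpus)
    scores = {}
    for term, count in term_counts.items():
        doc_freq = sum(1 for s in doc_sets if term in s)
        idf = math.log((1 + n_docs) / (1 + doc_freq)) + 1.0  # smoothed IDF
        scores[term] = (count / len(tokens)) * idf
    # Sort by descending score, then alphabetically so ties are deterministic.
    return sorted(scores, key=lambda t: (-scores[t], t))[:top_k]


def classify(keywords, category_terms):
    """Return the category whose term set overlaps most with the keywords.

    Returns None when no category shares any term with the keywords.
    """
    overlap = {cat: len(set(keywords) & terms)
               for cat, terms in category_terms.items()}
    best = max(sorted(overlap), key=lambda c: overlap[c])
    return best if overlap[best] > 0 else None


# Hypothetical sample data, used only to exercise the sketch.
corpus = [
    "The bank approved the loan and the interest rate on the loan.",
    "The team won the match after a late goal.",
    "Interest rates rose as the bank reviewed each loan.",
]
category_terms = {
    "finance": {"loan", "bank", "interest", "rate"},
    "sports": {"match", "goal", "team"},
}

keywords = extract_keywords(corpus[0], corpus)
print(keywords)                            # ['loan', 'approved', 'rate']
print(classify(keywords, category_terms))  # finance
```

The overlap rule stands in for whatever classifier a real system would train; it is used here only because it keeps the link between extracted keywords and the assigned category easy to see.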

Coverage:

The many academic areas covered in this publication include, but are not limited to:

Data Mining
Document Classification
Input and Output Design
Keyword Extraction
Menu Design
Performance Measures
System Analysis and Design
System Testing
Text Categorization
Text Mining

Editor/Author Biographies

Dimple Valayil Paul has worked as an Assistant Professor in the Department of Computer Science for 20 years.

Abstracting & Indexing

Archiving

All of IGI Global's content is archived via the CLOCKSS and LOCKSS initiatives. Additionally, all IGI Global published content is available on IGI Global's InfoSci® platform.