Text analytics is a field that lies on the interface of information retrieval,machine learning, and natural language processing, and this textbookcarefully covers a coherently organized framework drawn from these intersectingtopics. The chapters of this textbook is organized into three categories:
- Basic algorithms: Chapters 1 through 7 discuss the classical algorithmsfor machine learning from text such as preprocessing, similaritycomputation, topic modeling, matrix factorization, clustering,classification, regression, and ensemble analysis.
- Domain-sensitive mining: Chapters 8 and 9 discuss the learning methodsfrom text when combined with different domains such as multimedia andthe Web. The problem of information retrieval and Web search is alsodiscussed in the context of its relationship with ranking and machinelearning methods.
- Sequence-centric mining: Chapters 10 through 14 discuss varioussequence-centric and natural language applications, such as featureengineering, neural language models, deep learning, text summarization,information extraction, opinion mining, text segmentation, and eventdetection.
This textbook covers machine learning topics for text in detail. Since thecoverage is extensive,multiple courses can be offered from the same book,depending on course level. Even though the presentation is text-centric,Chapters 3 to 7 cover machine learning algorithms that are often used indomains beyond text data. Therefore, the book can be used to offercourses not just in text analytics but also from the broader perspective ofmachine learning (with text as a backdrop).
This textbook targets graduate students in computer science, as well as researchers, professors, and industrialpractitioners working in these related fields. This textbook is accompanied with a solution manual forclassroom teaching.
Description:
Machine Learning for Text by Charu C. Aggarwal
Text analytics is a field that lies on the interface of information retrieval,machine learning, and natural language processing, and this textbookcarefully covers a coherently organized framework drawn from these intersectingtopics. The chapters of this textbook is organized into three categories:
- Basic algorithms: Chapters 1 through 7 discuss the classical algorithmsfor machine learning from text such as preprocessing, similaritycomputation, topic modeling, matrix factorization, clustering,classification, regression, and ensemble analysis.
- Domain-sensitive mining: Chapters 8 and 9 discuss the learning methodsfrom text when combined with different domains such as multimedia andthe Web. The problem of information retrieval and Web search is alsodiscussed in the context of its relationship with ranking and machinelearning methods.
- Sequence-centric mining: Chapters 10 through 14 discuss varioussequence-centric and natural language applications, such as featureengineering, neural language models, deep learning, text summarization,information extraction, opinion mining, text segmentation, and eventdetection.
This textbook covers machine learning topics for text in detail. Since thecoverage is extensive,multiple courses can be offered from the same book,depending on course level. Even though the presentation is text-centric,Chapters 3 to 7 cover machine learning algorithms that are often used indomains beyond text data. Therefore, the book can be used to offercourses not just in text analytics but also from the broader perspective ofmachine learning (with text as a backdrop).
This textbook targets graduate students in computer science, as well as researchers, professors, and industrialpractitioners working in these related fields. This textbook is accompanied with a solution manual forclassroom teaching.