Writing a Comprehensive Self-Paced Roadmap for NLP with research papers

Writing a Comprehensive Self-Paced Roadmap for NLP with research papers, for my personal development.

Prerequisites:

  • Basic understanding of Python programming language

  • Familiarity with Machine and Deep Learning algorithms

  • Knowledge of libraries used in NLP such as Natural Language Toolkit (NLTK), spaCy, Core NLP, Te Blob, PyNLPI, Gensim, Pattern, etc

Text Preprocessing Level

  • Tokenization, Lemmatization, Stemming, Parts of Speech (POS), Stopwords removal, Punctuation removal

  • Understanding that textual data isn’t directly compatible with Machine Learning algorithms, so it needs to be preprocessed before feeding it into our Machine Learning models

  • Research Paper: “A Neural Probabilistic Language Model”

Advanced level Text Cleaning

  • Normalization, Correction of Typos

  • Mapping and Replacement. This involves mapping words to standardized language equivalents

  • Correction of Typos: Written text often contains errors

Research Paper: “A Unified Architecture for Natural Language Processing: Deep Neural Networks with Multitask Learning”

Language Modeling

Text Classification

Deep Learning for NLP

Advanced NLP Concepts

Attention Mechanisms

  • Understanding of attention mechanisms

  • Types of Attention Mechanisms: Scaled-Dot Product Attention Mechanism, Multi-Head Attention Mechanism

  • Research Paper: “Attention is All You Need”

State-of-the-Art NLP Models

NLP Libraries and Frameworks

NLP Projects

  • Text Classification, Sentiment Analysis, Chatbot Development.

  • Hands-on experience with NLP projects

  • Research Paper: “A Dataset for Document Grounded Conversations”.