Skip to main content

Introduction To Natural Language Processing

Human beings are the most intelligent as well as advance species on earth and the significant reason behind this success is "Power of Communication". Now this communication occurs there in whole world in the form of languages, about 65,000 different languages exist there in world. And this leads us to develop a new field in Artificial Intelligence naming Natural Language Processing(NLP). So stay tuned to understand more about  NLP

There are huge amount of data available out there in structured as well as unstructured format. According to industrial reports, there is only approx 21% of data is structured. The structured data is in the form of speech, texts, etc,. Messages often produced on social medias like Facebook, WhatsApp is highly unstructured in nature.

 

So in order to get powerful and informative insights from this data Text Mining and NLP comes into account.

Text Mining:  Text Mining or Text Analytics is nothing but a process of deriving a not only meaningful but also powerful insights from natural language text. It usually involves the process of structuring the input text and seeking patterns from the structured text and finally interpreting output from that pattern evaluation.

NLP: Natural Language Processing:

            NLP is a one of the subfield of Computer Science and Artificial Intelligence which deals with languages used by human. Essential goal of NLP is to derive high quality information from the human language and use it for performing various operations with Data Analytics. Deep Learning, Supervised and Unsupervised learning are widely used for analyzing human languages, but Machine Learning approach for this is not sufficient and thus NLP is important because it minimizes ambiguity in language and extract powerful information.

Like that Text Mining/ Analytics and NLP go hand in hand! Both of these technologies used as a unit for many applications like Investigative discovery, Subject-matter expertise, Social media analytics and many more.

 Components Of NLP:

Natural Language of Processing have two components naming Natural Language Understanding and Natural Language Generation.

Here Natural Language Understanding deals with deriving high quality insight by mapping it from natural language and then analyze those aspects of language. Contrary to Natural language Understanding, Natural Language Generation is a process of creating variety of syntactically as well as semantically meaningful sentences and phrases in the form of natural or human language from the some internal representation.

Out of these components Natural Language Understanding takes more time to perform its task as it have to process a lot for getting those powerful insights and it becomes more critical if you are not a human. As compared to this task of Natural Language Generation is bit easy.

Steps Involved in NLP:

Tokenization: Tokenization is very first step in NLP which breaks strings into various small structures or units which are useful for creating tokens. Consider a sentence "Tokenization is first step", now this sentence is then broken into different tokens as, "Tokenization", "is", "first", "steps" and these tokens can be used for further processing.

Stemming: Stemming usually deals with normalizing words into their base or root form. For example Normalize, Normalization, Normalized, etc. words are originated from root or base Normal. In this process root word is extracted by cutting out its suffixes and prefixes.

Lemmatization:  Whereas, Lemmatization takes account of morphological analysis of the words. Process of Lemmatization collect the different inflected forms of root called Lemma and then map several words into one root.

POS Tags: POS Tags are nothing but a tags assigned to Parts of Speech in a sentence. For example, "NLP is subfield of AI", here 'NLP' and 'AI' are noun so tag is NN. It determines how a word functions in meaning as well as in sentence.

Name Entity Recognition: NER is nothing but a process of recognizing entities from a sentence like organization, quantity, person, location, etc.

Chunking: Chunking basically means picking up individual small pieces of information and grouping them together into the large pieces also known as Chunk. It helps to get meaningful insights from available information.

All these steps can be executed in Python on the platform of pythonNLTK where NLTK is Natural Language Tool Kit library.

Applications of NLP:

                One of the most important application of NLP is Sentiment Analysis of Facebook or Twitter. This application tries to analyze sentiments behind the posts or hash tags, etc.

                The next application of NLP is getting popular these days is Chatbots. These chatbots are out there in the form of costumer chat services provided by various companies.

                Furthermore we have Speech Recognition as an application of NLP. Siri, Alexa, Google Assistance are the examples of voice assistance.

                Machine Translation is also an application of NLP, one of the example of this is Google Translator. And other applications of NLP are Spell Checking, Keyword Searching, etc.

So, I hope this article have introduced you well with Natural Language Processing. You have understood the various concepts like, NLP, Text Mining, steps for performing NLP, applications of NLP. I this article must be good kick start to your knowledge of Natural Language Processing.

 


Comments

Popular posts from this blog

What's the difference between Artificial Intelligence and Robotics

          What's the difference between      Artificial Intelligence and Robotics?                     Is Artificial Intelligence part of Robotics? OR vice versa? Are these terms same or different?   These kind of questions pop up when you listen about AI and Robotics, but don't worry you will get answer of these question after reading this blog so stay tuned! Artificial Intelligence and Robotics are actually two different fields. However, people often get them mixed up. But before getting into the depth of their difference, let's first understand what these terms are: Artificial Intelligence : Artificial Intelligence, etymologically means creating an intelligence (i.e. learning, understanding, etc. ) artificially. Whereas, technically Artificial Intelligence is an approach to make a computer, a robot, or a product to think how smart human think. Artificial Intelligenc...

Let's Get Into Machine Learning

     Hello and welcome back to the CosmoTechno - Dive Deep Into The Cosmos Of Technology! So get ready for diving deep into the cosmos of Machine Learning today. You might have been heard about this buzzword everywhere, but not really sure about what is it. Hold On! you don't have to worry about this as this blog will help you to get a valuable insight of Machine Learning. Before starting with any discussion let's have a look at the technical term of Machine Learning, Arthur Samuel says, "Machine Learning is field of study that gives computers the ability to learn without being explicitly programmed." If I rephrase the above definition it will be something like, feeding lots of experience to the machine so that it will come up with a strategy to solve a particular problem. Here experience is in the form of data which can be a text, images, videos, etc. On the basis of this data we provide there are four types of Machine Learning:        ...

Hello Deep Learning World!

Hello and welcome back to CosmoTechno! As blog title suggest, today you would be going to get a significant intuition of Deep Learning. In this blog you will be understanding how to perform Hello World project in Deep Learning. Recognizing Handwritten Digits is one of the most famous as well as beginner friendly project of Deep Learning. If your completely newbie or you are not familiar with basic concept of Deep Learning then I will recommend  you to read the previous blog on deep learning. In this blog I am going to perform Handwritten digit recognition using Convolutional Neural Network(CNN) which we will implement with the help of Tensorflow, Keras libraries. Before getting into hands on coding part we will try to understand how CNN works. CNN is a one of the significant Neural Network which is commonly used in Deep Learning. In image recognition or classification task CNN divide each image into very small boxes and try to get some meaningful patterns from the same and store ...