Skip to content

Natural language processing

Islam Sediek Tolan edited this page Mar 21, 2015 · 10 revisions

Summary

      ### Using (tm Package and NLP package) to answer  user's questions### 

Description

      ### We will read a question from a web interface using (tm Package) then use (NLP Package) to break the question into tokenizers and convert it to matrix. then we can train a model to predict score using machine learning algorithm . finally use a UIMA to discover Knowledge to understand the question and search a database for the answer. This system can be used to build intelligent expert system in various fields ### 

Related work

      ### This system can be a good beginning to more intelligent expert systems ### 

Potential tasks

     ### 1- Create a web interface to enter the questions.### 
     ### 2- Use (tm package)to read the question ### 
     ### 3- Use (NLP package) to convert the question to tokenizers then the matrix finally train the model and  predict the score ### 
     ### 4- Use UIMA to discover Knowledge ###  
     ### 5- Create database that will save answers for the questions ### 

Skills required

     ### a very good programming experience in R language and Java ### 
     ### Good programming Knowledge in HTML ### 
     ### Database and SQL ### 
     ### Knowledge about tm package and NLP package ### 

Test

    ###  R has built in linear regression Function lm().it take a formula that denote the target and the predictor variables and output a model object . take an e.g. of simple linear regression model analysis on the cars data set , we consider the horse power as predictor variable to predict the car price so the powerful car has the higher price . visualize this property in scatter plot and perform the regression with the formula (price~horsepower).Observe the model coefficient by printing the model object ###           
Clone this wiki locally