Skip to content

Natural language processing

Islam Sediek Tolan edited this page Mar 21, 2015 · 10 revisions

Summary

      ### Using (tm Package and NLP package) to answer  user's questions. 

Description

      ### We will read a question from a web interface using (tm Package) then use (NLP Package) to break the question into tokenizers and convert it to matrix. then we can train a model to predict score using machine learning algorithm . finally use a UIMA to discover Knowledge to understand the question and search a database for the answer. This system can be used to build intelligent expert system in various fields.###  

Related work

      ### This system can be a good beginning to more intelligent expert systems.###  

Potential tasks

     ### 1- Create a web interface to enter the questions.
     ### 2- Use (tm package)to read the question. 
     ### 3- Use (NLP package) to convert the question to tokenizers then the matrix finally train the model and  predict the score.  
     ### 4- Use UIMA to discover Knowledge.   
     ### 5- Create database that will save answers for the questions.  

Skills required

     ### a very good programming experience in R language and Java. 
     ### Good programming Knowledge in HTML. 
     ### Database and SQL. 
     ### Knowledge about tm package and NLP package. 

Test

    ###  R has built in linear regression Function lm().it take a formula that denote the target and the predictor variables and output a model object . take an e.g. of simple linear regression model analysis on the cars data set , we consider the horse power as predictor variable to predict the car price so the powerful car has the higher price . visualize this property in scatter plot and perform the regression with the formula (price~horsepower).Observe the model coefficient by printing the model object.           
Clone this wiki locally