This project was my senior design project for SE491 and SE492 at Iowa State.
We created a POS tagger specifically for software documentation by defining a new tag set, collecting new training data, and experimenting with various models, ultimately settling on a CRF, which gave the best results.
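For a rough sense of the final approach, here is a minimal sketch of CRF tagging with sklearn-crfsuite. The features, the example tag set (the `CODE` tag is made up here), and the training sentence are illustrative placeholders, not the project's actual configuration.

```python
# Minimal CRF tagging sketch (illustrative only; not the project's exact features or tags).
import sklearn_crfsuite

def token_features(sent, i):
    """Simple per-token features; the real feature set was tuned for software documentation."""
    word = sent[i]
    return {
        "lower": word.lower(),
        "suffix3": word[-3:],
        "is_digit": word.isdigit(),
        "is_capitalized": word[:1].isupper(),
        "is_mixed_case": word != word.lower() and word != word.upper(),
        "prev": sent[i - 1].lower() if i > 0 else "<BOS>",
        "next": sent[i + 1].lower() if i + 1 < len(sent) else "<EOS>",
    }

def sent_features(sent):
    return [token_features(sent, i) for i in range(len(sent))]

# Hypothetical training data; the real data lives in the data/ and training/ directories.
train_sents = [["Returns", "the", "hashCode", "of", "this", "object", "."]]
train_tags = [["VBZ", "DT", "CODE", "IN", "DT", "NN", "."]]  # CODE is a placeholder domain tag

crf = sklearn_crfsuite.CRF(algorithm="lbfgs", c1=0.1, c2=0.1, max_iterations=100)
crf.fit([sent_features(s) for s in train_sents], train_tags)
print(crf.predict([sent_features(["Returns", "the", "value", "."])]))
```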
Check out the project poster for the quickest way to learn about the project.
The project website is available here.
My role on the project was Computational Linguistics SME: I often led the design of our approach to problems such as the tag set, which model to use, and which parameters to use.
This was done in collaboration with:
- Ahmad Alramahi, [email protected]
- Austin Boling, [email protected]
- Joseph Naberhaus, [email protected]
- Ekene Okeke, [email protected]
- Ethan Ruchotzke, [email protected]
Autotagging contains the autotagger module used for automatically tagging JSON-formatted datafiles. The autotagger fills in any obvious missing tags (pure English words, numbers, etc.) and leaves the rest for manual tagging.
A small set of utilities used to check consistency between the Python and Java versions of Stanford NLP (Stanza vs. CoreNLP). These are not needed for the main project.
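As a rough illustration of the idea (the JSON layout and tag names below are assumptions, not the project's exact schema), the autotagger essentially does something like this:

```python
import json

def autotag_file(path):
    """Fill in only the unambiguous tags; leave everything else for the manual tagger."""
    with open(path, encoding="utf-8") as f:
        data = json.load(f)
    for sentence in data["sentences"]:           # assumed structure
        for token in sentence["tokens"]:
            if token.get("tag"):                 # already tagged, skip
                continue
            if token["text"].isdigit():
                token["tag"] = "CD"              # numbers are unambiguous
            # pure-English words could similarly be pre-tagged with a stock English tagger;
            # anything ambiguous stays untagged for the manual tagger
    with open(path, "w", encoding="utf-8") as f:
        json.dump(data, f, indent=2)
```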
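A minimal sketch of that kind of check, assuming a local CoreNLP install (pointed to by CORENLP_HOME) and stanza's client API; these are not the project's actual scripts:

```python
import stanza
from stanza.server import CoreNLPClient

text = "Returns the hash code value for this object."

# Tag with stanza's neural pipeline.
nlp = stanza.Pipeline("en", processors="tokenize,pos")
stanza_tags = [(w.text, w.xpos) for s in nlp(text).sentences for w in s.words]

# Tag the same text with the Java CoreNLP server.
with CoreNLPClient(annotators=["tokenize", "ssplit", "pos"], be_quiet=True) as client:
    ann = client.annotate(text)
    corenlp_tags = [(t.word, t.pos) for s in ann.sentence for t in s.token]

# Report any tokens where the two taggers disagree.
for (w1, t1), (w2, t2) in zip(stanza_tags, corenlp_tags):
    if t1 != t2:
        print(f"mismatch: {w1!r} stanza={t1} corenlp={t2}")
```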
The data directory contains all of the data used for training and our iterations of work. Here you can see examples of each type of data used in the first iteration of the project.
A small module used to convert JSON-formatted data into NLP-formatted data. It was scrapped in favor of the converter in NLPModel and is not recommended for use.
A small parser that attempted to parse Javadoc HTML directly. It is deprecated and was scrapped in favor of the universal HTML parser.
The manual tagger is a JavaFX GUI used to tag untagged data in a directory of JSON datafiles; it is the tool to use for any manual tagging. The patcher is also located here, and can apply a specified tag to every occurrence of a given token.
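The patcher itself is part of the JavaFX tool; purely as an illustration of the operation (with an assumed JSON layout), it amounts to something like:

```python
import json
import pathlib

def patch_token(data_dir, target_token, new_tag):
    """Apply new_tag to every occurrence of target_token across a directory of JSON datafiles."""
    for path in pathlib.Path(data_dir).glob("*.json"):
        data = json.loads(path.read_text(encoding="utf-8"))
        for sentence in data["sentences"]:       # assumed layout
            for token in sentence["tokens"]:
                if token["text"] == target_token:
                    token["tag"] = new_tag
        path.write_text(json.dumps(data, indent=2), encoding="utf-8")

# e.g. patch_token("data/tagged", "hashCode", "CODE")  # hypothetical directory and tag name
```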
The NLP model contains all tagging, training, and grading utilities for the pipeline. See the associated README for more information on how to use it.
A small temporary directory of miscellaneous files. The live-parser resides here, but is not especially useful (it was an offline version of the CoreNLP online parser).
The tokenization module for the pipeline. This is a Java module capable of tokenizing and sentence-splitting plaintext HTML input. It is one of the more important modules and is used within the universal HTML parser.
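Grading presumably boils down to comparing predicted tags against gold tags; a minimal per-token accuracy sketch (the tag names are placeholders):

```python
def token_accuracy(gold_sents, pred_sents):
    """Fraction of tokens whose predicted tag matches the gold tag."""
    correct = total = 0
    for gold, pred in zip(gold_sents, pred_sents):
        for g, p in zip(gold, pred):
            total += 1
            correct += (g == p)
    return correct / total if total else 0.0

gold = [["VBZ", "DT", "CODE", "."]]
pred = [["VBZ", "DT", "NN", "."]]
print(f"token accuracy: {token_accuracy(gold, pred):.2%}")  # 75.00%
```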
The training directory contains all of the data used for training the model, as well as the current iteration of the model. This directory is referenced by the NLP model application.
The universal HTML parser is located here; it takes URL inputs and produces tokenized JSON data for training and tagging. See the associated README for more information.
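The parser itself is a separate module (see its README); as a rough Python sketch of the same pipeline (fetch a URL, strip the HTML, tokenize and sentence-split, emit JSON), with an assumed output schema:

```python
import json

import requests
import stanza
from bs4 import BeautifulSoup

def url_to_json(url, out_path):
    """Fetch a page, extract its text, tokenize/sentence-split it, and write untagged JSON."""
    html = requests.get(url, timeout=30).text
    text = BeautifulSoup(html, "html.parser").get_text(" ")
    nlp = stanza.Pipeline("en", processors="tokenize")
    doc = nlp(text)
    data = {
        "source": url,
        "sentences": [
            {"tokens": [{"text": t.text, "tag": None} for t in s.tokens]}
            for s in doc.sentences
        ],
    }
    with open(out_path, "w", encoding="utf-8") as f:
        json.dump(data, f, indent=2)

# e.g. url_to_json("https://docs.oracle.com/javase/8/docs/api/java/lang/Object.html", "object.json")
```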