forked from mothur/mothur
-
Notifications
You must be signed in to change notification settings - Fork 1
Project Schedule
darthxaher edited this page Jun 18, 2012
·
33 revisions
- Become Familiar with Microbial Ecology Terms
- Got mothur up and running with Xcode and Gfortran in my Mac OS X setup
- Follow mothur tutorial from the wiki and get to know the workflow
- Create a private git branch for mothur in Github
- Take a closer look at the R implementation
- Create the initial schematics for the command line programs that would be added to mothur ('train' and 'inquire')
- Go through text books to make sure my knowledge of the classifier algorithms are clear once again
- Register for Machine Learning Course at Coursera
- Set up Wiki, Issue Tracker and other stuffs in GitHub page
- Determine Performance Evaluation Criteria - Issue #1
- Prepare Datasets for The Random Forest Algorithm - Issue #2
- Classification/Regression or Both! - Issue #3
- Make a List of Reusable Classes/Functions from Mothur - Issue #4
- Continuing the discussion on all the open issues
- Find Mothur's Common Practices - Issue #5
- Find a Way to Merge 'train' and 'inquire' Commands Into One Single Command - Issue #6
- Investigate Mothur's Multithreading/Multiprocessing API - Issue #7
- Investigate The Possible Places of Parallelization in the Random Forest Classifier - Issue #8
- Create Code Schematics/Pseudocode for the Random Forest Implementation - Issue #9
- Investigate the Best Practices for Parameter Tuning/Estimation for Random Forest - Issue #10
- Continuing the discussion on all the open issues
Create a shared folder in file sharing service like DropBox and share it with mentor Kathryn and Sarah, put all the papers/resources together that we come across.
- Review the Literature of 'Parameter Selection' and Create a wiki Page to Document the Findings - Issue #11
- Continuing the discussion on all the open issues
Review the article titled An Introduction to Variable and Feature Selection (Isabelle Guyon, André Elisseeff - 2003) as per Issue #11
- Review the article titled A Review of Feature Selection Methods on Synthetic Data (Verónica Bolón-Canedo, Noelia Sánchez-Maroño, Amparo Alonso-Betanzos - 2012) as per Issue #11
- Take a Deeper Review on the Paper "Feature Selection via Regularized Trees" - Issue #13
- Continuing the discussion on all the open issues
Milestone 1 Reached: Preliminary investigation complete
- Prepare Datasets for The Random Forest Algorithm - Issue #2
- Week 5: Implement the Basic Building Blocks of Random Forest Algorithm - Issue #14
- Week 6: Implement the Core Part of Random Forest Algorithm - Issue #15
- Week 7: Finish the Implementation of Random Forest Algorithm - Issue #16
- Week 8: Start of Parameter Tuning & Preliminary Evaluation of the Implementation - Issue #17
- Choosing a Good Random Number Generator (RNG) a.k.a "Treading Into the Mucky Waters of RNG" - Issue #18
- Week 9: Implement Regularized Radom Forest Framework on top of Current Random Forest Implementation and Tune Parameters - Issue #19
Status:
Milestone 3 Reached
Status:
Status:
Status: