Skip to content

First beta release of protein function embeddings

Pre-release
Pre-release

Choose a tag to compare

@rohitharavinder rohitharavinder released this 29 Mar 14:16
· 2 commits to main since this release
65e299d

This release corresponds to a thesis work that explores how information for protein functions can be exploited through embeddings so that the produced information can be used to improve protein function annotations. The underlying hypothesis here is that any pair of proteins with high sequence similarity will also share a similar biological function which would be reflected by the corresponding protein embeddings. The comparison and evaluation of this is done using two text-driven embedding approaches: Word2doc2Vec and Hybrid-Word2doc2Vec.