id-prop-extractor

A simiple database IDs and biochemical properties extractor for compounds in SMILES format using the pubchempy API.

Author: Siddharth Yadav (syntax-surgeon)

Description of files:

main.py

Contains the main program which prompts the user to provide a path to a smiles files and extracts PubChem, ZINC, CHEBI, CHEMBL & BDBM (BindingDB) ids
REGEX is utilized to extract database ids from the synonyms column in the associated PubChem profile
Several properties included in the PubChem profile of the compound can also be extracted
The data is written to a file named 'molecular_properties.txt' in the directory from where the script was run
Compounds not found are written as '***NO-COMPOUND-FOUND***' in the 'molecular_properties.txt' file
Ctrl+C can be utilized to quit/pause the script

test.py

Tests for the appropriate and intended functionality of the pubchempy API
Uses three test cases based on the name, molecular formula and Inchi-Key of the drug 'Atorvastatin'

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
smiles_bank		smiles_bank
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
test.py		test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

id-prop-extractor

Author: Siddharth Yadav (syntax-surgeon)

Description of files:

main.py

test.py

About

Uh oh!

Releases

Packages

Languages

License

syntax-surgeon/id-prop-extractor

Folders and files

Latest commit

History

Repository files navigation

id-prop-extractor

Author: Siddharth Yadav (syntax-surgeon)

Description of files:

main.py

test.py

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages