RNA Structure & Ligand Analysis Toolkit

This toolkit provides a collection of automated Python scripts for RNA structure processing, sequence/ligand feature extraction, and batch analysis.

Requirements

Install the core dependencies with:

pip install -r requirements.txt

Note:

For 3D visualization and file conversion, you may also need to install PyMOL and OpenBabel as system tools (conda install -c conda-forge openbabel pymol-open-source).

For parallel processing, joblib is used. RDKit is best installed via conda if possible.

📁 Tool Overview

File Name	Brief Description
`remove_h_pymol.py`	Remove hydrogen atoms from PDB files using PyMOL.
`merge_rna_ligand.py`	Merge separate RNA and ligand PDB files into a single structure.
`generate_contact_map.py`	Calculate RNA–ligand contact maps based on atomic distances.
`split_rna.py`	Split PDB/CIF structures into RNA, ligand, and protein components.
`convert_cif2pdb.py`	Convert mmCIF to PDB format, keeping chain info.
`separate_rna_and_small_molecules_no_water_ions_residues_6Angs_CSVfile.py`	Extract RNA and small molecules (exclude water/ions) within 6Å. Batch processing via CSV.
`AddH_pymol_save_pdbqt_obabel_parallel_HPC.py`	Add hydrogens & convert to PDBQT using PyMOL + OpenBabel, parallel HPC support.
`get_sequences.py`	Extract RNA sequences (FASTA) or ligand SMILES from structures.
`AddH_pymol.py`	Add hydrogens to PDB structure via PyMOL.
`get_ligands_smiles_fingerprint.py`	Extract ligand SMILES & generate RDKit fingerprints.

Pipeline Overview

These tools can be combined for a full RNA-ligand structural workflow:

Split complex structures: split_rna.py
Standardize structures: remove_h_pymol.py → AddH_pymol.py
Merge components: merge_rna_ligand.py
Generate features: generate_contact_map.py, get_sequences.py, get_ligands_smiles_fingerprint.py
Batch/high-throughput: Use parallel scripts for scaling (AddH_pymol_save_pdbqt_obabel_parallel_HPC.py, CSV-driven tools).

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
test		test
.gitignore		.gitignore
AddH.sh		AddH.sh
AddH_pymol.py		AddH_pymol.py
AddH_pymol_save_pdbqt_obabel_parallel_HPC.py		AddH_pymol_save_pdbqt_obabel_parallel_HPC.py
README.md		README.md
__init__.py		__init__.py
cif2pdb.sh		cif2pdb.sh
convert_cif2pdb.py		convert_cif2pdb.py
generate_contact_map.py		generate_contact_map.py
generate_contact_map.sh		generate_contact_map.sh
get_ligands_smiles_fingerprint.py		get_ligands_smiles_fingerprint.py
get_ligands_smiles_fingerprint.sh		get_ligands_smiles_fingerprint.sh
get_sequences.py		get_sequences.py
get_sequences.sh		get_sequences.sh
hariboss_20240621_error_revised_rna.fasta		hariboss_20240621_error_revised_rna.fasta
hariboss_20240621_rna_sequence.fasta		hariboss_20240621_rna_sequence.fasta
merge_rna_ligand.py		merge_rna_ligand.py
merge_rna_ligand.sh		merge_rna_ligand.sh
missing_rna.fasta		missing_rna.fasta
remove_h.sh		remove_h.sh
remove_h_pymol.py		remove_h_pymol.py
requirements.txt		requirements.txt
rrna_shorter_than_621.csv		rrna_shorter_than_621.csv
separate_rna_and_small_molecules_no_water_ions_residues_6Angs_CSVfile.py		separate_rna_and_small_molecules_no_water_ions_residues_6Angs_CSVfile.py
similarity_results.csv		similarity_results.csv
split_rna.py		split_rna.py
submit_job.sh		submit_job.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RNA Structure & Ligand Analysis Toolkit

Requirements

📁 Tool Overview

Pipeline Overview

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RNA Structure & Ligand Analysis Toolkit

Requirements

📁 Tool Overview

Pipeline Overview

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages