This document describes how ebolaseq produces the optional GP mAb escape report: from GP alignment through epitope scoring to the files written in Escape/.
Alignment/MAFFT/GP/protein_aln.fasta.Escape/ (HTML, Excel, verification files).Makona 2014 (KJ660346.2) is the reference frame because the licensed mAbs were tested against this outbreak strain (watchlist sites match the identical UniProt Q05320 / Mayinga-76 numbering).
Extract GP CDS, translate, then MAFFT all proteins into one alignment. Gaps in the MSA can shift position numbers (see step 3).
Each literature epitope site has a Makona residue number (e.g. 116). Because MAFFT adds gap columns, each number is mapped to the correct MSA column via an anchor sequence. Output: gp_epitope_q05320_msa_map.tsv.
Per isolate at each mapped site vs Makona zaire_aa:
| Status | Meaning |
|---|---|
same |
matches Makona |
escape |
matches a catalogued escape mutation |
changed |
different, not in catalogue |
gap |
missing or unmapped |
Grantham distance for substitutions. Per therapy: contact-site conservation, epitope-change count, escape matches, max Grantham. In silico concern flags per cocktail component. Catalogue: MAB_ESCAPE_EPITOPES.md.
| File | Content |
|---|---|
gp_mab_escape_report.html |
Interactive report |
mab_escape_data.xlsx |
Tables for R/Excel |
gp_protein_aln.fasta |
GP MSA |
makona_gp_mature_reference.fasta |
Makona baseline |
gp_epitope_q05320_msa_map.tsv |
Position ↔ column map |
changed.gp_epitope_q05320_msa_map.tsv.zaire_aa, including non-Zaire species, for consistent numbering.