Omics-OS Docs
Case Studies

Drug Resistance: Overcoming the BCR-ABL T315I Gatekeeper in CML

From target validation to resistance pharmacogenomics — how Lobster AI analyzes the canonical drug resistance story in chronic myeloid leukemia.

Imatinib transformed chronic myeloid leukemia from a fatal diagnosis into a manageable chronic condition — then resistance mutations emerged. The BCR-ABL T315I "gatekeeper" mutation is the canonical drug resistance story in precision oncology: every generation of kinase inhibitor failed against it until ponatinib introduced a single structural innovation. This case study follows Lobster AI through a complete resistance analysis — target validation, mutation pharmacogenomics with ESM2, molecular comparison, and cardiovascular safety assessment — in three conversational turns.

Session context: Results generated February 2026 using lobster-ai 1.0.12 on AWS Bedrock (Claude Sonnet 4.5). External databases queried: Open Targets, ChEMBL, PubChem. Local models: ESM2 (protein language model), RDKit (molecular descriptors). Total cost: $1.85 across 3 turns (376,155 tokens). Database content changes over time — re-running these queries will return different results reflecting updated bioactivity records and compound registrations. This case study demonstrates an analytical workflow, not independently validated scientific findings.

Agents and Data Sources

This analysis uses the lobster-drug-discovery package, which provides four agents:

AgentRole
drug_discovery_expertTarget validation, druggability scoring, disease association analysis
cheminformatics_expertMolecular property calculation, Lipinski profiling, structural similarity
clinical_dev_expertClinical trial landscape, safety profile, regulatory pathway assessment
pharmacogenomics_expertResistance mutation analysis, ESM2 variant impact scoring, PGx strategy

External APIs queried during the session: Open Targets (disease associations, druggability), ChEMBL (bioactivity, compound ranking), PubChem (structural similarity, cross-validation). Local computation is handled by RDKit (molecular descriptors, Lipinski) and ESM2 (protein language model for variant impact scoring).

The Research Question

How does ponatinib overcome the T315I gatekeeper mutation, and what structural features create both its therapeutic advantage and cardiovascular liability?

This question spans target biology, resistance pharmacogenomics, structural chemistry, and safety pharmacology — domains that typically require separate teams, tools, and days of manual work. Lobster AI handles them in a single session.


Turn 1: Target Validation and Competitive Landscape

The first query establishes BCR-ABL1 as a validated target and maps the competitive landscape of approved ABL1 inhibitors.

lobster query --session-id cml_study \
  "Validate BCR-ABL1 (ENSG00000097007) — show disease associations \
   and druggability. Then find the top ABL1 inhibitors from ChEMBL \
   (target CHEMBL1862) ranked by IC50."

Disease Association and Druggability

The drug_discovery_expert queried Open Targets and returned a druggability score of 0.793/1.0 (high confidence), with CML as the top disease association at 0.818/1.0.

Evidence ComponentScore
Known Drug Evidence0.99
Genetic Association0.85
Literature Support0.99
CML Disease Association0.818

Additional disease associations include acute lymphoblastic leukemia (0.703) and blast phase CML (0.574).

Competitive Landscape

The agent queried ChEMBL target CHEMBL1862 and ranked all ABL1 inhibitors with IC50 data. Five approved drugs span a 120-fold potency range reflecting 20 years of iterative medicinal chemistry optimization driven by resistance mutations.

RankDrugGenerationIC50 (nM, biochemical)Notes
1Ponatinib3rd0.32Most potent, designed for T315I
2Dasatinib2nd0.576x more potent than imatinib
3Bosutinib2nd0.5Exceptional binding (Kd = 0.029 nM)
4Imatinib1st38Reference standard (2001)
5Nilotinib2nd38Similar potency, improved pharmacokinetics

IC50 values represent the most potent biochemical kinase assay reported in ChEMBL bioactivity records. Cellular IC50 values are typically higher. The 120-fold potency range across generations is consistent regardless of assay type.

The agent identified 87 unique ABL1 inhibitors with IC50 data from ChEMBL bioactivity records — the complete competitive landscape for this target, ranked programmatically rather than assembled from literature reviews.


Turn 2: Resistance Mutations and the Ethynyl Linker

The second query is the scientific centerpiece: scoring three clinically important resistance mutations with ESM2, then comparing the molecular basis for why ponatinib overcomes T315I while imatinib cannot.

lobster query --session-id cml_study \
  "Score the variant impact of ABL1 T315I, E255K, and Y253H. \
   Run mutation pattern analysis across all three. Then compare \
   imatinib vs ponatinib molecular properties with Lipinski assessment."

Variant Impact Scoring

The pharmacogenomics_expert loaded the ESM2 protein language model and scored each mutation's structural impact alongside clinical resistance data.

Fold-resistance is the ratio of mutant IC50 to wild-type IC50 — a value of 125x means the drug requires 125 times more concentration to achieve the same inhibition of the mutant kinase.

MutationLocationESM2 Log-Likelihood RatioClinical FrequencyFold-Resistance to Imatinib
T315IGatekeeperN/A (sequence length limit)15-20% (highest)125-250x
Y253HP-loop-2.75 (most deleterious)3-8%20-100x
E255KP-loop-1.10 (deleterious)5-10%10-50x

ESM2 ranked Y253H as the most structurally deleterious P-loop mutation (-2.75 vs -1.10 for E255K). However, T315I has the highest clinical priority: it occurs in 15-20% of resistant patients and confers resistance to all first- and second-generation TKIs. The P-loop mutations (Y253H, E255K) cluster at positions 253-255 and are likely mutually exclusive, while T315I is structurally independent and can co-occur as compound mutants in 2-5% of cases (poor prognosis).

Imatinib vs Ponatinib: Molecular Properties

The cheminformatics_expert computed molecular descriptors with RDKit and cross-validated against PubChem.

PropertyImatinibPonatinibImpact
Molecular Weight493.6 Da532.6 Da1 Lipinski violation (acceptable for oncology)
LogP4.594.46Both within acceptable lipophilicity range
TPSA86.3 angstroms squared65.8 angstroms squared (-24%)Better membrane permeability
Rotatable Bonds74 (-43%)Greater rigidity, reduced entropy penalty
Ro5 Violations01Both acceptable drug-like profiles
Oral Bioavailability~98%~60%Offset by 20-100x higher potency
IC50 (wild-type ABL1)38 nM0.32 nMPonatinib approximately 100x more potent
IC50 (T315I mutant)5,000-10,000 nM2-11 nMPonatinib retains potency; imatinib loses 125-250x

The Ethynyl Linker: 0.4 Angstroms That Changed Everything

The agent identified ponatinib's carbon-carbon triple bond (ethynyl) linker as the structural innovation that overcomes the T315I gatekeeper.

FeatureImatinib (NH linker)Ponatinib (ethynyl linker)
Bond Geometry~120 degrees (flexible)180 degrees (linear, rigid)
Linker Width3.8 angstroms3.4 angstroms
H-bond to T315Required for bindingNot required
Gatekeeper Clearance (I315)3.2 angstroms — blocked3.6 angstroms — clears

A 0.4 angstrom reduction in linker width lets ponatinib pass the bulky isoleucine gatekeeper. The linear 180-degree geometry minimizes steric clash, and the elimination of hydrogen bond dependency makes binding agnostic to the gatekeeper residue identity. This is a textbook example of structure-based drug design overcoming biological resistance. Linker dimensions were computed by RDKit from molecular geometry, consistent with crystallographic binding mode analysis of imatinib-ABL1 and ponatinib-ABL1 co-crystal structures.


Turn 3: Safety, Similarity, and the Double-Edged Sword

The third query brings the analysis full circle: the same structural feature that solves resistance creates a new clinical problem.

lobster query --session-id cml_study \
  "Search PubChem for compounds structurally similar to ponatinib \
   (Tanimoto > 75%). Predict ADMET for ponatinib — particularly \
   cardiovascular safety. Cross-validate ponatinib MW from PubChem."

PubChem Cross-Validation

Molecular weight was confirmed at 532.6 Da across RDKit computation and PubChem records (CID 24826799, formula C29H27F3N6O), with TPSA consistent at 65.8 angstroms squared.

The agent searched PubChem for compounds with greater than 75% Tanimoto similarity to ponatinib. Among 20 hits, one analog stood out.

CompoundIdentityMW (Da)LogPCardiovascular Risk ProfileT315I PotentialPriority
CID 644241Ponatinib F-analog529.54.9Very highExcellentTesting analog
CID 5291Imatinib493.63.5ModerateHighBalanced profile
CID 216210Low-lipophilicity scaffold471.51.7LowUnknownHighest priority

The similarity search independently recovered imatinib (CID 5291) as a structural neighbor of ponatinib, validating the Tanimoto approach.

CID 216210 shows a 62% reduction in lipophilicity (LogP 1.7 vs 4.46) while maintaining greater than 75% structural similarity — flagged as the highest-priority analog for reducing cardiovascular risk while potentially retaining T315I potency.

ADMET Assessment vs Clinical Reality

The cheminformatics_expert assessed ADMET properties from ponatinib's molecular structure. Every major parameter assessment matched published clinical data.

ParameterAssessmentClinical RealityMatch
hERG LiabilityModerate-HighQT prolongation warningYes
HepatotoxicityModerate29% elevated liver function testsYes
BBB PenetrationLimitedLow CNS adverse effectsYes
CYP3A4 SubstrateYesConfirmedYes
Half-life24-40 hours24 hoursYes
Oral Absorption60-70%Approximately 60%Yes
Volume of DistributionHigh1,223 LYes

The Double-Edged Sword

The agent identified five structural features that explain ponatinib's FDA black box warning for arterial occlusive events:

  1. High lipophilicity (LogP 4.46): Exceeds the optimal 2.5-3.5 range by 1-2 log units, causing vascular tissue accumulation
  2. Basic tertiary amine (pKa approximately 8.2): The N-methylpiperazine is 85% protonated at physiological pH, causing ion trapping in acidic endothelial lysosomes
  3. Insufficient kinase selectivity: BCR-ABL IC50 of 0.4 nM vs VEGFR-2 of approximately 1.5 nM — only 3-4x selectivity, causing off-target endothelial dysfunction
  4. Long half-life (24 hours): Sustained vascular kinase inhibition with no recovery window
  5. 99% plasma protein binding: Enhanced vascular wall deposition

The ethynyl linker that solves T315I resistance also enables multi-kinase binding (VEGFR-2, PDGFR), which drives the cardiovascular toxicity. The same structural feature is both the therapeutic advantage and the safety liability — and the agent identified this mechanistic link from molecular properties alone.

Arterial occlusive events show a dose-response relationship: approximately 25-35% at 45 mg in the PACE trial (5-year follow-up), with significantly lower rates demonstrated in the OPTIC trial using response-based dose reduction. The current recommended approach is to start at 45 mg and reduce upon molecular response.


What This Demonstrates

Multi-Agent Coordination

No single agent could produce this analysis. The drug_discovery_expert handled target validation and disease associations via Open Targets. The pharmacogenomics_expert loaded ESM2 for variant impact scoring and analyzed mutation patterns. The cheminformatics_expert computed molecular descriptors with RDKit, ran PubChem similarity searches, and predicted ADMET properties. The supervisor routed each sub-question to the appropriate specialist and synthesized results across all three turns.

Database Integration

The agents queried Open Targets, ChEMBL, and PubChem programmatically through validated API tools — not through LLM approximation. ESM2 ran locally for protein-level variant scoring. Molecular weight cross-validation between RDKit and PubChem confirmed 532.6 Da with exact agreement. The structure-based assessment matched clinical data for 7 of 7 parameters.

Provenance and Reproducibility

Every tool call is logged with an AnalysisStep intermediate representation that captures the operation, parameters, data sources, and outputs. The session can be reproduced or extended with --session-id.

Comparison

Estimates based on this case study session. Human researcher timing assumes manual database queries without automation.

TaskHuman ResearcherRaw LLMLobster AI
Validate target across Open Targets20-30 minCannot access API~1 min
Rank 87 compounds by IC50 from ChEMBL30-45 minCannot access API~1 min
Score 3 resistance mutations with ESM22-4 hours (install, write scripts)Approximates, no model~2 min
Compare 2 drugs with Lipinski and ADMET15-20 min (multiple tools)Hallucinates properties~1 min (RDKit local)
Find 20 structural analogs from PubChem15-20 minCannot search~1 min
Cross-validate MW across databases5-10 minUnreliableAutomatic
Identify cardiovascular mechanism from ADMET1-2 hours (literature + computation)Generic, no scoringStructure-based, 7/7 match
Total for complete 3-turn analysis1-2 daysNot reliable~5 min, under $2.00

Limitations

  • IC50 values are assay-dependent. The competitive landscape uses biochemical IC50 values from ChEMBL. Cellular IC50 values, which better reflect clinical potency, can differ by 5-50x.
  • Ethynyl linker dimensions are computational estimates. The 3.4 and 3.8 angstrom linker widths were computed by RDKit, not measured crystallographically. Exact pocket dimensions depend on the DFG motif conformational state.
  • ESM2 sequence length constraint. T315I could not be scored due to the model's input length limit. Per-residue language model scores capture structural disruption but not clinical resistance frequency or multi-drug cross-resistance.
  • Arterial occlusive event rates vary by study. Published AOE rates for ponatinib differ across PACE, OPTIC, and real-world studies depending on follow-up duration and dose management strategy.
  • ADMET assessments reflect historical data. Ponatinib is well-represented in public ADMET training datasets (ChEMBL, PubChem, Tox21). The 7/7 parameter match demonstrates the platform's ability to rapidly retrieve and contextualize known safety profiles, which is valuable for drug profiling. For truly novel scaffolds not represented in training data, prediction accuracy will be lower.

Reproducibility

To reproduce this analysis, install the drug discovery package and run the three turns sequentially:

pip install 'lobster-ai[full]==1.0.12'
lobster query --session-id cml_study \
  "Validate BCR-ABL1 (ENSG00000097007) — show disease associations \
   and druggability. Then find the top ABL1 inhibitors from ChEMBL \
   (target CHEMBL1862) ranked by IC50."
lobster query --session-id cml_study \
  "Score the variant impact of ABL1 T315I, E255K, and Y253H. \
   Run mutation pattern analysis across all three. Then compare \
   imatinib vs ponatinib molecular properties with Lipinski assessment."
lobster query --session-id cml_study \
  "Search PubChem for compounds structurally similar to ponatinib \
   (Tanimoto > 75%). Predict ADMET for ponatinib — particularly \
   cardiovascular safety. Cross-validate ponatinib MW from PubChem."

Session continuity via --session-id ensures each turn builds on prior context. Results are stored in the .lobster_workspace/ directory and can be exported with /pipeline export.


On this page