SAE Intelligence: Interpretable Genomic Features
Go beyond the score. See the exact biological features—exons, TF motifs, protein structures—that drive a prediction and understand *why* a variant is disruptive.
Why It Matters
- Transform black-box predictions into transparent, biologically-grounded stories
What We Delivered
- Clear biological explanations for every prediction
Core Capabilities
From feature attribution to activation steering
Feature Attribution (Live)
LIVETechnical
We simulate the extraction of active SAE features for a given sequence and calculate the change in log-likelihood (ΔLL) caused by a variant.
Scientific
Connects the model's internal logic to human-readable biological concepts (RUO).
Business
Use Cases
DynamicOracleExplain component. Prompt Safety (Live)
LIVETechnical
Detect low‑complexity repeats and other pathological attractors; flag viral/sensitive content (aligned with Forge safety gates).
Scientific
Reduces junk outputs and improves the reliability of generative demos.
Business
Use Cases
Activation Steering (Roadmap)
ROADMAPTechnical
Expose endpoints to nudge/target feature activations (e.g., chromatin patterns, motif presence) with compute‑aware beam search.
Scientific
Maps CrisPRO.ai‑style inference‑time scaling to controllable design objectives.
Business
Use Cases
Interactive Demonstrations
See SAE Intelligence in action
Feature Overlay Visualization
Toggle Features:
Genomic Sequence (43044290-43044450):
Feature Types:
Disruption Scores (ΔLL)
Exon Boundary
High Impact
TF Motif (AP-1)
High Impact
Secondary Structure
Medium Impact
Splice Site
Low Impact
Key Insight:
The ΔLL (Delta Log-Likelihood) score quantifies how much a variant disrupts each biological feature. Negative values indicate disruption, with more negative values showing greater impact.
Prompt Safety Checker
Key Benefits:
- • Prevents pathological inputs that could generate junk outputs
- • Flags low-complexity repeats and ambiguous sequences
- • Improves reliability of generative AI demonstrations
- • Provides clear suggestions for sequence improvement
Activation Steering (Roadmap)
Overall Progress
46% CompleteAP-1 Binding Sites
Transcription factor binding motifs
Open Chromatin
Accessible chromatin regions
Alpha Helix
Protein secondary structure
Roadmap Feature
Activation steering is currently in development. This demo shows the planned interface for controlling feature activations during generation, with compute-aware beam search and predictable quality scaling.
Planned Benefits:
- • Steer generation towards desired biological features
- • Predictable quality scaling with transparent controls
- • Compute-aware beam search for efficient generation
- • Auditable design process with clear provenance