> ## Documentation Index > Fetch the complete documentation index at: https://docs.standardmodel.bio/llms.txt > Use this file to discover all available pages before exploring further. # More about the Standard Model `smb-v1-1.7b` is our flagship biological world model. Unlike traditional LLMs that predict tokens, it predicts **patient states** in latent space, modeling how patients evolve over time and respond to interventions. Read more about our modeling approach and benchmarks at our [blog](blog.standardmodel.bio). ## Key Differentiators Predicts future patient states in latent space, not text tokens Learns cause-and-effect: (Pre-State + Intervention) → Post-State Unifies genomics, imaging, EHR, and proteomics ## Environment Activation ```bash theme={null} source standard_model/bin/activate ``` ## Usage ```python theme={null} from transformers import AutoModel, AutoTokenizer import torch # Load model model = AutoModel.from_pretrained("standardmodelbio/smb-v1-1.7b") tokenizer = AutoTokenizer.from_pretrained("standardmodelbio/smb-v1-1.7b") # Move to GPU device = "cuda" if torch.cuda.is_available() else "cpu" model = model.to(device) model.eval() ``` ## Architecture The Standard Model uses **Joint-Embedding Predictive Architecture (JEPA)** — treating the patient as a dynamic "world" and treatments as interventions that change that world. Standard Model Architecture

### How It Works Raw signals — genomics, proteomics, imaging, EHR data — pass through modality-specific encoders. Each encoder is trained to extract meaningful representations from its data type. A specialized projector maps these encodings into a universal latent space. This creates a "fused" patient state embedding that retains both high-level semantic context and low-level biological granularity. Given the current patient state *S(t)* and an intervention *A(t)*, the model predicts the future state *S(t+1)* in latent space — not as text, but as a dense embedding. The model combines supervised fine-tuning (anchoring to clinical outcomes) with JEPA objectives (learning dynamics), preventing training collapse. ## Extracting Embeddings Get patient state embeddings for downstream tasks: ```python theme={null} from transformers import AutoModel, AutoTokenizer import torch model = AutoModel.from_pretrained("standardmodelbio/smb-v1-1.7b") tokenizer = AutoTokenizer.from_pretrained("standardmodelbio/smb-v1-1.7b") device = "cuda" if torch.cuda.is_available() else "cpu" model = model.to(device) model.eval() # Prepare input inputs = tokenizer( "patient clinical data here", return_tensors="pt", padding=True, truncation=True ).to(device) # Extract embeddings with torch.no_grad(): outputs = model(**inputs) embeddings = outputs.last_hidden_state # Pool to get single patient embedding patient_embedding = embeddings.mean(dim=1) # [batch, hidden_dim] print(f"Embedding shape: {patient_embedding.shape}") ``` ## Use Cases Simulate how a tumor would evolve under Treatment A versus Treatment B by conditioning on different interventions. Predict disease progression over 3, 6, or 12 month windows. Create evolving patient representations that update as new data arrives. Model probability of response to specific therapies. ## Memory Optimization `smb-v1-1.7b` requires 16GB GPU memory at full precision. Memory usage can be reduced in several ways: ```python theme={null} model = AutoModel.from_pretrained( "standardmodelbio/smb-v1-1.7b", torch_dtype=torch.float16, device_map="auto" ) ``` **Memory:** \~8GB ```python theme={null} # pip install bitsandbytes model = AutoModel.from_pretrained( "standardmodelbio/smb-v1-1.7b", load_in_8bit=True, device_map="auto" ) ``` **Memory:** \~4GB ```python theme={null} # pip install bitsandbytes from transformers import BitsAndBytesConfig quantization_config = BitsAndBytesConfig( load_in_4bit=True, bnb_4bit_compute_dtype=torch.float16 ) model = AutoModel.from_pretrained( "standardmodelbio/smb-v1-1.7b", quantization_config=quantization_config, device_map="auto" ) ``` **Memory:** \~2GB ## Hardware Requirements | Precision | GPU Memory | Recommended GPU | | --------- | ---------- | --------------- | | float32 | 16 GB | A100, A6000 | | float16 | 8 GB | RTX 4090, A10 | | 8-bit | 4 GB | RTX 3080, T4 | | 4-bit | 2 GB | RTX 3060 |