Run the quickstart script to set everything up in one step:
bash -c "$(curl -fsSL https://docs.standardmodel.bio/quickstart.sh)"
This script will:
  1. Clone the quickstart repo, which includes demo.py.
  2. Install uv if it is not already present.
  3. Run uv sync to install the exact locked dependency set (PyTorch, transformers, smb_utils, etc.).
After running the quickstart script, skip ahead to Verify Your Installation to confirm everything is working.
This quickstart uses uv for dependency management. If you prefer a different tool (conda, pip, etc.), refer to pyproject.toml in the quickstart repo for the list of dependencies.

Manual Installation

GPU support is strongly recommended. smb-v1-1.7b has 1.7B parameters and requires approximately 16GB GPU memory for inference.
Requirements: Python 3.11+, Git, and uv.
1. Install uv

curl -LsSf https://astral.sh/uv/install.sh | sh
2. Clone and install

git clone https://github.com/standardmodelbio/quickstart.git
cd quickstart
uv sync

All dependencies (PyTorch, transformers, smb_utils, etc.) are installed from the lockfile, ensuring a reproducible environment.

3. Run the demo

Once the sync completes, run the demo from the quickstart/ directory:

uv run python demo.py

Verify Your Installation

From the quickstart/ directory, verify that everything is working:
uv run python -c "
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Check PyTorch and CUDA
print(f'PyTorch version: {torch.__version__}')
print(f'CUDA available: {torch.cuda.is_available()}')

# Load smb-v1-1.7b
model_id = 'standardmodelbio/smb-v1-1.7b'
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    trust_remote_code=True,
    device_map='auto'
)

print('smb-v1-1.7b loaded successfully!')
"

Next Steps

Once your installation is verified, you can run the demo for an end-to-end example, or try the model on your own data.
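
If you want a starting point for your own data, the sketch below runs a generic text-generation loop with the verified model. The prompt is a hypothetical placeholder; demo.py in the quickstart repo shows the input format smb-v1-1.7b actually expects, so treat this as a template rather than canonical usage.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "standardmodelbio/smb-v1-1.7b"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    trust_remote_code=True,
    device_map="auto"
)

# Hypothetical prompt -- see demo.py for the real input format.
prompt = "..."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)

with torch.no_grad():
    output_ids = model.generate(**inputs, max_new_tokens=64)

print(tokenizer.decode(output_ids[0], skip_special_tokens=True))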

Troubleshooting

CUDA Not Detected

Ensure your NVIDIA drivers are up to date, and run nvidia-smi to verify that the GPU is accessible.
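
If nvidia-smi works but PyTorch still reports no CUDA, the installed wheel may be a CPU-only build. A small diagnostic sketch using standard torch calls:

import torch

print(f"PyTorch: {torch.__version__}")
print(f"CUDA build: {torch.version.cuda}")  # None means a CPU-only wheel
print(f"CUDA available: {torch.cuda.is_available()}")

if torch.cuda.is_available():
    for i in range(torch.cuda.device_count()):
        props = torch.cuda.get_device_properties(i)
        print(f"GPU {i}: {props.name}, {props.total_memory / 1e9:.1f} GB")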

Out of Memory

Reduce memory use with torch.float16 or quantization (see Memory Optimization below).

Model Access Denied

Some models may require authentication. Run huggingface-cli login with your token.
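
To authenticate from Python instead of the CLI, huggingface_hub provides a login helper, and from_pretrained accepts a token argument. In this sketch, hf_... is a placeholder for your own access token:

from huggingface_hub import login
from transformers import AutoModelForCausalLM

# Option 1: log in once; the token is stored locally for future calls.
login(token="hf_...")  # placeholder token

# Option 2: pass the token directly for a single download.
model = AutoModelForCausalLM.from_pretrained(
    "standardmodelbio/smb-v1-1.7b",
    trust_remote_code=True,
    token="hf_..."  # placeholder token
)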

Slow Download

Model downloads can be large (several GB). Ensure stable connection and sufficient disk space.
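
One option is to pre-fetch the weights with snapshot_download from huggingface_hub, which resumes interrupted transfers, then load the model from the local cache as usual. A minimal sketch:

from huggingface_hub import snapshot_download

# Downloads (or resumes downloading) all model files to the local cache.
local_dir = snapshot_download("standardmodelbio/smb-v1-1.7b")
print(f"Model cached at: {local_dir}")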

Memory Optimization

For large cohorts and/or limited GPU memory, use half-precision or quantization:
import torch
from transformers import AutoModelForCausalLM

model_id = "standardmodelbio/smb-v1-1.7b"
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    trust_remote_code=True,
    torch_dtype=torch.float16,  # half precision halves weight memory
    device_map="auto"
)
Memory usage: ~8 GB (vs. ~16 GB at full precision).
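
If half precision is still too large for your GPU, 4-bit quantization is a further step down. This sketch uses the BitsAndBytesConfig API in transformers and assumes the bitsandbytes package is installed (it is not part of the quickstart lockfile):

import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# 4-bit NF4 quantization; requires the bitsandbytes package.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16
)

model = AutoModelForCausalLM.from_pretrained(
    "standardmodelbio/smb-v1-1.7b",
    trust_remote_code=True,
    quantization_config=bnb_config,
    device_map="auto"
)

Expect some loss of fidelity relative to half precision; quantization trades accuracy for memory.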
