Command Line Interface¶

The RetroCast CLI provides a unified interface for standardizing, scoring, and analyzing retrosynthesis predictions.

Two modes of operation

Project Mode
Structured workflow for reproducible benchmarking of multiple models

Ad-Hoc Mode
Direct commands for processing individual files without configuration

Installation¶

With uv (recommended):

uv tool install retrocast

or, optionally, if you want to create plots during analysis:

uv tool install "retrocast[viz]"

With pip:

pip install retrocast

or, optionally, if you want to create plots during analysis:

pip install "retrocast[viz]"

Verify installation:

retrocast --version

Global Options¶

These options apply to all commands:

retrocast [--config CONFIG] [--data-dir DATA_DIR] <command>

Option	Description	Default
`--config`	Path to config file	`retrocast-config.yaml`
`--data-dir`	Override data directory	`data/retrocast/`

Data Directory Resolution¶

The data directory is resolved in the following order of priority (highest first):

1. CLI flag: --data-dir /custom/path
2. Environment variable: RETROCAST_DATA_DIR=/custom/path
3. Config file: data_dir: /custom/path in retrocast-config.yaml
4. Default: data/retrocast/

Examples

# Use CLI flag (highest priority)
retrocast --data-dir ./my-data ingest --model my-model --dataset mkt-cnv-160

# Use environment variable
RETROCAST_DATA_DIR=./my-data retrocast ingest --model my-model --dataset mkt-cnv-160

# Check current configuration
retrocast config
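
The priority order above can be sketched in a few lines of Python. This is only an illustration of the documented precedence, not RetroCast's actual implementation: the function name `resolve_data_dir`, its parameters, and the source labels other than `default` are assumptions made for the example.

```python
import os
from pathlib import Path

DEFAULT_DATA_DIR = "data/retrocast/"  # default stated in this document


def resolve_data_dir(cli_flag=None, config=None, env=None):
    """Return (path, source) following the documented priority order.

    Hypothetical helper: the name and signature are illustrative only.
    """
    env = os.environ if env is None else env
    config = config or {}
    if cli_flag:                                   # 1. --data-dir
        return Path(cli_flag), "cli"
    if env.get("RETROCAST_DATA_DIR"):              # 2. environment variable
        return Path(env["RETROCAST_DATA_DIR"]), "env"
    if config.get("data_dir"):                     # 3. data_dir in the config file
        return Path(config["data_dir"]), "config"
    return Path(DEFAULT_DATA_DIR), "default"       # 4. built-in default


# The CLI flag wins even when the other sources are set.
print(resolve_data_dir("./my-data", {"data_dir": "/from-config"},
                       {"RETROCAST_DATA_DIR": "/from-env"}))
```

Passing `env` explicitly keeps the sketch easy to test; when it is omitted, the real process environment is consulted.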

Migration from older versions

The default data directory changed from data/ to data/retrocast/ in version 0.6. If you have existing data at data/, either:

Move it: mv data/1-benchmarks data/retrocast/
Or set the environment variable: export RETROCAST_DATA_DIR=data

Ad-Hoc Workflow¶

When to use ad-hoc mode

Use these commands to process single files immediately without setting up a project directory. Great for:

Quick experiments
One-off evaluations
Testing new adapters

`adapt` - Convert Raw Predictions¶

Convert raw output from a supported model into the standardized RetroCast format.

retrocast adapt \
  --input raw_predictions.json.gz \
  --adapter aizynth \
  --output standardized_routes.json.gz \
  --benchmark benchmark.json.gz

--adapter: See available adapters with retrocast list-adapters
--benchmark: Optional. Ensures target IDs match exactly

Supported adapters: aizynth, dms, retrostar, synplanner, syntheseus, askcos, retrochimera, dreamretro, multistepttl, synllama, paroutes

`score-file` - Evaluate Routes¶

Evaluate standardized routes against a stock file.

retrocast score-file \
  --benchmark benchmark.json.gz \
  --routes standardized_routes.json.gz \
  --stock stock_smiles.txt \
  --output scores.json.gz \
  --model-name "My-Experiment"

--stock: Text file with one canonical SMILES per line

Output: JSON file with boolean flags (is_solved, matches_ground_truth, etc.)

`create-benchmark` - Generate Benchmarks¶

Generate a benchmark definition file from a simple list of SMILES strings.

retrocast create-benchmark \
  --input targets.txt \
  --name "custom-benchmark" \
  --stock-name "zinc-stock" \
  --output custom-benchmark.json.gz

--input: Text file or CSV with SMILES strings

Project Workflow¶

Recommended for research

For large-scale evaluations, project mode provides:

Reproducible benchmarking
Multi-model comparison
Cryptographic audit trail
Automated manifest tracking

`init` - Initialize Project¶

Generate a default configuration file and directory structure:

retrocast init

Creates:

retrocast-config.yaml - Model configuration
data/ directory structure

Configuration¶

Edit retrocast-config.yaml to register your models:

retrocast-config.yaml

models:
  dms-explorer:  # Model identifier (used in CLI commands)
    adapter: dms  # Adapter type (see retrocast list-adapters)
    raw_results_filename: predictions.json  # Expected filename in 2-raw/<model>/<benchmark>/
    sampling:  # Optional: limit routes per target
      strategy: top-k
      k: 50

Advanced configuration

models:
  my-model:
    adapter: aizynth
    raw_results_filename: results.json.gz
    sampling:
      strategy: top-k
      k: 100
    # Custom metadata
    metadata:
      version: "2.0"
      training_data: "USPTO-50k"

The Pipeline¶

graph LR
    A[2-raw/] --> B[ingest]
    B --> C[3-processed/]
    C --> D[score]
    D --> E[4-scored/]
    E --> F[analyze]
    F --> G[5-results/]

All paths are relative to your data directory (default: data/retrocast/).
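
As a quick orientation, the per-stage locations documented in the ingest, score, and analyze sections can be collected into one small helper. This is a sketch for illustration: `stage_paths` is a hypothetical name and not part of the RetroCast API. Only the directory and file names come from this document.

```python
from pathlib import Path


def stage_paths(data_dir, model, dataset, stock):
    """Map each pipeline stage to the location documented for it.

    Hypothetical helper for illustration; it is not part of RetroCast.
    """
    root = Path(data_dir)
    return {
        "raw": root / "2-raw" / model / dataset,
        "processed": root / "3-processed" / model / dataset / "routes.json.gz",
        "scored": root / "4-scored" / model / dataset / stock / "scores.json.gz",
        # Note the swapped order: results are grouped by dataset, then model.
        "results": root / "5-results" / dataset / model,
    }


paths = stage_paths("data/retrocast", "dms-explorer", "paroutes-n1", "zinc-stock")
for stage, path in paths.items():
    print(f"{stage:10s} {path.as_posix()}")
```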

`ingest` - Standardize Routes¶

Transforms raw model outputs into standardized Route objects.

retrocast ingest \
  --model dms-explorer \
  --dataset paroutes-n1 \
  --anonymize \
  --ignore-stereo

--anonymize: Optional. Hashes the model name for blind review
--ignore-stereo: Optional. Strips stereochemistry during SMILES canonicalization

Input: <data-dir>/2-raw/<model>/<dataset>/<raw_results_filename>
Output: <data-dir>/3-processed/<model>/<dataset>/routes.json.gz

Operations:

Parse raw format via adapter
Canonicalize SMILES (optionally ignoring stereochemistry with --ignore-stereo)
Deduplicate routes
Apply sampling strategy (if configured)
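
The last two operations, deduplication followed by top-k sampling, can be sketched as below. This is a minimal illustration under stated assumptions, not the RetroCast implementation: routes are represented as plain lists of reaction strings, they are assumed to arrive ordered best-first, and `dedupe_and_sample` is a hypothetical name. Canonicalization is left out because it needs a cheminformatics toolkit.

```python
def dedupe_and_sample(routes, key, k=None):
    """Keep the first occurrence of each route, then truncate to the top k.

    `routes` is assumed to be ordered best-first for a single target; `key`
    maps a route to a hashable signature (for example, a canonical string).
    """
    seen = set()
    unique = []
    for route in routes:
        signature = key(route)
        if signature not in seen:
            seen.add(signature)
            unique.append(route)
    return unique if k is None else unique[:k]


# Toy routes: each is a list of reaction strings written as "reactants>>product".
routes = [
    ["CCO.CC(=O)O>>CCOC(C)=O"],
    ["CCO.CC(=O)O>>CCOC(C)=O"],          # exact duplicate of the first route
    ["CCO.CC(=O)Cl>>CCOC(C)=O"],
    ["CCO.CC(=O)OC(C)=O>>CCOC(C)=O"],
]
kept = dedupe_and_sample(routes, key=lambda r: tuple(sorted(r)), k=2)
print(len(kept))  # 2
```

Deduplicating before sampling means duplicates do not consume slots in the top-k budget.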

Stereochemistry-agnostic processing

The --ignore-stereo flag removes stereochemical information during canonicalization. This is useful for model developers who want to isolate whether their model struggles specifically with stereochemistry or has broader issues with reaction prediction and stock termination.

Not recommended for production evaluation - stereochemistry is critical for experimental chemistry.

`score` - Evaluate Routes¶

Evaluates processed routes against benchmark stock.

retrocast score \
  --model dms-explorer \
  --dataset paroutes-n1 \
  --stock-override zinc-stock \
  --ignore-stereo

--stock-override: Optional. Overrides the default benchmark stock
--ignore-stereo: Optional. Performs stereochemistry-agnostic matching by dropping stereochemistry from InChIKeys

Input: <data-dir>/3-processed/<model>/<dataset>/routes.json.gz
Output: <data-dir>/4-scored/<model>/<dataset>/<stock>/scores.json.gz

Annotations added:

is_solved - All leaves in stock
matches_ground_truth - Route matches reference (with optional stereochemistry-agnostic matching via --ignore-stereo)
length - Number of steps
is_convergent - Contains convergent reactions
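
To make two of these annotations concrete, here is a small sketch of how is_solved and length could be derived from a route tree. The nested-dict route shape (`smiles`, `children`) is an assumption made for this example and may not match RetroCast's Route objects. The code also assumes that length means the number of reaction steps on the longest path.

```python
def leaves(node):
    """Yield the SMILES of every leaf (a node with no precursors)."""
    children = node.get("children", [])
    if not children:
        yield node["smiles"]
    for child in children:
        yield from leaves(child)


def depth(node):
    """Number of reaction steps on the longest path from target to a leaf."""
    children = node.get("children", [])
    return 0 if not children else 1 + max(depth(c) for c in children)


def annotate(route, stock):
    """Compute illustrative versions of two of the documented annotations."""
    return {
        "is_solved": all(smi in stock for smi in leaves(route)),
        "length": depth(route),
    }


# A stock is just a set of SMILES, e.g. loaded from a one-per-line text file.
stock = {"CCO", "CC(=O)Cl"}
route = {
    "smiles": "CCOC(C)=O",
    "children": [{"smiles": "CCO"}, {"smiles": "CC(=O)Cl"}],
}
print(annotate(route, stock))  # {'is_solved': True, 'length': 1}
```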

Stereochemistry-agnostic evaluation

The --ignore-stereo flag enables stereochemistry-agnostic evaluation. When enabled, molecules that differ only in stereochemistry are treated as identical during scoring. This allows model developers to calculate Top-K accuracy metrics focused on molecular connectivity rather than stereochemical correctness.

Use case: Helps distinguish between stereochemistry-specific issues and fundamental retrosynthesis planning problems.

Not recommended for production evaluation - stereochemistry is critical for experimental chemistry.
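
One way to picture stereochemistry-agnostic matching is to compare only the first block of a standard InChIKey, which hashes the molecular skeleton, and ignore the second block, which carries stereochemistry. The sketch below assumes that approach. Whether RetroCast does exactly this is not stated here, and the function names are hypothetical. Note that the second block also encodes isotopic information, so this comparison ignores isotope labels as well.

```python
def connectivity_block(inchikey):
    """Return the first 14-character block of a standard InChIKey."""
    return inchikey.split("-")[0]


def same_molecule(key_a, key_b, ignore_stereo=False):
    """Compare two InChIKeys, optionally ignoring the stereo-bearing block."""
    if ignore_stereo:
        return connectivity_block(key_a) == connectivity_block(key_b)
    return key_a == key_b


# The two enantiomers of alanine share a skeleton but differ in stereochemistry.
L_ALANINE = "QNAYBMKLOCPYGJ-REOHSAIQSA-N"
D_ALANINE = "QNAYBMKLOCPYGJ-UWTATZPHSA-N"

print(same_molecule(L_ALANINE, D_ALANINE))                      # False
print(same_molecule(L_ALANINE, D_ALANINE, ignore_stereo=True))  # True
```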

`analyze` - Generate Reports¶

Aggregates scores into statistical reports with confidence intervals.

retrocast analyze \
  --model dms-explorer \
  --dataset paroutes-n1 \
  --make-plots \
  --top-k 1 5 10 50

--make-plots: Generates interactive HTML visualizations
--top-k: Customizes K values (default: 1, 3, 5, 10, 20, 50, 100)

Input: <data-dir>/4-scored/<model>/<dataset>/<stock>/scores.json.gz
Output: <data-dir>/5-results/<dataset>/<model>/

report.md - Statistical summary
*.html - Interactive plots (if --make-plots)

Metrics computed:

Overall Solvability with 95% CI (bootstrap)
Top-K accuracy (K ∈ {1, 3, 5, 10, ...})
Stratified performance by route length
Ground truth match rate
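
The metrics above can be illustrated with a short, dependency-free sketch. It assumes that Top-K accuracy counts a target as a hit when its first ground-truth-matching route is ranked within K, and it uses a simple percentile bootstrap for the interval. The exact resampling scheme RetroCast uses is not described here, and the function names and toy data are made up for the example.

```python
import random


def top_k_accuracy(match_ranks, k):
    """Fraction of targets whose first matching route is ranked within k.

    `match_ranks` holds, per target, the 1-based rank of the first route that
    matches the ground truth, or None if no route matches.
    """
    hits = sum(1 for r in match_ranks if r is not None and r <= k)
    return hits / len(match_ranks)


def bootstrap_ci(values, n_resamples=2000, alpha=0.05, seed=0):
    """Percentile bootstrap confidence interval for the mean of `values`."""
    rng = random.Random(seed)
    n = len(values)
    means = sorted(
        sum(rng.choices(values, k=n)) / n for _ in range(n_resamples)
    )
    lower = means[int((alpha / 2) * n_resamples)]
    upper = means[int((1 - alpha / 2) * n_resamples) - 1]
    return lower, upper


solved = [1, 1, 0, 1, 0, 1, 1, 1, 0, 1]           # is_solved per target (toy data)
ranks = [1, 3, None, 2, None, 1, 7, 1, None, 12]  # rank of first ground-truth match

print(sum(solved) / len(solved))   # 0.7
print(top_k_accuracy(ranks, 1))    # 0.3
print(top_k_accuracy(ranks, 5))    # 0.5
print(bootstrap_ci(solved))        # 95% interval around the 0.7 solvability
```

With only ten targets the interval is wide, which is the point of reporting it alongside the point estimate.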

Verification & Auditing¶

Cryptographic audit trail

RetroCast generates a manifest.json for every file it creates, tracking:

Input file SHA256 hashes
Command parameters
Output file hashes
Timestamp and RetroCast version
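
The core of such an audit trail is a file hash compared against a recorded value. The sketch below shows that check using Python's standard `hashlib`. The manifest layout, including the `output_sha256` key, is invented for this example and should not be read as the real manifest.json schema.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Stream a file through SHA-256 so large archives are not loaded whole."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_manifest(path, manifest):
    """Compare a file's current hash with the one recorded in its manifest.

    The "output_sha256" key is a made-up field name for this sketch.
    """
    return sha256_of(path) == manifest["output_sha256"]


with tempfile.TemporaryDirectory() as tmp:
    scores = Path(tmp) / "scores.json.gz"
    scores.write_bytes(b"example payload")
    manifest = {"output_sha256": sha256_of(scores)}
    intact = matches_manifest(scores, manifest)

    scores.write_bytes(b"tampered payload")   # simulate a manual edit
    tampered = matches_manifest(scores, manifest)

print(intact, tampered)  # True False
```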

`verify` - Check Data Integrity¶

Verify the integrity of your data pipeline:

retrocast verify \
  --target data/5-results/paroutes-n1/dms-explorer \
  --deep

--deep: Optional. Recursively verifies the entire dependency graph

Verification modes:

Standard Check

Verifies that the file on disk matches the SHA256 hash in its manifest.

retrocast verify --target 4-scored/model/dataset/

Deep Check

Recursively verifies the entire dependency graph:

Analyze → Score → Ingest → Raw

Ensures logical consistency across the pipeline.

retrocast verify --target 5-results/dataset/model/ --deep

What it detects:

Data corruption
Manual file tampering
Out-of-order execution
Hash mismatches
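
A recursive check of this kind can be sketched with in-memory stand-ins for files and manifests. Everything about the data layout here is hypothetical: the real manifests live on disk and their fields may differ. The example simulates out-of-order execution, where ingest is re-run but score is not, so the scored file still records the hash of the old processed file.

```python
import hashlib


def digest(data):
    """Hex SHA-256 of a bytes object."""
    return hashlib.sha256(data).hexdigest()


def verify_deep(name, files, manifests):
    """Recursively check a file and everything it was derived from.

    `files` maps a name to its current bytes; `manifests` maps a name to
    {"sha256": ..., "inputs": {input_name: sha256_seen_when_it_was_read}}.
    Both structures are hypothetical stand-ins for files on disk.
    Returns a list of problems (an empty list means consistent).
    """
    problems = []
    manifest = manifests.get(name)
    if manifest is None:
        return problems  # raw inputs have no manifest, so nothing to check
    if digest(files[name]) != manifest["sha256"]:
        problems.append(f"{name}: content does not match its manifest")
    for input_name, recorded in manifest["inputs"].items():
        if digest(files[input_name]) != recorded:
            problems.append(f"{name}: input {input_name} changed after it was consumed")
        problems.extend(verify_deep(input_name, files, manifests))
    return problems


files = {"raw": b"r", "processed": b"p", "scored": b"s", "results": b"a"}


def manifest_for(name, inputs):
    return {"sha256": digest(files[name]),
            "inputs": {i: digest(files[i]) for i in inputs}}


manifests = {
    "processed": manifest_for("processed", ["raw"]),
    "scored": manifest_for("scored", ["processed"]),
    "results": manifest_for("results", ["scored"]),
}
clean = verify_deep("results", files, manifests)

# Re-run "ingest" only: new processed content with a fresh manifest.
files["processed"] = b"p2"
manifests["processed"] = manifest_for("processed", ["raw"])
stale = verify_deep("results", files, manifests)

print(clean)  # []
print(stale)  # ['scored: input processed changed after it was consumed']
```

A standard check on the processed file alone would pass in this scenario; only walking the chain reveals that the scores were computed from an older version.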

Configuration & Debugging¶

`config` - Show Resolved Configuration¶

Display the resolved data directory and paths, useful for debugging path issues:

retrocast config

Output:

RetroCast Configuration
========================================

Data directory: /path/to/project/data/retrocast
  Source: default

Environment:
  RETROCAST_DATA_DIR: not set

Resolved paths:
  benchmarks: data/retrocast/1-benchmarks/definitions (exists)
  stocks    : data/retrocast/1-benchmarks/stocks (exists)
  raw       : data/retrocast/2-raw (missing)
  processed : data/retrocast/3-processed (missing)
  scored    : data/retrocast/4-scored (missing)
  results   : data/retrocast/5-results (missing)

Debugging path issues

If commands can't find your data, run retrocast config to see where RetroCast is looking and which paths exist.

Helper Commands¶

`list` - Show Configured Models¶

Lists all models in retrocast-config.yaml:

retrocast list

Output:

Configured models:
  - dms-explorer (adapter: dms)
  - aizynthfinder-mcts (adapter: aizynth)
  - retro-star (adapter: retrostar)

`list-adapters` - Show Available Adapters¶

Lists all built-in adapters:

retrocast list-adapters

Output:

Available adapters:
  - aizynth: AiZynthFinder (bipartite graph)
  - dms: DirectMultiStep (recursive dict)
  - retrostar: Retro* (precursor map)
  - synplanner: SynPlanner (bipartite graph)
  - syntheseus: Syntheseus (bipartite graph)
  - askcos: ASKCOS (custom format)
  - retrochimera: RetroChimera (precursor map)
  - dreamretro: DreamRetro (precursor map)
  - multistepttl: MultiStepTTL (custom format)
  - synllama: SynLlama (precursor map)
  - paroutes: PaRoutes (reference format)

`info` - Show Model Details¶

Display configuration details for a specific model:

retrocast info --model dms-explorer

Output:

Model: dms-explorer
  Adapter: dms
  Raw results filename: predictions.json
  Sampling:
    Strategy: top-k
    K: 50

Command Reference¶

Quick Lookup¶

Command	Purpose	Input	Output
`config`	Show resolved paths	-	Configuration display
`init`	Initialize project	-	`retrocast-config.yaml`
`adapt`	Convert raw → standardized	Raw predictions	Route objects
`score-file`	Evaluate routes	Routes + stock	Scored routes
`create-benchmark`	Generate benchmark	SMILES list	Benchmark JSON
`ingest`	Standardize (project mode)	`2-raw/`	`3-processed/`
`score`	Evaluate (project mode)	`3-processed/`	`4-scored/`
`analyze`	Generate report	`4-scored/`	`5-results/`
`verify`	Audit integrity	Manifest files	Validation report
`list`	Show models	`retrocast-config.yaml`	Model list
`list-adapters`	Show adapters	-	Adapter list
`info`	Show model config	Model name	Config details

Advanced Options¶

Stereochemistry Control¶

Both ingest and score commands support the --ignore-stereo flag for stereochemistry-agnostic processing:

Command	Flag	Purpose	Use Case
`ingest`	`--ignore-stereo`	Strip stereochemistry during canonicalization	Analyze stock termination rate/solvability without stereochemical constraints
`score`	`--ignore-stereo`	Perform stereochemistry-agnostic matching	Calculate Top-K accuracy independent of stereochemistry

For model developers

The --ignore-stereo flag is primarily useful for model development and diagnostic purposes. It allows you to determine whether prediction errors stem from stereochemical confusion or more fundamental retrosynthetic planning issues.

Not recommended for evaluating production models - stereochemistry is critical for experimental chemistry.
