jax_mppi
JAX implementation of Model Predictive Path Integral (MPPI) control
jax_mppi is a functional, JIT-compilable port of the pytorch_mppi library to JAX. It implements Model Predictive Path Integral (MPPI) control with a focus on performance and composability.
Design Philosophy
This library embraces JAX's functional paradigm:
- Pure Functions: Core logic is implemented as pure functions: `command(state, mppi_state) -> (action, mppi_state)`.
- Dataclass State: State is held in `jax.tree_util.register_dataclass` containers, allowing easy integration with `jit`, `vmap`, and `grad`.
- No Side Effects: Unlike the PyTorch version, there is no mutable `self`. State transitions are explicit.
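As a minimal sketch of this pattern (the `ControllerState` fields and the `step` update below are hypothetical illustrations, not the library's actual definitions):

```python
import jax
import jax.numpy as jnp
from dataclasses import dataclass

# Hypothetical state container in the style described above: every field is an
# array (a pytree leaf), so the whole object passes through jit/vmap/grad.
@jax.tree_util.register_dataclass
@dataclass
class ControllerState:
    mean_actions: jax.Array  # nominal action sequence, shape (horizon, nu)
    rng_key: jax.Array       # PRNG key threaded through each call

@jax.jit
def step(state: ControllerState) -> ControllerState:
    # Pure update: return a new state instead of mutating `self`.
    key, subkey = jax.random.split(state.rng_key)
    noise = 0.1 * jax.random.normal(subkey, state.mean_actions.shape)
    return ControllerState(mean_actions=state.mean_actions + noise, rng_key=key)

state = ControllerState(mean_actions=jnp.zeros((20, 2)), rng_key=jax.random.PRNGKey(0))
state = step(state)  # each call yields a fresh, immutable state
```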
Key Features
- Core MPPI: Robust implementation of the standard MPPI algorithm.
- Smooth MPPI (SMPPI): Maintains action sequences and smoothness costs for better trajectory generation.
- Kernel MPPI (KMPPI): Uses kernel interpolation for control points, reducing the parameter space.
- I-MPPI (Informative MPPI): Two-layer hierarchical architecture for autonomous exploration:
- Layer 2 (FSMI): Fast Shannon Mutual Information trajectory generation (~5 Hz)
- Layer 3 (Biased MPPI): Information-aware tracking control (~50 Hz)
- Occupancy grid-based exploration with GPU acceleration
- Interactive Colab notebook for real-time parameter tuning
- Autotuning: Built-in hyperparameter optimization with multiple backends:
- CMA-ES (via `cma` library) - Classic evolution strategy
- CMA-ES, Sep-CMA-ES, OpenES (via `evosax`) - JAX-native, GPU-accelerated ⚡
- Ray Tune - Distributed hyperparameter search
- CMA-ME (via `ribs`) - Quality diversity optimization
- CUDA/C++ Backend: High-performance implementations of all controllers in CUDA/C++17, exposed to Python via `nanobind`. Ideal for deployments needing maximum throughput.
- JAX Integration:
- `jax.vmap` for efficient batch processing.
- `jax.lax.scan` for fast horizon loops.
- Fully compatible with JIT compilation for high-performance control loops.
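To make the last two bullets concrete, here is an illustrative sketch of the textbook MPPI update written with `jax.vmap` and `jax.lax.scan` (generic algorithm math under assumed shapes, not the library's internal code):

```python
import jax
import jax.numpy as jnp

def rollout_cost(dynamics, running_cost, x0, actions):
    """Total cost of one action sequence, accumulated with lax.scan."""
    def step(state, action):
        return dynamics(state, action), running_cost(state, action)
    _, costs = jax.lax.scan(step, x0, actions)  # actions: (horizon, nu)
    return jnp.sum(costs)

def mppi_update(key, x0, nominal, dynamics, running_cost,
                num_samples=256, sigma=0.1, lambda_=1.0):
    # Sample K perturbed action sequences around the nominal one.
    noise = sigma * jax.random.normal(key, (num_samples,) + nominal.shape)
    samples = nominal + noise
    # Evaluate all K rollouts in parallel with vmap.
    costs = jax.vmap(lambda a: rollout_cost(dynamics, running_cost, x0, a))(samples)
    # Exponentially weight low-cost rollouts (temperature lambda_) and average.
    weights = jax.nn.softmax(-(costs - costs.min()) / lambda_)
    return jnp.einsum('k,kto->to', weights, samples)
```

Because everything above is a pure function of arrays, the entire update can be wrapped in `jax.jit`.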
Installation
```bash
# Install from PyPI
pip install jax-mppi

# Or with optional dependencies
pip install jax-mppi[dev]              # Development tools
pip install jax-mppi[docs]             # Documentation
pip install jax-mppi[autotuning]       # Autotuning (cma + evosax)
pip install jax-mppi[autotuning-extra] # Ray Tune, Hyperopt, Ribs
```
Development Installation
For contributors who want to work on the package (requires Python 3.12+):
```bash
# Clone the repository with submodules
git clone --recursive https://github.com/riccardo-enr/jax_mppi.git
cd jax_mppi

# Or if already cloned without --recursive
git submodule update --init --recursive

# Install in development mode
pip install -e .
```
Note: The CUDA backend lives in a separate repository (cuda_mppi) integrated as a git submodule at third_party/cuda-mppi. You need to initialize submodules to build the CUDA components.
Versioning
This project uses Semantic Versioning following the major.minor.patch scheme:
- Major: Breaking changes to the API or significant feature additions.
- Minor: New features or enhancements that are backward compatible.
- Patch: Bug fixes and minor updates.
See CHANGELOG for detailed version history.
Usage
```python
import jax
import jax.numpy as jnp
from jax_mppi import mppi

# Define dynamics and cost functions
def dynamics(state, action):
    # Your dynamics model here
    return state + action

def running_cost(state, action):
    # Your cost function here
    return jnp.sum(state**2) + jnp.sum(action**2)

# Create configuration and initial state
config, mppi_state = mppi.create(
    nx=4, nu=2,
    noise_sigma=jnp.eye(2) * 0.1,
    horizon=20,
    lambda_=1.0,
)

# Control loop
key = jax.random.PRNGKey(0)
current_obs = jnp.zeros(4)

# JIT compile the command function for performance
jitted_command = jax.jit(mppi.command, static_argnames=['dynamics', 'running_cost'])

for _ in range(100):
    key, subkey = jax.random.split(key)
    action, mppi_state = jitted_command(
        config,
        mppi_state,
        current_obs,
        dynamics=dynamics,
        running_cost=running_cost,
    )
    # Apply action to environment...
```
Autotuning
JAX-MPPI includes powerful hyperparameter optimization capabilities. You can automatically tune MPPI parameters like `lambda_`, `noise_sigma`, and `horizon` using multiple optimization backends.
Quick Example
```python
from jax_mppi import autotune, mppi

# Create MPPI configuration
config, state = mppi.create(nx=4, nu=2, horizon=20)
holder = autotune.ConfigStateHolder(config, state)

# Define what to tune
params_to_tune = [
    autotune.LambdaParameter(holder, min_value=0.1),
    autotune.NoiseSigmaParameter(holder, min_value=0.01),
]

# Define evaluation function
def evaluate():
    # Run MPPI, return cost
    # ... your evaluation logic ...
    return autotune.EvaluationResult(mean_cost=cost, ...)

# Choose an optimizer
from jax_mppi import autotune_evosax  # JAX-native, GPU-accelerated
optimizer = autotune_evosax.CMAESOpt(population=10, sigma=0.1)
# Or use classic CMA-ES
# optimizer = autotune.CMAESOpt(population=10, sigma=0.1)

# Run optimization
tuner = autotune.Autotune(
    params_to_tune=params_to_tune,
    evaluate_fn=evaluate,
    optimizer=optimizer,
)
best = tuner.optimize_all(iterations=50)
```
Available Optimizers
| Optimizer | Backend | GPU Support | Best For |
|---|---|---|---|
| `autotune.CMAESOpt` | `cma` library | ❌ | Classic CMA-ES, stable |
| `autotune_evosax.CMAESOpt` | evosax | ✅ | JAX-native, 5-10x faster on GPU |
| `autotune_evosax.SepCMAESOpt` | evosax | ✅ | High-dimensional problems |
| `autotune_evosax.OpenESOpt` | evosax | ✅ | Large populations, parallelization |
| `autotune_global.RayOptimizer` | Ray Tune | ✅ | Distributed search |
| `autotune_qd.CMAMEOpt` | ribs | ❌ | Quality diversity |
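If you are unsure which backend is installed, a simple fallback pattern works; this sketch uses only the constructors listed above:

```python
# Prefer the GPU-friendly evosax backend when available,
# falling back to the classic cma-based optimizer otherwise.
try:
    from jax_mppi import autotune_evosax
    optimizer = autotune_evosax.CMAESOpt(population=10, sigma=0.1)
except ImportError:
    from jax_mppi import autotune
    optimizer = autotune.CMAESOpt(population=10, sigma=0.1)
```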
Evosax vs CMA Library
Migrating from cma to evosax:
```python
# Before (cma library)
from jax_mppi.autotune import CMAESOpt
optimizer = CMAESOpt(population=10, sigma=0.1)

# After (evosax - JAX-native)
from jax_mppi.autotune_evosax import CMAESOpt
optimizer = CMAESOpt(population=10, sigma=0.1)
```
Benefits of evosax:
- ⚡ 5-10x faster on GPU due to JIT compilation
- 🔧 Multiple strategies (CMA-ES, Sep-CMA-ES, OpenES, SNES, xNES)
- 🎯 JAX-native - seamless integration with JAX code
- 📦 Pure Python - no external C++ dependencies
See examples/autotuning/evosax_comparison.py for a detailed performance comparison.
I-MPPI: Informative Path Planning
I-MPPI extends the MPPI framework with information-theoretic path planning for autonomous exploration. The system uses a hierarchical architecture that combines global strategic planning with reactive control:
Architecture
- Layer 2 (FSMI Planner): Generates information-rich reference trajectories using Fast Shannon Mutual Information
- Layer 3 (Biased MPPI): Tracks references while gathering local information via Uniform-FSMI
- Occupancy Grid: Represents environment uncertainty and enables information gain computation
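The FSMI computation itself is beyond a snippet, but the trade-off Layer 3 encodes can be sketched as a running cost that rewards reducing occupancy-grid uncertainty. All names below, `biased_running_cost` included, are hypothetical illustrations, not the library's FSMI implementation:

```python
import jax.numpy as jnp

def cell_entropy(p):
    """Shannon entropy of an occupancy probability, clipped for stability."""
    p = jnp.clip(p, 1e-6, 1.0 - 1e-6)
    return -(p * jnp.log(p) + (1.0 - p) * jnp.log(1.0 - p))

def biased_running_cost(state, action, reference, grid_probs, info_weight=1.0):
    # Tracking term: stay close to the Layer-2 reference trajectory.
    tracking = jnp.sum((state[:2] - reference) ** 2)
    # Information term: reward visiting cells whose occupancy is uncertain
    # (assumes a square grid at 0.1 m resolution anchored at the origin).
    idx = jnp.clip((state[:2] / 0.1).astype(jnp.int32), 0, grid_probs.shape[0] - 1)
    info_gain = cell_entropy(grid_probs[idx[0], idx[1]])
    return tracking - info_weight * info_gain
```

Trajectories that pass through high-entropy (unexplored) cells then score lower total cost, which is what biases the tracker toward informative motion.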
Key Capabilities
- Autonomous Exploration: Seeks high-information regions while avoiding obstacles
- Real-time Performance: ~5 Hz for global planning, ~50 Hz for local control
- GPU Accelerated: Full JAX implementation for efficient computation
- Interactive Tuning: Jupyter notebook with widgets for parameter exploration
Getting Started with I-MPPI
```python
from jax_mppi.i_mppi import FSMIConfig, create_fsmi_state

# Configure information-driven planner
config = FSMIConfig(
    grid_resolution=0.1,  # 10cm grid cells
    sensor_range=5.0,     # 5m sensing range
    info_weight=1.0,      # Information gain weight
)

# Run I-MPPI simulation
# See examples/i_mppi/simulation.py for complete example
```
For detailed theory and implementation, see the I-MPPI documentation.
Quadrotor Examples
JAX-MPPI includes comprehensive quadrotor control examples demonstrating trajectory tracking with nonlinear 6-DOF dynamics:
Features
- 6-DOF Dynamics: Full quaternion-based attitude representation with NED/FRD frame conventions
- Multiple Trajectories: Hover, circle, figure-8, and custom waypoint-based paths
- MPPI Variant Comparison: Side-by-side performance analysis of MPPI, SMPPI, and KMPPI
- Real-time Performance: 50 Hz control loops with JIT compilation
- Rich Visualizations: Trajectory plots, tracking errors, control inputs, and performance metrics
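For reference, the quaternion kinematics underlying the first bullet look like this (a generic sketch using a `[w, x, y, z]` Hamilton convention, not necessarily the exact convention used in the examples):

```python
import jax.numpy as jnp

def quat_mul(q, r):
    """Hamilton product of quaternions stored as [w, x, y, z]."""
    w0, x0, y0, z0 = q
    w1, x1, y1, z1 = r
    return jnp.array([
        w0*w1 - x0*x1 - y0*y1 - z0*z1,
        w0*x1 + x0*w1 + y0*z1 - z0*y1,
        w0*y1 - x0*z1 + y0*w1 + z0*x1,
        w0*z1 + x0*y1 - y0*x1 + z0*w1,
    ])

def integrate_attitude(q, omega_body, dt):
    """One Euler step of q_dot = 0.5 * q ⊗ [0, omega_body], renormalized."""
    q_dot = 0.5 * quat_mul(q, jnp.concatenate([jnp.zeros(1), omega_body]))
    q_next = q + dt * q_dot
    return q_next / jnp.linalg.norm(q_next)
```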
Available Examples
```bash
# Basic stabilization
python examples/quadrotor/hover.py

# Trajectory tracking
python examples/quadrotor/circle.py

# Waypoint navigation
python examples/quadrotor/custom_trajectory.py

# Compare MPPI variants
python examples/quadrotor/figure8_comparison.py
```
Key Results: SMPPI achieves 30-40% smoother control (lower jerk) compared to standard MPPI while maintaining similar tracking accuracy (<0.1m RMS error).
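One way to quantify that kind of smoothness from a logged control sequence is a squared-second-difference proxy; this is a hypothetical metric sketch, not necessarily the one used in the comparison script:

```python
import jax.numpy as jnp

def smoothness_proxy(controls, dt):
    """Mean squared second difference of a (T, nu) control log; lower is smoother."""
    second_diff = jnp.diff(controls, n=2, axis=0) / dt**2
    return jnp.mean(jnp.sum(second_diff**2, axis=-1))
```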
See examples/quadrotor/ for more details.
Project Structure
```
jax_mppi/
├── src/jax_mppi/
│   ├── mppi.py             # Core MPPI implementation
│   ├── smppi.py            # Smooth MPPI variant
│   ├── kmppi.py            # Kernel MPPI variant
│   ├── types.py            # Type definitions
│   ├── autotune.py         # Autotuning core & CMA-ES (cma lib)
│   ├── autotune_evosax.py  # JAX-native optimizers (evosax)
│   ├── autotune_global.py  # Ray Tune integration
│   └── autotune_qd.py      # Quality Diversity optimization
├── examples/
│   ├── basic/              # Introductory examples (pendulum)
│   ├── quadrotor/          # Quadrotor control & comparisons
│   ├── i_mppi/             # Informative MPPI simulation
│   ├── autotuning/         # Hyperparameter optimization
│   ├── cuda/               # CUDA acceleration examples
│   └── benchmarks/         # Performance comparisons
└── tests/                  # Unit and integration tests
```
Roadmap
The development is structured in phases:
- Core MPPI: Basic JAX implementation with parity to pytorch_mppi.
- Integration: Pendulum example and verification.
- Smooth MPPI: Implementation of smoothness constraints.
- Kernel MPPI: Kernel-based control parameterization.
- Comparisons: Benchmarking and visual comparisons.
- Autotuning: Parameter optimization using CMA-ES, Ray Tune, and QD.
Credits
This project is a direct port of pytorch_mppi. We aim to maintain parity with the original implementation while leveraging JAX's unique features for performance and flexibility.