aprender

Production ML Library in Pure Rust

A pure Rust machine learning library with SIMD acceleration, GPU inference, and the APR v2 model format. No Python dependencies, no C bindings -- memory-safe, thread-safe, and WebAssembly-ready.

What is aprender?

Aprender is a production-ready machine learning library written entirely in Rust. It provides classical and modern ML algorithms -- from linear regression and k-means to neural networks and transformers -- with SIMD acceleration via trueno and GPU inference via realizar. Models serialize to the APR v2 format with LZ4/ZSTD compression, zero-copy loading, and optional AES-256-GCM encryption.

Aprender is the ML foundation of the PAIML Sovereign AI Stack.

Features

Pure Rust -- Zero C/C++ dependencies, memory-safe and thread-safe by default.
SIMD Acceleration -- Vectorized operations via trueno 0.16 (AVX2/AVX-512/NEON).
GPU Inference -- CUDA-accelerated inference via realizar (67.8 tok/s 7B, 851 tok/s 1.5B batched on RTX 4090).
APR v2 Format -- Native model serialization with LZ4/ZSTD compression, zero-copy loading, and Int4/Int8 quantization.
Multi-Format -- Native .apr, SafeTensors (single + sharded), and GGUF support.
WebAssembly Ready -- Compile to WASM for browser and edge deployment.
12,974+ Tests -- 96.35% coverage, 73 provable contracts.

Installation

Add aprender to your Cargo.toml:

[dependencies]
aprender = "0.27"

Optional Features

[dependencies]
aprender = { version = "0.27", features = ["format-encryption", "hf-hub-integration"] }

Feature	Description
`format-encryption`	AES-256-GCM encryption for model files
`format-signing`	Ed25519 digital signatures
`format-compression`	Zstd compression
`hf-hub-integration`	Hugging Face Hub push/pull support
`gpu`	GPU acceleration via wgpu

Quick Start

use aprender::prelude::*;

fn main() -> Result<(), Box<dyn std::error::Error>> {
    // Training data
    let x = Matrix::from_vec(4, 2, vec![
        1.0, 2.0,
        2.0, 3.0,
        3.0, 4.0,
        4.0, 5.0,
    ])?;
    let y = Vector::from_slice(&[3.0, 5.0, 7.0, 9.0]);

    // Train model
    let mut model = LinearRegression::new();
    model.fit(&x, &y)?;

    // Evaluate
    println!("R-squared = {:.4}", model.score(&x, &y));

    Ok(())
}

Algorithms

Linear Models

Algorithm	Module
`LinearRegression`	`linear_model`
`LogisticRegression`	`linear_model`
`LinearSVM`	`classification`

Clustering

Algorithm	Module
`KMeans`	`cluster`
`DBSCAN`	`cluster`

Classification

Algorithm	Module
`NaiveBayes`	`classification`
`KNeighborsClassifier`	`classification`
`RandomForestClassifier`	`ensemble`
`GradientBoostingClassifier`	`ensemble`

Decision Trees

Algorithm	Module
`DecisionTreeClassifier`	`tree`

NLP and Text

Capability	Module
Tokenization (BPE, WordPiece)	`text`
TF-IDF vectorization	`text`
Stemming	`text`
Chat templates (ChatML, Llama2, Mistral, Phi)	`text`

Time Series

Capability	Module
ARIMA forecasting	`time_series`

Graph Analysis

Capability	Module
PageRank	`graph`
Betweenness centrality	`graph`
Community detection	`graph`

Bayesian Inference

Capability	Module
Gaussian Naive Bayes	`bayesian`

Neural Networks

Capability	Module
Sequential models	`nn`
Transformer layers	`nn`
Mixture of Experts	`nn`

Decomposition

Algorithm	Module
PCA	`decomposition`

Model Serialization (APR v2)

Capability	Module
Save/load with encryption	`format`
LZ4/ZSTD compression	`format`
Zero-copy memory-mapped loading	`format`
Ed25519 signatures	`format`

Online Learning

Capability	Module
Incremental model updates	`online`

Recommendation Systems

Capability	Module
Collaborative filtering	`recommend`

APR Format

The .apr format provides secure, efficient model serialization:

use aprender::format::{save, load, ModelType, SaveOptions};

// Save with encryption
save(&model, ModelType::LinearRegression, "model.apr",
    SaveOptions::default()
        .with_encryption("password")
        .with_compression(true))?;

// Load
let model: LinearRegression = load("model.apr", ModelType::LinearRegression)?;

Feature	APR v1	APR v2
Tensor Compression	None	LZ4/ZSTD
Index Format	JSON	Binary
Zero-Copy Loading	Partial	Full
Quantization	Int8	Int4/Int8
Streaming	No	Yes

Architecture

aprender
  primitives/       Core tensor types (Matrix, Vector)
  linear_model/     Linear and logistic regression
  cluster/          KMeans, DBSCAN
  classification/   SVM, Naive Bayes, KNN
  tree/             Decision trees
  ensemble/         Random forest, gradient boosting
  text/             Tokenization, TF-IDF, chat templates
  time_series/      ARIMA
  graph/            PageRank, centrality, community detection
  bayesian/         Bayesian inference
  glm/              Generalized linear models
  decomposition/    PCA, dimensionality reduction
  nn/               Neural networks, transformers, MoE
  online/           Online / incremental learning
  recommend/        Recommendation systems
  synthetic/        Synthetic data generation
  format/           APR v2 serialization (encryption, compression)
  serialization/    Legacy serialization
  prelude/          Convenient re-exports

Key dependency: trueno provides the SIMD-accelerated compute primitives (AVX2/AVX-512/NEON).

Quality

12,974+ tests, 96.35% line coverage
Zero clippy warnings (-D warnings)
73 provable contracts via provable-contracts
TDG Score: A+ (95.2/100)
Mutation testing target: >80% mutation score
Zero SATD (self-admitted technical debt)

Sovereign AI Stack

Aprender is the ML layer of the PAIML Sovereign AI Stack -- a pure Rust ecosystem for privacy-preserving ML infrastructure.

Layer	Crate	Purpose
Compute	trueno	SIMD/GPU primitives (AVX2/AVX-512/NEON, wgpu)
ML	aprender	ML algorithms, APR v2 format
Training	entrenar	Autograd, LoRA/QLoRA, quantization
Inference	realizar	APR/GGUF/SafeTensors inference, GPU kernels
Speech	whisper-apr	Pure Rust Whisper ASR
Distribution	repartir	Distributed compute (CPU/GPU/Remote)
Simulation	simular	Monte Carlo, physics, optimization
Registry	pacha	Model registry with Ed25519 signatures
Orchestration	batuta	Stack coordination and CLI

Related Crates

Crate	Description
aprender-tsp	TSP solver with CLI and `.apr` model persistence
aprender-shell	AI-powered shell completion trained on your history
apr-cookbook	50+ idiomatic Rust examples for `.apr` format and SIMD acceleration

Documentation

Resource	Link
API Reference	docs.rs/aprender
User Guide	paiml.github.io/aprender
Examples	`examples/`
APR Format Spec	`docs/specifications/archive/APR-SPEC.md`

Contributing

Fork the repository
Make your changes on the master branch
Run quality gates: cargo test --all-features && cargo clippy --all-targets -- -D warnings && cargo fmt --check
Submit a pull request

See CONTRIBUTING.md for guidelines.

License

MIT

_{Built by PAIML}

Name		Name	Last commit message	Last commit date
Latest commit History 2,516 Commits
.cargo		.cargo
.claude/skills		.claude/skills
.config		.config
.githooks		.githooks
.github		.github
.pv		.pv
aprender/.pv		aprender/.pv
benches		benches
book		book
contracts		contracts
crates		crates
docs		docs
examples		examples
fuzz		fuzz
golden_traces		golden_traces
playbooks		playbooks
proofs		proofs
proptest-regressions/format		proptest-regressions/format
provable-contracts/contracts		provable-contracts/contracts
qa_artifacts		qa_artifacts
scripts		scripts
src		src
tests		tests
.bashrsignore		.bashrsignore
.cargo-mutants.toml		.cargo-mutants.toml
.clippy.toml		.clippy.toml
.gitignore		.gitignore
.pmat-baseline.json		.pmat-baseline.json
.pmat-gates.toml		.pmat-gates.toml
.pmat-metrics.toml		.pmat-metrics.toml
.pmat.yaml		.pmat.yaml
.proptest.toml		.proptest.toml
.pv.toml		.pv.toml
CHANGELOG.md		CHANGELOG.md
CITATION.cff		CITATION.cff
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
GOLDEN_TRACE_INTEGRATION_SUMMARY.md		GOLDEN_TRACE_INTEGRATION_SUMMARY.md
LICENSE		LICENSE
Makefile		Makefile
PROVABLE_CONTRACTS_ANALYSIS.md		PROVABLE_CONTRACTS_ANALYSIS.md
PROVABLE_CONTRACTS_INVENTORY.txt		PROVABLE_CONTRACTS_INVENTORY.txt
PROVABLE_CONTRACTS_README.md		PROVABLE_CONTRACTS_README.md
PROVABLE_CONTRACTS_SUMMARY.txt		PROVABLE_CONTRACTS_SUMMARY.txt
QA-REPORT.md		QA-REPORT.md
README.md		README.md
RELEASE.md		RELEASE.md
RELEASE_NOTES_v0.6.0.md		RELEASE_NOTES_v0.6.0.md
RELEASE_NOTES_v0.7.6.md		RELEASE_NOTES_v0.7.6.md
ROADMAP.md		ROADMAP.md
VERIFICATION_REPORT.md		VERIFICATION_REPORT.md
batuta.toml		batuta.toml
build.rs		build.rs
build_codegen.rs		build_codegen.rs
build_parsing.rs		build_parsing.rs
codecov.yml		codecov.yml
coverage-analysis.md		coverage-analysis.md
defect-report-20251216-134026.json		defect-report-20251216-134026.json
deny.toml		deny.toml
entrenar-contract-falsification-sweep.md		entrenar-contract-falsification-sweep.md
justfile		justfile
mutation-testing-setup.md		mutation-testing-setup.md
notes-poisson.md		notes-poisson.md
pmat.toml		pmat.toml
renacer.toml		renacer.toml
reproduce_qa_016.rs		reproduce_qa_016.rs
rust-toolchain.toml		rust-toolchain.toml
rustfmt.toml		rustfmt.toml
showcase_qa_report.md		showcase_qa_report.md
todos.md		todos.md
trace_h1.json		trace_h1.json

Folders and files

Latest commit

History

Repository files navigation

aprender

Table of Contents

What is aprender?

Features

Installation

Optional Features

Quick Start

Algorithms

Linear Models

Clustering

Classification

Decision Trees

NLP and Text

Time Series

Graph Analysis

Bayesian Inference

Neural Networks

Decomposition

Model Serialization (APR v2)

Online Learning

Recommendation Systems

APR Format

Architecture

Quality

Sovereign AI Stack

Related Crates

Documentation

Contributing

License

About

Topics

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases 17

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages