Module `Kaun_hf`

HuggingFace Hub integration.

Download pretrained model weights and configuration files from the HuggingFace Hub. Supports single-file and sharded SafeTensors checkpoints, caching, authentication, and offline mode.

  (* config : Jsont.json *)
  let config =
    Kaun_hf.load_config ~model_id:"bert-base-uncased" ()

  (* weights : (string * Kaun.Ptree.tensor) list *)
  let weights =
    Kaun_hf.load_weights ~model_id:"bert-base-uncased" ()

Types

type revision =
  | Main  (* The default branch. *)
  | Rev of string  (* A tag, branch name, or commit hash. *)

The type for repository revisions.
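
As a small illustration of how a caller might build a revision value, the sketch below maps an optional user-supplied string to a revision. The type is redeclared locally only so that the snippet compiles on its own, and revision_of_option is a hypothetical helper, not part of Kaun_hf.

```ocaml
(* Local mirror of Kaun_hf.revision so the sketch compiles on its own;
   with the library installed, use Kaun_hf.Main and Kaun_hf.Rev directly. *)
type revision = Main | Rev of string

(* Hypothetical helper: map an optional revision string (for example from
   a command-line flag) to a revision value. No string means the default
   branch. *)
let revision_of_option : string option -> revision = function
  | None -> Main
  | Some r -> Rev r
```

Pinning a commit hash with Rev keeps repeated runs on the same files, whereas Main follows whatever the default branch currently points to.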

Downloading

val download_file : 
  ?token:string ->
  ?cache_dir:string ->
  ?offline:bool ->
  ?revision:revision ->
  model_id:string ->
  filename:string ->
  unit ->
  string

download_file ~model_id ~filename () is the local path to filename from the repository model_id.

The file is downloaded to the cache on first access and served from there on subsequent calls.

token is a HuggingFace API token for private repositories. Defaults to the value of the HF_TOKEN environment variable.

cache_dir defaults to {RAVEN_CACHE_ROOT}/huggingface, or to {XDG_CACHE_HOME}/raven/huggingface when RAVEN_CACHE_ROOT is unset.

offline defaults to false. When true, only files already in the cache are returned.

revision defaults to Main.

Raises Failure if the download fails or the file is not cached in offline mode.
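
Since errors are reported by raising Failure, callers who prefer to handle them as values can wrap the call. The following is a minimal sketch; attempt is a hypothetical helper, not part of Kaun_hf, and it works for any of the functions in this module that raise Failure.

```ocaml
(* Sketch only: [attempt] is a hypothetical helper, not part of Kaun_hf.
   It runs a thunk and turns a raised Failure into an Error value. *)
let attempt (f : unit -> 'a) : ('a, string) result =
  try Ok (f ()) with Failure msg -> Error msg

(* Intended use (requires the library):
     attempt (fun () ->
       Kaun_hf.download_file ~model_id:"bert-base-uncased"
         ~filename:"config.json" ())
*)
```

Note that a single Failure covers both a failed download and a cache miss in offline mode, so the Error message is the only way to tell the two apart.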

Loading

val load_config : 
  ?token:string ->
  ?cache_dir:string ->
  ?offline:bool ->
  ?revision:revision ->
  model_id:string ->
  unit ->
  Jsont.json

load_config ~model_id () is the parsed config.json from model_id.

The optional parameters and model_id are the same as for download_file.

Raises Failure on download or JSON parse errors.

val load_weights : 
  ?token:string ->
  ?cache_dir:string ->
  ?offline:bool ->
  ?revision:revision ->
  model_id:string ->
  unit ->
  (string * Kaun.Ptree.tensor) list

load_weights ~model_id () is the list of (name, tensor) pairs from model_id's SafeTensors checkpoint.

Handles sharded checkpoints transparently: when model.safetensors.index.json is present, all referenced shards are downloaded and merged. Falls back to model.safetensors when no index exists.
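
To illustrate the shard resolution described above, the sketch below computes which shard files an index refers to. It assumes the usual Hub layout in which the index holds a "weight_map" object from tensor names to shard filenames, modelled here as an association list. shards_of_weight_map and the tensor and file names are hypothetical, and this is not necessarily how load_weights is implemented.

```ocaml
(* Sketch only: [weight_map] models the "weight_map" object of a
   model.safetensors.index.json as (tensor name, shard filename) pairs.
   [shards_of_weight_map] is a hypothetical helper, not part of Kaun_hf. *)
let shards_of_weight_map (weight_map : (string * string) list) : string list =
  weight_map
  |> List.map snd                   (* keep only the shard filenames *)
  |> List.sort_uniq String.compare  (* list each shard once, in sorted order *)

(* Three tensors spread over two shards (illustrative names). *)
let example_shards =
  shards_of_weight_map
    [ ("embeddings.weight", "model-00001-of-00002.safetensors");
      ("encoder.layer.0.weight", "model-00001-of-00002.safetensors");
      ("encoder.layer.1.weight", "model-00002-of-00002.safetensors") ]
```

Each filename in the resulting list would then be fetched once, and the tensors from all shards concatenated into a single list.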

Tensor names are the raw keys from the SafeTensors file (e.g. "bert.encoder.layer.0.attention.self.query.weight"). Model code is responsible for mapping these to its own parameter structure.

The optional parameters and model_id are the same as for download_file.

Raises Failure if no SafeTensors files are found, or on download/parse errors.
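
Because tensor names are raw checkpoint keys, model code typically filters and renames them before use. The following sketch shows one way to select a sub-tree by prefix and to look up a single parameter. It is written against a generic (string * 'a) list so that it compiles without the library; strip_prefix, with_prefix and find_weight are hypothetical helpers, not part of Kaun_hf.

```ocaml
(* Sketch only: works on any (name, value) list, so 'a stands in for
   Kaun.Ptree.tensor. All three helpers are hypothetical. *)

(* [strip_prefix ~prefix name] is [Some rest] when [name] starts with
   [prefix], and [None] otherwise. *)
let strip_prefix ~prefix name =
  let n = String.length prefix in
  if String.length name >= n && String.sub name 0 n = prefix
  then Some (String.sub name n (String.length name - n))
  else None

(* Keep the entries under [prefix], with the prefix removed from their names. *)
let with_prefix ~prefix (weights : (string * 'a) list) : (string * 'a) list =
  List.filter_map
    (fun (name, t) ->
      Option.map (fun rest -> (rest, t)) (strip_prefix ~prefix name))
    weights

(* Look up one parameter by its full raw key. *)
let find_weight name weights = List.assoc_opt name weights
```

For example, with_prefix ~prefix:"bert." applied to the result of load_weights would keep only the keys under "bert." and drop that prefix, leaving names such as "encoder.layer.0.attention.self.query.weight".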

Cache management

val clear_cache : ?cache_dir:string -> ?model_id:string -> unit -> unit

clear_cache () removes all cached files.

When model_id is given, only that model's cache is removed.
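
The cache_dir default documented for download_file can be sketched as a small lookup. This only mirrors the documented rule and is not the library's code: default_cache_dir is a hypothetical helper, the environment lookup is passed in as getenv so the rule can be exercised without changing the real environment, and returning None when both variables are unset is an assumption, since that case is not documented here.

```ocaml
(* Sketch only: mirrors the documented default for cache_dir.
   Pass Sys.getenv_opt as [getenv] to read the real environment.
   Returning None when both variables are unset is an assumption. *)
let default_cache_dir ~(getenv : string -> string option) () : string option =
  match getenv "RAVEN_CACHE_ROOT" with
  | Some root -> Some (Filename.concat root "huggingface")
  | None ->
    (match getenv "XDG_CACHE_HOME" with
     | Some xdg ->
       Some (Filename.concat (Filename.concat xdg "raven") "huggingface")
     | None -> None)
```

Printing the result of default_cache_dir ~getenv:Sys.getenv_opt () before calling clear_cache () is a cheap way to see which directory the two variables currently select.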