Module Kaun_hf
HuggingFace Hub integration.
Download pretrained model weights and configuration files from the HuggingFace Hub. Supports single-file and sharded SafeTensors checkpoints, caching, authentication, and offline mode.
let config =
Kaun_hf.load_config ~model_id:"bert-base-uncased" ()
in
let weights =
Kaun_hf.load_weights ~model_id:"bert-base-uncased" ()
in
(* weights : (string * Kaun.Ptree.tensor) list *)Types
The type for repository revisions.
Downloading
val download_file :
?token:string ->
?cache_dir:string ->
?offline:bool ->
?revision:revision ->
model_id:string ->
filename:string ->
unit ->
stringdownload_file ~model_id ~filename () is the local path to filename from the repository model_id.
The file is downloaded to the cache on first access and served from there on subsequent calls.
token is a HuggingFace API token for private repositories. Defaults to the value of HF_TOKEN.
cache_dir defaults to {RAVEN_CACHE_ROOT}/huggingface, or {XDG_CACHE_HOME}/raven/huggingface when unset.
offline defaults to false. When true, only cached files are returned.
revision defaults to Main.
Raises Failure if the download fails or the file is not cached in offline mode.
Loading
val load_config :
?token:string ->
?cache_dir:string ->
?offline:bool ->
?revision:revision ->
model_id:string ->
unit ->
Jsont.jsonload_config ~model_id () is the parsed config.json from model_id.
Parameters are the same as download_file.
Raises Failure on download or JSON parse errors.
val load_weights :
?token:string ->
?cache_dir:string ->
?offline:bool ->
?revision:revision ->
model_id:string ->
unit ->
(string * Kaun.Ptree.tensor) listload_weights ~model_id () is the list of (name, tensor) pairs from model_id's SafeTensors checkpoint.
Handles sharded checkpoints transparently: when model.safetensors.index.json is present, all referenced shards are downloaded and merged. Falls back to model.safetensors when no index exists.
Tensor names are the raw keys from the SafeTensors file (e.g. "bert.encoder.layer.0.attention.self.query.weight"). Model code is responsible for mapping these to its own parameter structure.
Parameters are the same as download_file.
Raises Failure if no SafeTensors files are found, or on download/parse errors.