Package: ggmlR 0.7.9

ggmlR: 'GGML' Tensor Operations for Machine Learning

Provides 'R' bindings to the 'GGML' tensor library for machine learning, optimized for 'Vulkan' GPU acceleration with a transparent CPU fallback. The package features a 'Keras'-like sequential API and a 'PyTorch'-style 'autograd' engine for building, training, and deploying neural networks. Key capabilities include high-performance 5D tensor operations, 'f16' precision, and efficient quantization. It supports native 'ONNX' model import (50+ operators) and 'GGUF' weight loading from the 'llama.cpp' and 'Hugging Face' ecosystems. Designed for zero-overhead inference via dedicated weight buffering, it integrates seamlessly as a 'parsnip' engine for 'tidymodels' and provides first-class learners for the 'mlr3' framework. See <https://github.com/ggml-org/ggml> for more information about the underlying library.

Authors:Yuri Baramykov [aut, cre], Georgi Gerganov [ctb, cph], Jeffrey Quesnelle [ctb, cph], Bowen Peng [ctb, cph], Mozilla Foundation [ctb, cph]

ggmlR_0.7.9.tar.gz
ggmlR_0.7.9.zip(r-4.7)ggmlR_0.7.9.zip(r-4.6)ggmlR_0.7.9.zip(r-4.5)
ggmlR_0.7.9.tgz(r-4.6-x86_64)ggmlR_0.7.9.tgz(r-4.6-arm64)ggmlR_0.7.9.tgz(r-4.5-x86_64)ggmlR_0.7.9.tgz(r-4.5-arm64)
ggmlR_0.7.9.tar.gz(r-4.7-arm64)ggmlR_0.7.9.tar.gz(r-4.7-x86_64)ggmlR_0.7.9.tar.gz(r-4.6-arm64)ggmlR_0.7.9.tar.gz(r-4.6-x86_64)
ggmlR_0.7.9.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
DESCRIPTION |NEWS
card.svg |card.png
ggmlR/json (API)

# Install 'ggmlR' in R:

install.packages('ggmlR', repos = c('https://zabis13.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/zabis13/ggmlr/issues

Uses libs:

c++– GNU Standard C++ Library v3
openmp– GCC OpenMP (GOMP) support library

On CRAN:

8.76 score 13 stars 2 packages 79 scripts 634 downloads 652 exports 2 dependencies

Last updated from:8f3dd0f448. Checks:13 OK. Indexed: yes.

Target	Result	Time	Files	Syslog
linux-devel-arm64	OK	277
linux-devel-x86_64	OK	317
source / vignettes	OK	421
linux-release-arm64	OK	283
linux-release-x86_64	OK	314
macos-release-arm64	OK	229
macos-release-x86_64	OK	396
macos-oldrel-arm64	OK	230
macos-oldrel-x86_64	OK	414
windows-devel	OK	329
windows-release	OK	293
windows-oldrel	OK	329
wasm-release	OK	230

Exports:ag_add ag_batch_norm ag_clamp ag_cross_entropy_loss ag_dataloader ag_default_device ag_default_dtype ag_device ag_dropout ag_dtype ag_embedding ag_eval ag_exp ag_gradcheck ag_linear ag_load_model ag_log ag_matmul ag_mean ag_mse_loss ag_mul ag_multihead_attention ag_param ag_pow ag_relu ag_reshape ag_save_model ag_scale ag_sequential ag_sigmoid ag_softmax ag_softmax_cross_entropy_loss ag_sub ag_sum ag_tanh ag_tensor ag_to_device ag_train ag_transpose backward clip_grad_norm compile dequantize_row_iq1_m dequantize_row_iq1_s dequantize_row_iq2_s dequantize_row_iq2_xs dequantize_row_iq2_xxs dequantize_row_iq3_s dequantize_row_iq3_xxs dequantize_row_iq4_nl dequantize_row_iq4_xs dequantize_row_mxfp4 dequantize_row_nvfp4 dequantize_row_q1_0 dequantize_row_q2_K dequantize_row_q3_K dequantize_row_q4_0 dequantize_row_q4_1 dequantize_row_q4_K dequantize_row_q5_0 dequantize_row_q5_1 dequantize_row_q5_K dequantize_row_q6_K dequantize_row_q8_0 dequantize_row_q8_K dequantize_row_tq1_0 dequantize_row_tq2_0 dp_train evaluate fit ggml_abort_is_r_enabled ggml_abs ggml_abs_inplace ggml_add ggml_add_inplace ggml_add_rel_pos ggml_add1 ggml_apply ggml_arange ggml_are_same_layout ggml_are_same_shape ggml_are_same_stride ggml_argmax ggml_argsort ggml_backend_alloc_ctx_tensors ggml_backend_buffer_clear ggml_backend_buffer_free ggml_backend_buffer_get_size ggml_backend_buffer_get_usage ggml_backend_buffer_is_host ggml_backend_buffer_is_multi_buffer ggml_backend_buffer_name ggml_backend_buffer_reset ggml_backend_buffer_set_usage ggml_backend_buffer_usage_any ggml_backend_buffer_usage_compute ggml_backend_buffer_usage_weights ggml_backend_cpu_init ggml_backend_cpu_set_n_threads ggml_backend_dev_by_name ggml_backend_dev_by_type ggml_backend_dev_count ggml_backend_dev_description ggml_backend_dev_get ggml_backend_dev_get_props ggml_backend_dev_init ggml_backend_dev_memory ggml_backend_dev_name ggml_backend_dev_offload_op ggml_backend_dev_supports_buft ggml_backend_dev_supports_op ggml_backend_dev_type ggml_backend_device_register ggml_backend_device_type_accel ggml_backend_device_type_cpu ggml_backend_device_type_gpu ggml_backend_device_type_igpu ggml_backend_event_free ggml_backend_event_new ggml_backend_event_record ggml_backend_event_synchronize ggml_backend_event_wait ggml_backend_free ggml_backend_get_device ggml_backend_graph_compute ggml_backend_graph_compute_async ggml_backend_graph_plan_compute ggml_backend_graph_plan_create ggml_backend_graph_plan_free ggml_backend_init_best ggml_backend_init_by_name ggml_backend_init_by_type ggml_backend_load ggml_backend_load_all ggml_backend_meta_device ggml_backend_multi_buffer_alloc_buffer ggml_backend_multi_buffer_set_usage ggml_backend_name ggml_backend_reg_by_name ggml_backend_reg_count ggml_backend_reg_dev_count ggml_backend_reg_dev_get ggml_backend_reg_get ggml_backend_reg_name ggml_backend_register ggml_backend_sched_alloc_graph ggml_backend_sched_free ggml_backend_sched_get_backend ggml_backend_sched_get_n_backends ggml_backend_sched_get_n_copies ggml_backend_sched_get_n_splits ggml_backend_sched_get_tensor_backend ggml_backend_sched_graph_compute ggml_backend_sched_graph_compute_async ggml_backend_sched_new ggml_backend_sched_reserve ggml_backend_sched_reset ggml_backend_sched_set_tensor_backend ggml_backend_sched_synchronize ggml_backend_synchronize ggml_backend_tensor_copy_async ggml_backend_tensor_get_and_sync ggml_backend_tensor_get_async ggml_backend_tensor_get_data ggml_backend_tensor_get_f32_first ggml_backend_tensor_set_async ggml_backend_tensor_set_data ggml_backend_unload ggml_batch_norm ggml_blck_size ggml_build_forward_expand ggml_callback_early_stopping ggml_can_repeat ggml_ceil ggml_ceil_inplace ggml_clamp ggml_compile ggml_concat ggml_cont ggml_conv_1d ggml_conv_1d_dw ggml_conv_2d ggml_conv_2d_direct ggml_conv_2d_dw ggml_conv_2d_dw_direct ggml_conv_transpose_1d ggml_conv_transpose_2d_p0 ggml_cos ggml_count_equal ggml_cpu_add ggml_cpu_features ggml_cpu_get_rvv_vlen ggml_cpu_get_sve_cnt ggml_cpu_has_amx_int8 ggml_cpu_has_arm_fma ggml_cpu_has_avx ggml_cpu_has_avx_vnni ggml_cpu_has_avx2 ggml_cpu_has_avx512 ggml_cpu_has_avx512_bf16 ggml_cpu_has_avx512_vbmi ggml_cpu_has_avx512_vnni ggml_cpu_has_bmi2 ggml_cpu_has_dotprod ggml_cpu_has_f16c ggml_cpu_has_fma ggml_cpu_has_fp16_va ggml_cpu_has_llamafile ggml_cpu_has_matmul_int8 ggml_cpu_has_neon ggml_cpu_has_riscv_v ggml_cpu_has_sme ggml_cpu_has_sse3 ggml_cpu_has_ssse3 ggml_cpu_has_sve ggml_cpu_has_vsx ggml_cpu_has_vxe ggml_cpu_has_wasm_simd ggml_cpu_mul ggml_cpy ggml_cycles ggml_cycles_per_ms ggml_default_mlp ggml_dense ggml_diag ggml_diag_mask_inf ggml_diag_mask_inf_inplace ggml_diag_mask_zero ggml_div ggml_div_inplace ggml_dup ggml_dup_inplace ggml_dup_tensor ggml_element_size ggml_elu ggml_elu_inplace ggml_embedding ggml_estimate_memory ggml_evaluate ggml_exp ggml_exp_inplace ggml_extract ggml_fit ggml_fit_opt ggml_flash_attn_back ggml_flash_attn_ext ggml_floor ggml_floor_inplace ggml_free ggml_freeze_weights ggml_ftype_to_ggml_type ggml_gallocr_alloc_graph ggml_gallocr_free ggml_gallocr_get_buffer_size ggml_gallocr_new ggml_gallocr_reserve ggml_geglu ggml_geglu_quick ggml_geglu_split ggml_gelu ggml_gelu_erf ggml_gelu_inplace ggml_gelu_quick ggml_get_f32 ggml_get_f32_nd ggml_get_first_tensor ggml_get_i32 ggml_get_i32_nd ggml_get_layer ggml_get_max_tensor_size ggml_get_mem_size ggml_get_n_threads ggml_get_name ggml_get_next_tensor ggml_get_no_alloc ggml_get_op_params ggml_get_op_params_f32 ggml_get_op_params_i32 ggml_get_rel_pos ggml_get_rows ggml_get_rows_back ggml_get_unary_op ggml_glu GGML_GLU_OP_GEGLU GGML_GLU_OP_GEGLU_ERF GGML_GLU_OP_GEGLU_QUICK GGML_GLU_OP_REGLU GGML_GLU_OP_SWIGLU GGML_GLU_OP_SWIGLU_OAI ggml_glu_split ggml_graph_compute ggml_graph_compute_with_ctx ggml_graph_dump_dot ggml_graph_get_tensor ggml_graph_n_nodes ggml_graph_node ggml_graph_overhead ggml_graph_print ggml_graph_reset ggml_graph_view ggml_group_norm ggml_group_norm_inplace ggml_gru ggml_hardsigmoid ggml_hardswish ggml_im2col ggml_init ggml_init_auto ggml_inject ggml_input ggml_is_available ggml_is_contiguous ggml_is_contiguous_0 ggml_is_contiguous_1 ggml_is_contiguous_2 ggml_is_contiguous_channels ggml_is_contiguous_rows ggml_is_contiguously_allocated ggml_is_permuted ggml_is_quantized ggml_is_transposed ggml_l2_norm ggml_l2_norm_inplace ggml_layer_add ggml_layer_batch_norm ggml_layer_concatenate ggml_layer_conv_1d ggml_layer_conv_2d ggml_layer_dense ggml_layer_dropout ggml_layer_embedding ggml_layer_flatten ggml_layer_global_average_pooling_2d ggml_layer_global_max_pooling_2d ggml_layer_gru ggml_layer_lstm ggml_layer_max_pooling_2d ggml_leaky_relu ggml_load_model ggml_load_weights ggml_log ggml_log_inplace ggml_log_is_r_enabled ggml_log_set_default ggml_log_set_r ggml_lstm ggml_marshal_model ggml_mean ggml_model ggml_model_backend ggml_model_sequential ggml_mul ggml_mul_inplace ggml_mul_mat ggml_mul_mat_id ggml_n_dims ggml_nbytes ggml_neg ggml_neg_inplace ggml_nelements ggml_new_f32 ggml_new_i32 ggml_new_tensor ggml_new_tensor_1d ggml_new_tensor_2d ggml_new_tensor_3d ggml_new_tensor_4d ggml_norm ggml_norm_inplace ggml_nrows ggml_op_can_inplace ggml_op_desc ggml_op_name GGML_OP_POOL_AVG GGML_OP_POOL_MAX ggml_op_symbol ggml_ops_registry ggml_opt_alloc ggml_opt_context_optimizer_type ggml_opt_dataset_data ggml_opt_dataset_free ggml_opt_dataset_get_batch ggml_opt_dataset_init ggml_opt_dataset_labels ggml_opt_dataset_ndata ggml_opt_dataset_shuffle ggml_opt_dataset_weights ggml_opt_default_params ggml_opt_epoch ggml_opt_eval ggml_opt_fit ggml_opt_free ggml_opt_get_lr ggml_opt_grad_acc ggml_opt_init ggml_opt_init_for_fit ggml_opt_inputs ggml_opt_labels ggml_opt_loss ggml_opt_loss_type_cross_entropy ggml_opt_loss_type_mean ggml_opt_loss_type_mse ggml_opt_loss_type_sum ggml_opt_loss_type_weighted_mse ggml_opt_ncorrect ggml_opt_optimizer_name ggml_opt_optimizer_type_adamw ggml_opt_optimizer_type_sgd ggml_opt_outputs ggml_opt_pred ggml_opt_prepare_alloc ggml_opt_reset ggml_opt_result_accuracy ggml_opt_result_free ggml_opt_result_init ggml_opt_result_loss ggml_opt_result_ndata ggml_opt_result_pred ggml_opt_result_reset ggml_opt_set_lr ggml_opt_static_graphs ggml_out_prod ggml_pad ggml_pad_reflect_1d ggml_permute ggml_pool_1d ggml_pool_2d ggml_pop_layer ggml_predict ggml_predict_classes ggml_print_mem_status ggml_print_objects ggml_quant_block_info ggml_quantize_chunk ggml_quantize_free ggml_quantize_init ggml_quantize_requires_imatrix ggml_reglu ggml_reglu_split ggml_relu ggml_relu_inplace ggml_repeat ggml_repeat_back ggml_reset ggml_reshape_1d ggml_reshape_2d ggml_reshape_3d ggml_reshape_4d ggml_result ggml_rms_norm ggml_rms_norm_back ggml_rms_norm_inplace ggml_roll ggml_rope ggml_rope_ext ggml_rope_ext_back ggml_rope_ext_inplace ggml_rope_inplace ggml_rope_multi ggml_rope_multi_inplace GGML_ROPE_TYPE_MROPE GGML_ROPE_TYPE_NEOX GGML_ROPE_TYPE_NORM GGML_ROPE_TYPE_VISION ggml_round ggml_round_inplace ggml_run ggml_save_model ggml_save_weights ggml_scale ggml_scale_inplace GGML_SCALE_MODE_BICUBIC GGML_SCALE_MODE_BILINEAR GGML_SCALE_MODE_NEAREST ggml_schedule_cosine_decay ggml_schedule_reduce_on_plateau ggml_schedule_step_decay ggml_set ggml_set_1d ggml_set_2d ggml_set_abort_callback_default ggml_set_abort_callback_r ggml_set_f32 ggml_set_f32_nd ggml_set_i32 ggml_set_i32_nd ggml_set_input ggml_set_n_threads ggml_set_name ggml_set_no_alloc ggml_set_omp_threads ggml_set_op_params ggml_set_op_params_f32 ggml_set_op_params_i32 ggml_set_output ggml_set_param ggml_set_seed ggml_set_zero ggml_sgn ggml_sigmoid ggml_sigmoid_inplace ggml_silu ggml_silu_back ggml_silu_inplace ggml_sin ggml_soft_max ggml_soft_max_ext ggml_soft_max_ext_back ggml_soft_max_ext_back_inplace ggml_soft_max_ext_inplace ggml_soft_max_inplace ggml_softplus ggml_softplus_inplace GGML_SORT_ORDER_ASC GGML_SORT_ORDER_DESC ggml_sqr ggml_sqr_inplace ggml_sqrt ggml_sqrt_inplace ggml_step ggml_sub ggml_sub_inplace ggml_sum ggml_sum_rows ggml_swiglu ggml_swiglu_split ggml_tanh ggml_tanh_inplace ggml_task ggml_tensor_copy ggml_tensor_nb ggml_tensor_num ggml_tensor_overhead ggml_tensor_set_f32_scalar ggml_tensor_shape ggml_tensor_type ggml_test ggml_time_init ggml_time_ms ggml_time_us ggml_timestep_embedding ggml_top_k ggml_training_history ggml_transpose GGML_TYPE_BF16 GGML_TYPE_F16 GGML_TYPE_F32 GGML_TYPE_I32 ggml_type_name GGML_TYPE_Q2_K GGML_TYPE_Q3_K GGML_TYPE_Q4_0 GGML_TYPE_Q4_1 GGML_TYPE_Q4_K GGML_TYPE_Q5_K GGML_TYPE_Q6_K GGML_TYPE_Q8_0 ggml_type_size ggml_type_sizef ggml_unary_op_name ggml_unfreeze_weights ggml_unmarshal_model ggml_upscale ggml_used_mem ggml_version ggml_view_1d ggml_view_2d ggml_view_3d ggml_view_4d ggml_view_tensor ggml_vulkan_available ggml_vulkan_backend_name ggml_vulkan_device_caps ggml_vulkan_device_count ggml_vulkan_device_description ggml_vulkan_device_memory ggml_vulkan_free ggml_vulkan_init ggml_vulkan_is_backend ggml_vulkan_list_devices ggml_vulkan_status ggml_win_part ggml_win_unpart ggml_with_temp_ctx ggmlr_parsnip_fit_classif ggmlr_parsnip_fit_regr gguf_free gguf_load gguf_metadata gguf_tensor_data gguf_tensor_info gguf_tensor_names iq2xs_free_impl iq2xs_init_impl iq3xs_free_impl iq3xs_init_impl lr_scheduler_cosine lr_scheduler_step nn_topo_sort onnx_device_info onnx_inputs onnx_load onnx_run onnx_summary optimizer_adam optimizer_sgd quantize_iq1_m quantize_iq1_s quantize_iq2_s quantize_iq2_xs quantize_iq2_xxs quantize_iq3_s quantize_iq3_xxs quantize_iq4_nl quantize_iq4_xs quantize_mxfp4 quantize_nvfp4 quantize_q1_0 quantize_q2_K quantize_q3_K quantize_q4_0 quantize_q4_1 quantize_q4_K quantize_q5_0 quantize_q5_1 quantize_q5_K quantize_q6_K quantize_q8_0 quantize_row_iq2_s_ref quantize_row_iq3_s_ref quantize_row_iq3_xxs_ref quantize_row_iq4_nl_ref quantize_row_iq4_xs_ref quantize_row_mxfp4_ref quantize_row_q2_K_ref quantize_row_q3_K_ref quantize_row_q4_0_ref quantize_row_q4_1_ref quantize_row_q4_K_ref quantize_row_q5_0_ref quantize_row_q5_1_ref quantize_row_q5_K_ref quantize_row_q6_K_ref quantize_row_q8_0_ref quantize_row_q8_1_ref quantize_row_q8_K_ref quantize_row_tq1_0_ref quantize_row_tq2_0_ref quantize_tq1_0 quantize_tq2_0 RunGGML with_grad_tape

Dependencies:generics R6

Single-cell GPU Acceleration with Seurat

The one call: RunGGML() | A worked example | Normalize and scale on the GPU | PCA (embed) | UMAP (umap) | Neighbour graphs (neighbors) | Provenance | The layers underneath | Bioconductor: SingleCellExperiment | What is and isn't accelerated

Rendered fromsingle-cell-seurat.Rmdusingknitr::rmarkdown

Last update: 2026-06-21
Started: 2026-06-21

Quickstart: from data to prediction in ~10 lines

Rendered fromquickstart.Rmdusingknitr::rmarkdown

Last update: 2026-06-08
Started: 2026-06-08

Keras-like API in ggmlR

Installation | 1. Sequential API | 1.1 Prepare data — iris (3-class classification) | 1.2 Build and compile | 1.3 Train | 1.4 Evaluate and predict | 1.5 Save and reload | 2. Functional API | 2.1 Single-input DAG — residual-style network on iris | 2.2 Multi-input model — mtcars regression | 3. Callbacks | 3.1 Early stopping | 3.2 Learning-rate schedulers | 4. Autograd Engine (ag_*) | 4.1 Prepare data (iris, col-major) | 4.2 Build a model with ag_sequential | 4.3 Training loop | 4.4 Inference and accuracy | 4.5 LR scheduler + gradient clipping | 4.6 Manual model from raw ag_param | 5. GPU / device selection | API summary

Rendered fromkeras-like-api.Rmdusingknitr::rmarkdown

Last update: 2026-06-08
Started: 2026-04-13

Autograd Engine

1. Core primitives | ag_tensor and ag_param | Forward pass and gradient tape | 2. Built-in operations | 3. ag_sequential — module API | 4. Optimizers | Training loop | 5. LR schedulers | 6. Gradient clipping | 7. Dataloader | 8. Eval mode and inference | 9. Raw ag_param — full manual control | 10. Mixed precision | 11. Data-parallel training

Rendered fromautograd-engine.Rmdusingknitr::rmarkdown

Last update: 2026-06-08
Started: 2026-04-13

tidymodels / parsnip Integration

1. Classification | 2. Regression | 3. Engine parameters | 4. Resampling with rsample | 5. Recipes for preprocessing | 6. Hyperparameter tuning | 7. Comparison with other engines | 8. Extracting the fitted engine and fit time | 9. Limitations of the engine object | Summary

Rendered fromtidymodels-integration.Rmdusingknitr::rmarkdown

Last update: 2026-06-08
Started: 2026-04-13

mlr3 Integration

Quick start | Classification | Regression | Learner parameters | GPU acceleration | Custom model architecture | Resampling and benchmarking | Hyperparameter tuning | Callbacks | Observation weights | Marshal and parallel execution | Summary

Rendered frommlr3-integration.Rmdusingknitr::rmarkdown

Last update: 2026-06-08
Started: 2026-04-13

GPU / Vulkan Backend

1. Check availability | 2. Device enumeration | 3. Autograd device selection | 4. Mixed precision (f16 / bf16) | 5. Vulkan memory info | 6. Multi-GPU | 7. Sequential / functional API on GPU | 8. ONNX inference on GPU | 9. Build configuration

Rendered fromgpu-vulkan.Rmdusingknitr::rmarkdown

Last update: 2026-06-08
Started: 2026-04-13

1. Quantization formats | 2. Quantize and dequantize | 3. K-quants (better quality) | 4. IQ quants — importance matrix | 5. Comparing formats | 6. Reference (row-level) functions

Rendered fromquantization.Rmdusingknitr::rmarkdown

Last update: 2026-06-08
Started: 2026-02-06

Data-Parallel Training

1. Concept | 2. Minimal example | 3. Multi-GPU | 4. Gradient clipping | 5. ag_dataloader — batched training loop | 6. Full example

Rendered fromdata-parallel-training.Rmdusingknitr::rmarkdown

Last update: 2026-06-08
Started: 2026-04-13

ONNX Model Import

1. Load and inspect a model | 2. Run inference | 3. GPU inference | 4. Dynamic input shapes | 5. FP16 inference | 6. Supported operators | 7. Examples | 8. Debugging tips

Rendered fromonnx-import.Rmdusingknitr::rmarkdown

Last update: 2026-06-08
Started: 2026-04-13

Using ggmlR as a Backend in Your Package

Part A — Using the ggmlR R API from your package | A1. Depend on ggmlR | A2. Everything the examples use is exported | A3. tidymodels / mlr3 engines | Part B — Linking ggml from your package's C/C++ code | 1. What ggmlR exports | 2. DESCRIPTION | 3. src/Makevars | 4. configure — detect ggmlR at build time | 5. C code — minimal example | 6. R wrapper | 7. Thread count | 8. Vulkan in downstream packages | 9. Real-world references

Rendered fromembedding-ggmlR.Rmdusingknitr::rmarkdown

Last update: 2026-06-08
Started: 2026-04-13

Citation

Development and contributors

Readme and manuals

Help Manual

Help page	Topics
Element-wise addition with broadcasting	ag_add
Create a Batch Normalisation layer	ag_batch_norm
Element-wise clamp	ag_clamp
Categorical Cross-Entropy loss	ag_cross_entropy_loss
Create a mini-batch data loader	ag_dataloader
Return the current default compute device	ag_default_device
Return the current default dtype for GPU operations	ag_default_dtype
Set the default compute device for ag_* operations	ag_device
Create a Dropout layer	ag_dropout
Set the default floating-point precision for ag_* GPU operations	ag_dtype
Create an Embedding layer	ag_embedding
Switch a layer or sequential model to eval mode	ag_eval
Element-wise exponential	ag_exp
Numerical gradient check (like torch.autograd.gradcheck)	ag_gradcheck
Create a dense layer with learnable parameters	ag_linear
Load an autograd module from a saved state	ag_load_model
Element-wise natural logarithm	ag_log
Matrix multiplication	ag_matmul
Mean of elements (or along a dim)	ag_mean
Mean Squared Error loss	ag_mse_loss
Element-wise multiplication	ag_mul
Create a Multi-Head Attention layer	ag_multihead_attention
Create a parameter tensor (gradient tracked)	ag_param
Element-wise power	ag_pow
ReLU activation	ag_relu
Reshape tensor	ag_reshape
Save an autograd module's state to disk	ag_save_model
Scale tensor by a scalar constant	ag_scale
Create a sequential container of layers	ag_sequential
Sigmoid activation	ag_sigmoid
Softmax activation (column-wise)	ag_softmax
Fused softmax + cross-entropy loss (numerically stable)	ag_softmax_cross_entropy_loss
Element-wise subtraction	ag_sub
Sum all elements (or along a dim): out = sum(x)	ag_sum
Tanh activation	ag_tanh
Create a dynamic tensor (no gradient tracking)	ag_tensor
Move a tensor to the specified device	ag_to_device
Switch a layer or sequential model to training mode	ag_train
Transpose a tensor	ag_transpose
Augment new data with predictions from a fitted ggml parsnip model	augment.ggmlr_parsnip_model
Run backward pass from a scalar loss tensor	backward
Clip gradients by global L2 norm	clip_grad_norm
Compile a Model	compile.ggml_functional_model compile.ggml_sequential_model
Dequantize Row (IQ)	dequantize_row_iq1_m dequantize_row_iq1_s dequantize_row_iq2_s dequantize_row_iq2_xs dequantize_row_iq2_xxs dequantize_row_iq3_s dequantize_row_iq3_xxs dequantize_row_iq4_nl dequantize_row_iq4_xs
Dequantize Row (MXFP4)	dequantize_row_mxfp4
Dequantize NVFP4 Data	dequantize_row_nvfp4
Dequantize Q1_0 Data	dequantize_row_q1_0
Dequantize Row (K-quants)	dequantize_row_q2_K dequantize_row_q3_K dequantize_row_q4_K dequantize_row_q5_K dequantize_row_q6_K dequantize_row_q8_K
Dequantize Row (Q4_0)	dequantize_row_q4_0 dequantize_row_q4_1 dequantize_row_q5_0 dequantize_row_q5_1 dequantize_row_q8_0
Dequantize Row (Ternary)	dequantize_row_tq1_0 dequantize_row_tq2_0
Data-parallel training across multiple GPUs	dp_train
Evaluate a Model	evaluate.ggml_functional_model evaluate.ggml_sequential_model
Train a Model	fit.ggml_functional_model fit.ggml_sequential_model
Check if R Abort Handler is Enabled	ggml_abort_is_r_enabled
Absolute Value (Graph)	ggml_abs
Absolute Value In-place (Graph)	ggml_abs_inplace
Add tensors	ggml_add
Element-wise Addition In-place (Graph)	ggml_add_inplace
Add Relative Position Bias (Graph)	ggml_add_rel_pos
Add Scalar to Tensor (Graph)	ggml_add1
Apply a Layer Object to a Tensor Node	ggml_apply
Arange (Graph)	ggml_arange
Check if Two Tensors Have the Same Layout	ggml_are_same_layout
Compare Tensor Shapes	ggml_are_same_shape
Compare Tensor Strides	ggml_are_same_stride
Argmax (Graph)	ggml_argmax
Argsort - Get Sorting Indices (Graph)	ggml_argsort
Allocate Context Tensors to Backend	ggml_backend_alloc_ctx_tensors
Clear buffer memory	ggml_backend_buffer_clear
Free Backend Buffer	ggml_backend_buffer_free
Get Backend Buffer Size	ggml_backend_buffer_get_size
Get buffer usage	ggml_backend_buffer_get_usage
Check if buffer is host memory	ggml_backend_buffer_is_host
Check if buffer is a multi-buffer	ggml_backend_buffer_is_multi_buffer
Get Backend Buffer Name	ggml_backend_buffer_name
Reset buffer	ggml_backend_buffer_reset
Set buffer usage hint	ggml_backend_buffer_set_usage
Buffer usage: Any	ggml_backend_buffer_usage_any
Buffer usage: Compute	ggml_backend_buffer_usage_compute
Buffer usage: Weights	ggml_backend_buffer_usage_weights
Initialize CPU Backend	ggml_backend_cpu_init
Set CPU Backend Threads	ggml_backend_cpu_set_n_threads
Get device by name	ggml_backend_dev_by_name
Get device by type	ggml_backend_dev_by_type
Get number of available devices	ggml_backend_dev_count
Get device description	ggml_backend_dev_description
Get device by index	ggml_backend_dev_get
Get device properties	ggml_backend_dev_get_props
Initialize backend from device	ggml_backend_dev_init
Get device memory	ggml_backend_dev_memory
Get device name	ggml_backend_dev_name
Check if device should offload operation	ggml_backend_dev_offload_op
Check if device supports buffer type	ggml_backend_dev_supports_buft
Check if device supports operation	ggml_backend_dev_supports_op
Get device type	ggml_backend_dev_type
Register a device	ggml_backend_device_register
Device type: Accelerator	ggml_backend_device_type_accel
Device type: CPU	ggml_backend_device_type_cpu
Device type: GPU	ggml_backend_device_type_gpu
Device type: Integrated GPU	ggml_backend_device_type_igpu
Free event	ggml_backend_event_free
Create new event	ggml_backend_event_new
Record event	ggml_backend_event_record
Synchronize event	ggml_backend_event_synchronize
Wait for event	ggml_backend_event_wait
Free Backend	ggml_backend_free
Get device from backend	ggml_backend_get_device
Compute Graph with Backend	ggml_backend_graph_compute
Compute graph asynchronously	ggml_backend_graph_compute_async
Execute graph plan	ggml_backend_graph_plan_compute
Create graph execution plan	ggml_backend_graph_plan_create
Free graph execution plan	ggml_backend_graph_plan_free
Initialize best available backend	ggml_backend_init_best
Initialize backend by name	ggml_backend_init_by_name
Initialize backend by type	ggml_backend_init_by_type
Load backend from dynamic library	ggml_backend_load
Load all available backends	ggml_backend_load_all
Create a Meta Backend Device	ggml_backend_meta_device
Allocate multi-buffer	ggml_backend_multi_buffer_alloc_buffer
Set usage for all buffers in a multi-buffer	ggml_backend_multi_buffer_set_usage
Get Backend Name	ggml_backend_name
Get backend registry by name	ggml_backend_reg_by_name
Get number of registered backends	ggml_backend_reg_count
Get number of devices in registry	ggml_backend_reg_dev_count
Get device from registry	ggml_backend_reg_dev_get
Get backend registry by index	ggml_backend_reg_get
Get registry name	ggml_backend_reg_name
Register a backend	ggml_backend_register
Allocate graph on scheduler	ggml_backend_sched_alloc_graph
Free backend scheduler	ggml_backend_sched_free
Get backend from scheduler	ggml_backend_sched_get_backend
Get number of backends in scheduler	ggml_backend_sched_get_n_backends
Get number of tensor copies	ggml_backend_sched_get_n_copies
Get number of graph splits	ggml_backend_sched_get_n_splits
Get tensor backend assignment	ggml_backend_sched_get_tensor_backend
Compute graph using scheduler	ggml_backend_sched_graph_compute
Compute graph asynchronously	ggml_backend_sched_graph_compute_async
Create a new backend scheduler	ggml_backend_sched_new
Reserve memory for scheduler	ggml_backend_sched_reserve
Reset scheduler	ggml_backend_sched_reset
Set tensor backend assignment	ggml_backend_sched_set_tensor_backend
Synchronize scheduler	ggml_backend_sched_synchronize
Synchronize backend	ggml_backend_synchronize
Copy tensor asynchronously between backends	ggml_backend_tensor_copy_async
Backend Tensor Get and Sync	ggml_backend_tensor_get_and_sync
Get tensor data asynchronously	ggml_backend_tensor_get_async
Get Tensor Data via Backend	ggml_backend_tensor_get_data
Get First Float from Backend Tensor	ggml_backend_tensor_get_f32_first
Set tensor data asynchronously	ggml_backend_tensor_set_async
Set Tensor Data via Backend	ggml_backend_tensor_set_data
Unload backend	ggml_backend_unload
Create a Batch Normalization Layer Object	ggml_batch_norm
Get Block Size	ggml_blck_size
Build forward expand	ggml_build_forward_expand
Early stopping callback	ggml_callback_early_stopping
Check If Tensor Can Be Repeated	ggml_can_repeat
Ceiling (Graph)	ggml_ceil
Ceiling In-place (Graph)	ggml_ceil_inplace
Clamp (Graph)	ggml_clamp
Compile a Sequential Model	ggml_compile ggml_compile.ggml_functional_model ggml_compile.ggml_sequential_model
Concatenate Tensors (Graph)	ggml_concat
Make Contiguous (Graph)	ggml_cont
1D Convolution (Graph)	ggml_conv_1d
Depthwise 1D Convolution (Graph)	ggml_conv_1d_dw
2D Convolution (Graph)	ggml_conv_2d
Direct 2D Convolution (Graph)	ggml_conv_2d_direct
Depthwise 2D Convolution (Graph)	ggml_conv_2d_dw
Depthwise 2D Convolution, direct (Graph)	ggml_conv_2d_dw_direct
Transposed 1D Convolution (Graph)	ggml_conv_transpose_1d
Transposed 2D Convolution, zero padding (Graph)	ggml_conv_transpose_2d_p0
Cosine (Graph)	ggml_cos
Count Equal Elements (Graph)	ggml_count_equal
Element-wise Addition (CPU Direct)	ggml_cpu_add
Get All CPU Features	ggml_cpu_features
Get RISC-V Vector Length	ggml_cpu_get_rvv_vlen
Get SVE Vector Length (ARM)	ggml_cpu_get_sve_cnt
CPU Feature Detection - AMX INT8	ggml_cpu_has_amx_int8
CPU Feature Detection - ARM FMA	ggml_cpu_has_arm_fma
CPU Feature Detection - AVX	ggml_cpu_has_avx
CPU Feature Detection - AVX-VNNI	ggml_cpu_has_avx_vnni
CPU Feature Detection - AVX2	ggml_cpu_has_avx2
CPU Feature Detection - AVX-512	ggml_cpu_has_avx512
CPU Feature Detection - AVX-512 BF16	ggml_cpu_has_avx512_bf16
CPU Feature Detection - AVX-512 VBMI	ggml_cpu_has_avx512_vbmi
CPU Feature Detection - AVX-512 VNNI	ggml_cpu_has_avx512_vnni
CPU Feature Detection - BMI2	ggml_cpu_has_bmi2
CPU Feature Detection - Dot Product (ARM)	ggml_cpu_has_dotprod
CPU Feature Detection - F16C	ggml_cpu_has_f16c
CPU Feature Detection - FMA	ggml_cpu_has_fma
CPU Feature Detection - FP16 Vector Arithmetic (ARM)	ggml_cpu_has_fp16_va
CPU Feature Detection - Llamafile	ggml_cpu_has_llamafile
CPU Feature Detection - INT8 Matrix Multiply (ARM)	ggml_cpu_has_matmul_int8
CPU Feature Detection - NEON (ARM)	ggml_cpu_has_neon
CPU Feature Detection - RISC-V Vector	ggml_cpu_has_riscv_v
CPU Feature Detection - SME (ARM)	ggml_cpu_has_sme
CPU Feature Detection - SSE3	ggml_cpu_has_sse3
CPU Feature Detection - SSSE3	ggml_cpu_has_ssse3
CPU Feature Detection - SVE (ARM)	ggml_cpu_has_sve
CPU Feature Detection - VSX (PowerPC)	ggml_cpu_has_vsx
CPU Feature Detection - VXE (IBM z/Architecture)	ggml_cpu_has_vxe
CPU Feature Detection - WebAssembly SIMD	ggml_cpu_has_wasm_simd
Element-wise Multiplication (CPU Direct)	ggml_cpu_mul
Copy Tensor with Type Conversion (Graph)	ggml_cpy
Get CPU Cycles	ggml_cycles
Get CPU Cycles per Millisecond	ggml_cycles_per_ms
Default MLP builder for classification and regression	ggml_default_mlp
Create a Dense Layer Object	ggml_dense
Diagonal Matrix (Graph)	ggml_diag
Diagonal Mask with -Inf (Graph)	ggml_diag_mask_inf
Diagonal Mask with -Inf In-place (Graph)	ggml_diag_mask_inf_inplace
Diagonal Mask with Zero (Graph)	ggml_diag_mask_zero
Element-wise Division (Graph)	ggml_div
Element-wise Division In-place (Graph)	ggml_div_inplace
Duplicate Tensor (Graph)	ggml_dup
Duplicate Tensor In-place (Graph)	ggml_dup_inplace
Duplicate Tensor	ggml_dup_tensor
Get Element Size	ggml_element_size
ELU Activation (Graph)	ggml_elu
ELU Activation In-place (Graph)	ggml_elu_inplace
Create an Embedding Layer Object	ggml_embedding
Estimate Required Memory	ggml_estimate_memory
Evaluate a Trained Model	ggml_evaluate ggml_evaluate.ggml_functional_model ggml_evaluate.ggml_sequential_model
Exponential (Graph)	ggml_exp
Exponential In-place (Graph)	ggml_exp_inplace
Extract a feature-by-cell matrix from a single-cell container	ggml_extract ggml_extract.dgCMatrix ggml_extract.matrix ggml_extract.Seurat ggml_extract.SingleCellExperiment
Fit model with R-side epoch loop and callbacks	ggml_fit_opt
Train a Model (dispatcher)	ggml_fit ggml_fit.default ggml_fit.ggml_functional_model ggml_fit.ggml_sequential_model
Flash Attention Backward (Graph)	ggml_flash_attn_back
Flash Attention (Graph)	ggml_flash_attn_ext
Floor (Graph)	ggml_floor
Floor In-place (Graph)	ggml_floor_inplace
Free GGML context	ggml_free
Freeze Layer Weights	ggml_freeze_weights
Convert ftype to ggml_type	ggml_ftype_to_ggml_type
Allocate Memory for Graph	ggml_gallocr_alloc_graph
Free Graph Allocator	ggml_gallocr_free
Get Graph Allocator Buffer Size	ggml_gallocr_get_buffer_size
Create Graph Allocator	ggml_gallocr_new
Reserve Memory for Graph	ggml_gallocr_reserve
GeGLU (GELU Gated Linear Unit) (Graph)	ggml_geglu
GeGLU Quick (Fast GeGLU) (Graph)	ggml_geglu_quick
GeGLU Split (Graph)	ggml_geglu_split
GELU Activation (Graph)	ggml_gelu
Exact GELU Activation (Graph)	ggml_gelu_erf
GELU Activation In-place (Graph)	ggml_gelu_inplace
GELU Quick Activation (Graph)	ggml_gelu_quick
Get F32 data	ggml_get_f32
Get Single Float Value by N-D Index	ggml_get_f32_nd
Get First Tensor from Context	ggml_get_first_tensor
Get I32 Data	ggml_get_i32
Get Single Int32 Value by N-D Index	ggml_get_i32_nd
Get a Layer from a Sequential Model	ggml_get_layer
Get Maximum Tensor Size	ggml_get_max_tensor_size
Get Context Memory Size	ggml_get_mem_size
Get Number of Threads	ggml_get_n_threads
Get Tensor Name	ggml_get_name
Get Next Tensor from Context	ggml_get_next_tensor
Get No Allocation Mode	ggml_get_no_alloc
Get Tensor Operation Parameters	ggml_get_op_params
Get Float Op Parameter	ggml_get_op_params_f32
Get Integer Op Parameter	ggml_get_op_params_i32
Get Relative Position (Graph)	ggml_get_rel_pos
Get Rows by Indices (Graph)	ggml_get_rows
Get Rows Backward (Graph)	ggml_get_rows_back
Get Unary Operation from Tensor	ggml_get_unary_op
Generic GLU (Gated Linear Unit) (Graph)	ggml_glu
GLU Operation Types	GGML_GLU_OP_GEGLU GGML_GLU_OP_GEGLU_ERF GGML_GLU_OP_GEGLU_QUICK GGML_GLU_OP_REGLU GGML_GLU_OP_SWIGLU GGML_GLU_OP_SWIGLU_OAI
Generic GLU Split (Graph)	ggml_glu_split
Compute graph	ggml_graph_compute
Compute Graph with Context (Alternative Method)	ggml_graph_compute_with_ctx
Export Graph to DOT Format	ggml_graph_dump_dot
Get Tensor from Graph by Name	ggml_graph_get_tensor
Get Number of Nodes in Graph	ggml_graph_n_nodes
Get Graph Node	ggml_graph_node
Get Graph Overhead	ggml_graph_overhead
Print Graph Information	ggml_graph_print
Reset Graph (for backpropagation)	ggml_graph_reset
Create a View of a Subgraph	ggml_graph_view
Group Normalization (Graph)	ggml_group_norm
Group Normalization In-place (Graph)	ggml_group_norm_inplace
Create a GRU Layer Object	ggml_gru
Hard Sigmoid Activation (Graph)	ggml_hardsigmoid
Hard Swish Activation (Graph)	ggml_hardswish
Image to Column (Graph)	ggml_im2col
Initialize GGML context	ggml_init
Create Context with Auto-sizing	ggml_init_auto
Inject a single-cell result back into its container	ggml_inject ggml_inject.Seurat ggml_inject.SingleCellExperiment
Declare a Functional API Input Tensor	ggml_input
Check if GGML is available	ggml_is_available
Check if Tensor is Contiguous	ggml_is_contiguous
Check Tensor Contiguity (Dimension 0)	ggml_is_contiguous_0
Check Tensor Contiguity (Dimensions >= 1)	ggml_is_contiguous_1
Check Tensor Contiguity (Dimensions >= 2)	ggml_is_contiguous_2
Check Channel-wise Contiguity	ggml_is_contiguous_channels
Check Row-wise Contiguity	ggml_is_contiguous_rows
Check If Tensor is Contiguously Allocated	ggml_is_contiguously_allocated
Check if Tensor is Permuted	ggml_is_permuted
Check If Type is Quantized	ggml_is_quantized
Check if Tensor is Transposed	ggml_is_transposed
L2 Normalization (Graph)	ggml_l2_norm
L2 Normalization In-place (Graph)	ggml_l2_norm_inplace
Element-wise Addition of Two Tensor Nodes	ggml_layer_add
Add Batch Normalization Layer	ggml_layer_batch_norm
Concatenate Tensor Nodes Along an Axis	ggml_layer_concatenate
Create a Conv1D Layer Object	ggml_layer_conv_1d
Create a Conv2D Layer Object	ggml_layer_conv_2d
Add Dense (Fully Connected) Layer	ggml_layer_dense
Add Dropout Layer	ggml_layer_dropout
Add Embedding Layer	ggml_layer_embedding
Add Flatten Layer	ggml_layer_flatten
Global Average Pooling for 2D Feature Maps	ggml_layer_global_average_pooling_2d
Global Max Pooling for 2D Feature Maps	ggml_layer_global_max_pooling_2d
Add a GRU Layer	ggml_layer_gru
Add an LSTM Layer	ggml_layer_lstm
Add 2D Max Pooling Layer	ggml_layer_max_pooling_2d
Leaky ReLU Activation (Graph)	ggml_leaky_relu
Load a Full Model (Architecture + Weights)	ggml_load_model
Load Model Weights from File	ggml_load_weights
Natural Logarithm (Graph)	ggml_log
Natural Logarithm In-place (Graph)	ggml_log_inplace
Check if R Logging is Enabled	ggml_log_is_r_enabled
Restore Default GGML Logging	ggml_log_set_default
Enable R-compatible GGML Logging	ggml_log_set_r
Create an LSTM Layer Object	ggml_lstm
Marshal a ggmlR model to an in-memory container	ggml_marshal_model
Mean (Graph)	ggml_mean
Create a Functional Model	ggml_model
Backend a fitted ggml model actually ran on	ggml_model_backend
Create a Sequential Neural Network Model	ggml_model_sequential
Multiply tensors	ggml_mul
Element-wise Multiplication In-place (Graph)	ggml_mul_inplace
Matrix Multiplication (Graph)	ggml_mul_mat
Matrix Multiplication with Expert Selection (Graph)	ggml_mul_mat_id
Get Number of Dimensions	ggml_n_dims
Get number of bytes	ggml_nbytes
Negation (Graph)	ggml_neg
Negation In-place (Graph)	ggml_neg_inplace
Get number of elements	ggml_nelements
Create Scalar F32 Tensor	ggml_new_f32
Create Scalar I32 Tensor	ggml_new_i32
Create Tensor with Arbitrary Dimensions	ggml_new_tensor
Create 1D tensor	ggml_new_tensor_1d
Create 2D tensor	ggml_new_tensor_2d
Create 3D Tensor	ggml_new_tensor_3d
Create 4D Tensor	ggml_new_tensor_4d
Layer Normalization (Graph)	ggml_norm
Layer Normalization In-place (Graph)	ggml_norm_inplace
Get Number of Rows	ggml_nrows
Check if Operation Can Be Done In-place	ggml_op_can_inplace
Get Operation Description from Tensor	ggml_op_desc
Get Operation Name	ggml_op_name
Get Operation Symbol	ggml_op_symbol
Supported single-cell operations	ggml_ops_registry
Allocate graph for evaluation	ggml_opt_alloc
Get optimizer type from context	ggml_opt_context_optimizer_type
Get data tensor from dataset	ggml_opt_dataset_data
Free optimization dataset	ggml_opt_dataset_free
Get batch from dataset	ggml_opt_dataset_get_batch
Create a new optimization dataset	ggml_opt_dataset_init
Get labels tensor from dataset	ggml_opt_dataset_labels
Get number of datapoints in dataset	ggml_opt_dataset_ndata
Shuffle dataset	ggml_opt_dataset_shuffle
Get dataset per-datapoint weights tensor	ggml_opt_dataset_weights
Get default optimizer parameters	ggml_opt_default_params
Run one training epoch	ggml_opt_epoch
Evaluate model	ggml_opt_eval
Fit model to dataset	ggml_opt_fit
Free optimizer context	ggml_opt_free
Get current learning rate from optimizer context	ggml_opt_get_lr
Get gradient accumulator for a tensor	ggml_opt_grad_acc
Initialize optimizer context	ggml_opt_init
Initialize optimizer context for R-side epoch loop	ggml_opt_init_for_fit
Get inputs tensor from optimizer context	ggml_opt_inputs
Get labels tensor from optimizer context	ggml_opt_labels
Get loss tensor from optimizer context	ggml_opt_loss
Loss type: Cross Entropy	ggml_opt_loss_type_cross_entropy
Loss type: Mean	ggml_opt_loss_type_mean
Loss type: Mean Squared Error	ggml_opt_loss_type_mse
Loss type: Sum	ggml_opt_loss_type_sum
Loss type: Weighted Mean Squared Error	ggml_opt_loss_type_weighted_mse
Get number of correct predictions tensor	ggml_opt_ncorrect
Get optimizer name	ggml_opt_optimizer_name
Optimizer type: AdamW	ggml_opt_optimizer_type_adamw
Optimizer type: SGD	ggml_opt_optimizer_type_sgd
Get outputs tensor from optimizer context	ggml_opt_outputs
Get predictions tensor from optimizer context	ggml_opt_pred
Prepare allocation for non-static graphs	ggml_opt_prepare_alloc
Reset optimizer context	ggml_opt_reset
Get accuracy from result	ggml_opt_result_accuracy
Free optimization result	ggml_opt_result_free
Initialize optimization result	ggml_opt_result_init
Get loss from result	ggml_opt_result_loss
Get number of datapoints from result	ggml_opt_result_ndata
Get predictions from result	ggml_opt_result_pred
Reset optimization result	ggml_opt_result_reset
Set learning rate in optimizer context	ggml_opt_set_lr
Check if using static graphs	ggml_opt_static_graphs
Outer Product (Graph)	ggml_out_prod
Pad Tensor with Zeros (Graph)	ggml_pad
Reflective 1D Padding (Graph)	ggml_pad_reflect_1d
Permute Tensor Dimensions (Graph)	ggml_permute
1D Pooling (Graph)	GGML_OP_POOL_AVG GGML_OP_POOL_MAX ggml_pool_1d
2D Pooling (Graph)	ggml_pool_2d
Remove the Last Layer from a Sequential Model	ggml_pop_layer
Predict Classes from a Trained Model	ggml_predict_classes
Get Predictions from a Trained Model	ggml_predict ggml_predict.ggml_functional_model ggml_predict.ggml_sequential_model
Print Context Memory Status	ggml_print_mem_status
Print Objects in Context	ggml_print_objects
Get Quantization Block Info	ggml_quant_block_info
Quantize Data Chunk	ggml_quantize_chunk
Free Quantization Resources	ggml_quantize_free
Initialize Quantization Tables	ggml_quantize_init
Check if Quantization Requires Importance Matrix	ggml_quantize_requires_imatrix
ReGLU (ReLU Gated Linear Unit) (Graph)	ggml_reglu
ReGLU Split (Graph)	ggml_reglu_split
ReLU Activation (Graph)	ggml_relu
ReLU Activation In-place (Graph)	ggml_relu_inplace
Repeat (Graph)	ggml_repeat
Repeat Backward (Graph)	ggml_repeat_back
Reset GGML Context	ggml_reset
Reshape to 1D (Graph)	ggml_reshape_1d
Reshape to 2D (Graph)	ggml_reshape_2d
Reshape to 3D (Graph)	ggml_reshape_3d
Reshape to 4D (Graph)	ggml_reshape_4d
Construct a single-cell result	ggml_result
RMS Normalization (Graph)	ggml_rms_norm
RMS Norm Backward (Graph)	ggml_rms_norm_back
RMS Normalization In-place (Graph)	ggml_rms_norm_inplace
Roll (Graph)	ggml_roll
Rotary Position Embedding (Graph)	ggml_rope
Extended RoPE with Frequency Scaling (Graph)	ggml_rope_ext
RoPE Extended Backward (Graph)	ggml_rope_ext_back
Extended RoPE Inplace (Graph)	ggml_rope_ext_inplace
Rotary Position Embedding In-place (Graph)	ggml_rope_inplace
Multi-RoPE for Vision Models (Graph)	ggml_rope_multi
Multi-RoPE Inplace (Graph)	ggml_rope_multi_inplace
Round (Graph)	ggml_round
Round In-place (Graph)	ggml_round_inplace
Run a single-cell task on the GGML backend	ggml_run
Save a Full Model (Architecture + Weights)	ggml_save_model
Save Model Weights to File	ggml_save_weights
Scale (Graph)	ggml_scale
Scale Tensor In-place (Graph)	ggml_scale_inplace
Cosine annealing LR scheduler	ggml_schedule_cosine_decay
Reduce on plateau LR scheduler	ggml_schedule_reduce_on_plateau
Step decay LR scheduler	ggml_schedule_step_decay
Set Tensor Region (Graph)	ggml_set
Set 1D Tensor Region (Graph)	ggml_set_1d
Set 2D Tensor Region (Graph)	ggml_set_2d
Restore Default Abort Behavior	ggml_set_abort_callback_default
Enable R-compatible Abort Handling	ggml_set_abort_callback_r
Set F32 data	ggml_set_f32
Set Single Float Value by N-D Index	ggml_set_f32_nd
Set I32 Data	ggml_set_i32
Set Single Int32 Value by N-D Index	ggml_set_i32_nd
Mark Tensor as Input	ggml_set_input
Set Number of Threads	ggml_set_n_threads
Set Tensor Name	ggml_set_name
Set No Allocation Mode	ggml_set_no_alloc
Set OpenMP Thread Count	ggml_set_omp_threads
Set Tensor Operation Parameters	ggml_set_op_params
Set Float Op Parameter	ggml_set_op_params_f32
Set Integer Op Parameter	ggml_set_op_params_i32
Mark Tensor as Output	ggml_set_output
Set Tensor as Trainable Parameter	ggml_set_param
Set the random seed for reproducible ggmlR runs	ggml_set_seed
Set Tensor to Zero	ggml_set_zero
Sign Function (Graph)	ggml_sgn
Sigmoid Activation (Graph)	ggml_sigmoid
Sigmoid Activation In-place (Graph)	ggml_sigmoid_inplace
SiLU Activation (Graph)	ggml_silu
SiLU Backward (Graph)	ggml_silu_back
SiLU Activation In-place (Graph)	ggml_silu_inplace
Sine (Graph)	ggml_sin
Softmax (Graph)	ggml_soft_max
Extended Softmax with Masking and Scaling (Graph)	ggml_soft_max_ext
Softmax Backward Extended (Graph)	ggml_soft_max_ext_back
Extended Softmax Backward Inplace (Graph)	ggml_soft_max_ext_back_inplace
Extended Softmax Inplace (Graph)	ggml_soft_max_ext_inplace
Softmax In-place (Graph)	ggml_soft_max_inplace
Softplus Activation (Graph)	ggml_softplus
Softplus Activation In-place (Graph)	ggml_softplus_inplace
Sort Order Constants	GGML_SORT_ORDER_ASC GGML_SORT_ORDER_DESC
Square (Graph)	ggml_sqr
Square In-place (Graph)	ggml_sqr_inplace
Square Root (Graph)	ggml_sqrt
Square Root In-place (Graph)	ggml_sqrt_inplace
Step Function (Graph)	ggml_step
Element-wise Subtraction (Graph)	ggml_sub
Element-wise Subtraction In-place (Graph)	ggml_sub_inplace
Sum (Graph)	ggml_sum
Sum Rows (Graph)	ggml_sum_rows
SwiGLU (Swish/SiLU Gated Linear Unit) (Graph)	ggml_swiglu
SwiGLU Split (Graph)	ggml_swiglu_split
Tanh Activation (Graph)	ggml_tanh
Tanh Activation In-place (Graph)	ggml_tanh_inplace
Construct a single-cell compute task	ggml_task
Copy Tensor Data	ggml_tensor_copy
Get Tensor Strides (nb)	ggml_tensor_nb
Count Tensors in Context	ggml_tensor_num
Get Tensor Overhead	ggml_tensor_overhead
Fill Tensor with Scalar	ggml_tensor_set_f32_scalar
Get Tensor Shape	ggml_tensor_shape
Get Tensor Type	ggml_tensor_type
Test GGML	ggml_test
Initialize GGML Timer	ggml_time_init
Get Time in Milliseconds	ggml_time_ms
Get Time in Microseconds	ggml_time_us
Timestep Embedding (Graph Operation)	ggml_timestep_embedding
Top-K Indices (Graph)	ggml_top_k
Training history of a fitted ggml model	ggml_training_history
Transpose (Graph)	ggml_transpose
GGML Data Types	GGML_TYPE_BF16 GGML_TYPE_F16 GGML_TYPE_F32 GGML_TYPE_I32 GGML_TYPE_Q2_K GGML_TYPE_Q3_K GGML_TYPE_Q4_0 GGML_TYPE_Q4_1 GGML_TYPE_Q4_K GGML_TYPE_Q5_K GGML_TYPE_Q6_K GGML_TYPE_Q8_0
Get Type Name	ggml_type_name
Get Type Size in Bytes	ggml_type_size
Get Type Size as Float	ggml_type_sizef
Get Unary Operation Name	ggml_unary_op_name
Unfreeze Layer Weights	ggml_unfreeze_weights
Unmarshal a ggmlR model from an in-memory container	ggml_unmarshal_model
Upscale Tensor (Graph)	GGML_SCALE_MODE_BICUBIC GGML_SCALE_MODE_BILINEAR GGML_SCALE_MODE_NEAREST ggml_upscale
Get Used Memory	ggml_used_mem
Get GGML version	ggml_version
1D View with Byte Offset (Graph)	ggml_view_1d
2D View with Byte Offset (Graph)	ggml_view_2d
3D View with Byte Offset (Graph)	ggml_view_3d
4D View with Byte Offset (Graph)	ggml_view_4d
View Tensor	ggml_view_tensor
Check if Vulkan support is available	ggml_vulkan_available
Get Vulkan backend name	ggml_vulkan_backend_name
Get Vulkan device capabilities	ggml_vulkan_device_caps
Get number of Vulkan devices	ggml_vulkan_device_count
Get Vulkan device description	ggml_vulkan_device_description
Get Vulkan device memory	ggml_vulkan_device_memory
Free Vulkan backend	ggml_vulkan_free
Initialize Vulkan backend	ggml_vulkan_init
Check if backend is Vulkan	ggml_vulkan_is_backend
List all Vulkan devices	ggml_vulkan_list_devices
Print Vulkan status	ggml_vulkan_status
Window Partition (Graph)	ggml_win_part
Window Un-partition (Graph)	ggml_win_unpart
Execute with Temporary Context	ggml_with_temp_ctx
Free GGUF Resources	gguf_free
Load a GGUF File	gguf_load
Get GGUF Metadata	gguf_metadata
Extract Tensor Data	gguf_tensor_data
Get Tensor Info	gguf_tensor_info
List Tensor Names in a GGUF File	gguf_tensor_names
One-row summary of a fitted ggml parsnip model	glance.ggmlr_parsnip_model
Free IQ2 Quantization Tables	iq2xs_free_impl
Initialize IQ2 Quantization Tables	iq2xs_init_impl
Free IQ3 Quantization Tables	iq3xs_free_impl
Initialize IQ3 Quantization Tables	iq3xs_init_impl
Cosine-annealing learning rate scheduler	lr_scheduler_cosine
Step-decay learning rate scheduler	lr_scheduler_step
Topologically sort nodes reachable from output nodes	nn_topo_sort
ONNX model device/scheduler diagnostics	onnx_device_info
List ONNX model inputs	onnx_inputs
Load an ONNX model	onnx_load
Run ONNX model inference	onnx_run
ONNX model summary	onnx_summary
Create an Adam optimizer	optimizer_adam
Create an SGD optimizer	optimizer_sgd
Plot training history	plot.ggml_history
Predict with a Trained Model	predict.ggml_functional_model predict.ggml_sequential_model
Print method for ag_tensor	print.ag_tensor
Print method for ggml_functional_model	print.ggml_functional_model
Print method for ggml_history	print.ggml_history
Print method for ggml_sequential_model	print.ggml_sequential_model
Print ONNX model summary	print.onnx_model
Quantize Data (IQ)	quantize_iq1_m quantize_iq1_s quantize_iq2_s quantize_iq2_xs quantize_iq2_xxs quantize_iq3_s quantize_iq3_xxs quantize_iq4_nl quantize_iq4_xs
Quantize Data (MXFP4)	quantize_mxfp4
Quantize Data (NVFP4)	quantize_nvfp4
Quantize Data (Q1_0)	quantize_q1_0
Quantize Data (K-quants)	quantize_q2_K quantize_q3_K quantize_q4_K quantize_q5_K quantize_q6_K
Quantize Data (Q4_0)	quantize_q4_0 quantize_q4_1 quantize_q5_0 quantize_q5_1 quantize_q8_0
Quantize Row Reference (IQ)	quantize_row_iq2_s_ref quantize_row_iq3_s_ref quantize_row_iq3_xxs_ref quantize_row_iq4_nl_ref quantize_row_iq4_xs_ref
Quantize Row Reference (MXFP4)	quantize_row_mxfp4_ref
Quantize Row Reference (K-quants)	quantize_row_q2_K_ref quantize_row_q3_K_ref quantize_row_q4_K_ref quantize_row_q5_K_ref quantize_row_q6_K_ref quantize_row_q8_K_ref
Quantize Row Reference (Basic)	quantize_row_q4_0_ref quantize_row_q4_1_ref quantize_row_q5_0_ref quantize_row_q5_1_ref quantize_row_q8_0_ref quantize_row_q8_1_ref
Quantize Row Reference (Ternary)	quantize_row_tq1_0_ref quantize_row_tq2_0_ref
Quantize Data (Ternary)	quantize_tq1_0 quantize_tq2_0
RoPE Mode Constants	GGML_ROPE_TYPE_MROPE GGML_ROPE_TYPE_NEOX GGML_ROPE_TYPE_NORM GGML_ROPE_TYPE_VISION rope_types
Run a GGML GPU operation on a Seurat object	RunGGML RunGGML.default RunGGML.Seurat RunGGML.SingleCellExperiment
Summary method for ggml_sequential_model	summary.ggml_sequential_model
Tidy a fitted ggml parsnip model into a per-layer table	tidy.ggmlr_parsnip_model
Run code with gradient tape enabled	with_grad_tape

Usage by other packages (reverse dependencies)