| Element-wise addition with broadcasting | ag_add |
| Create a Batch Normalisation layer | ag_batch_norm |
| Element-wise clamp | ag_clamp |
| Categorical Cross-Entropy loss | ag_cross_entropy_loss |
| Create a mini-batch data loader | ag_dataloader |
| Return the current default compute device | ag_default_device |
| Return the current default dtype for GPU operations | ag_default_dtype |
| Set the default compute device for ag_* operations | ag_device |
| Create a Dropout layer | ag_dropout |
| Set the default floating-point precision for ag_* GPU operations | ag_dtype |
| Create an Embedding layer | ag_embedding |
| Switch a layer or sequential model to eval mode | ag_eval |
| Element-wise exponential | ag_exp |
| Numerical gradient check (like torch.autograd.gradcheck) | ag_gradcheck |
| Create a dense layer with learnable parameters | ag_linear |
| Load an autograd module from a saved state | ag_load_model |
| Element-wise natural logarithm | ag_log |
| Matrix multiplication | ag_matmul |
| Mean of elements (or along a dim) | ag_mean |
| Mean Squared Error loss | ag_mse_loss |
| Element-wise multiplication | ag_mul |
| Create a Multi-Head Attention layer | ag_multihead_attention |
| Create a parameter tensor (gradient tracked) | ag_param |
| Element-wise power | ag_pow |
| ReLU activation | ag_relu |
| Reshape tensor | ag_reshape |
| Save an autograd module's state to disk | ag_save_model |
| Scale tensor by a scalar constant | ag_scale |
| Create a sequential container of layers | ag_sequential |
| Sigmoid activation | ag_sigmoid |
| Softmax activation (column-wise) | ag_softmax |
| Fused softmax + cross-entropy loss (numerically stable) | ag_softmax_cross_entropy_loss |
| Element-wise subtraction | ag_sub |
| Sum all elements (or along a dim): out = sum(x) | ag_sum |
| Tanh activation | ag_tanh |
| Create a dynamic tensor (no gradient tracking) | ag_tensor |
| Move a tensor to the specified device | ag_to_device |
| Switch a layer or sequential model to training mode | ag_train |
| Transpose a tensor | ag_transpose |
| Augment new data with predictions from a fitted ggml parsnip model | augment.ggmlr_parsnip_model |
| Run backward pass from a scalar loss tensor | backward |
| Clip gradients by global L2 norm | clip_grad_norm |
| Compile a Model | compile.ggml_functional_model compile.ggml_sequential_model |
| Dequantize Row (IQ) | dequantize_row_iq1_m dequantize_row_iq1_s dequantize_row_iq2_s dequantize_row_iq2_xs dequantize_row_iq2_xxs dequantize_row_iq3_s dequantize_row_iq3_xxs dequantize_row_iq4_nl dequantize_row_iq4_xs |
| Dequantize Row (MXFP4) | dequantize_row_mxfp4 |
| Dequantize NVFP4 Data | dequantize_row_nvfp4 |
| Dequantize Q1_0 Data | dequantize_row_q1_0 |
| Dequantize Row (K-quants) | dequantize_row_q2_K dequantize_row_q3_K dequantize_row_q4_K dequantize_row_q5_K dequantize_row_q6_K dequantize_row_q8_K |
| Dequantize Row (Q4_0) | dequantize_row_q4_0 dequantize_row_q4_1 dequantize_row_q5_0 dequantize_row_q5_1 dequantize_row_q8_0 |
| Dequantize Row (Ternary) | dequantize_row_tq1_0 dequantize_row_tq2_0 |
| Data-parallel training across multiple GPUs | dp_train |
| Evaluate a Model | evaluate.ggml_functional_model evaluate.ggml_sequential_model |
| Train a Model | fit.ggml_functional_model fit.ggml_sequential_model |
| Check if R Abort Handler is Enabled | ggml_abort_is_r_enabled |
| Absolute Value (Graph) | ggml_abs |
| Absolute Value In-place (Graph) | ggml_abs_inplace |
| Add tensors | ggml_add |
| Element-wise Addition In-place (Graph) | ggml_add_inplace |
| Add Relative Position Bias (Graph) | ggml_add_rel_pos |
| Add Scalar to Tensor (Graph) | ggml_add1 |
| Apply a Layer Object to a Tensor Node | ggml_apply |
| Arange (Graph) | ggml_arange |
| Check if Two Tensors Have the Same Layout | ggml_are_same_layout |
| Compare Tensor Shapes | ggml_are_same_shape |
| Compare Tensor Strides | ggml_are_same_stride |
| Argmax (Graph) | ggml_argmax |
| Argsort - Get Sorting Indices (Graph) | ggml_argsort |
| Allocate Context Tensors to Backend | ggml_backend_alloc_ctx_tensors |
| Clear buffer memory | ggml_backend_buffer_clear |
| Free Backend Buffer | ggml_backend_buffer_free |
| Get Backend Buffer Size | ggml_backend_buffer_get_size |
| Get buffer usage | ggml_backend_buffer_get_usage |
| Check if buffer is host memory | ggml_backend_buffer_is_host |
| Check if buffer is a multi-buffer | ggml_backend_buffer_is_multi_buffer |
| Get Backend Buffer Name | ggml_backend_buffer_name |
| Reset buffer | ggml_backend_buffer_reset |
| Set buffer usage hint | ggml_backend_buffer_set_usage |
| Buffer usage: Any | ggml_backend_buffer_usage_any |
| Buffer usage: Compute | ggml_backend_buffer_usage_compute |
| Buffer usage: Weights | ggml_backend_buffer_usage_weights |
| Initialize CPU Backend | ggml_backend_cpu_init |
| Set CPU Backend Threads | ggml_backend_cpu_set_n_threads |
| Get device by name | ggml_backend_dev_by_name |
| Get device by type | ggml_backend_dev_by_type |
| Get number of available devices | ggml_backend_dev_count |
| Get device description | ggml_backend_dev_description |
| Get device by index | ggml_backend_dev_get |
| Get device properties | ggml_backend_dev_get_props |
| Initialize backend from device | ggml_backend_dev_init |
| Get device memory | ggml_backend_dev_memory |
| Get device name | ggml_backend_dev_name |
| Check if device should offload operation | ggml_backend_dev_offload_op |
| Check if device supports buffer type | ggml_backend_dev_supports_buft |
| Check if device supports operation | ggml_backend_dev_supports_op |
| Get device type | ggml_backend_dev_type |
| Register a device | ggml_backend_device_register |
| Device type: Accelerator | ggml_backend_device_type_accel |
| Device type: CPU | ggml_backend_device_type_cpu |
| Device type: GPU | ggml_backend_device_type_gpu |
| Device type: Integrated GPU | ggml_backend_device_type_igpu |
| Free event | ggml_backend_event_free |
| Create new event | ggml_backend_event_new |
| Record event | ggml_backend_event_record |
| Synchronize event | ggml_backend_event_synchronize |
| Wait for event | ggml_backend_event_wait |
| Free Backend | ggml_backend_free |
| Get device from backend | ggml_backend_get_device |
| Compute Graph with Backend | ggml_backend_graph_compute |
| Compute graph asynchronously | ggml_backend_graph_compute_async |
| Execute graph plan | ggml_backend_graph_plan_compute |
| Create graph execution plan | ggml_backend_graph_plan_create |
| Free graph execution plan | ggml_backend_graph_plan_free |
| Initialize best available backend | ggml_backend_init_best |
| Initialize backend by name | ggml_backend_init_by_name |
| Initialize backend by type | ggml_backend_init_by_type |
| Load backend from dynamic library | ggml_backend_load |
| Load all available backends | ggml_backend_load_all |
| Create a Meta Backend Device | ggml_backend_meta_device |
| Allocate multi-buffer | ggml_backend_multi_buffer_alloc_buffer |
| Set usage for all buffers in a multi-buffer | ggml_backend_multi_buffer_set_usage |
| Get Backend Name | ggml_backend_name |
| Get backend registry by name | ggml_backend_reg_by_name |
| Get number of registered backends | ggml_backend_reg_count |
| Get number of devices in registry | ggml_backend_reg_dev_count |
| Get device from registry | ggml_backend_reg_dev_get |
| Get backend registry by index | ggml_backend_reg_get |
| Get registry name | ggml_backend_reg_name |
| Register a backend | ggml_backend_register |
| Allocate graph on scheduler | ggml_backend_sched_alloc_graph |
| Free backend scheduler | ggml_backend_sched_free |
| Get backend from scheduler | ggml_backend_sched_get_backend |
| Get number of backends in scheduler | ggml_backend_sched_get_n_backends |
| Get number of tensor copies | ggml_backend_sched_get_n_copies |
| Get number of graph splits | ggml_backend_sched_get_n_splits |
| Get tensor backend assignment | ggml_backend_sched_get_tensor_backend |
| Compute graph using scheduler | ggml_backend_sched_graph_compute |
| Compute graph asynchronously | ggml_backend_sched_graph_compute_async |
| Create a new backend scheduler | ggml_backend_sched_new |
| Reserve memory for scheduler | ggml_backend_sched_reserve |
| Reset scheduler | ggml_backend_sched_reset |
| Set tensor backend assignment | ggml_backend_sched_set_tensor_backend |
| Synchronize scheduler | ggml_backend_sched_synchronize |
| Synchronize backend | ggml_backend_synchronize |
| Copy tensor asynchronously between backends | ggml_backend_tensor_copy_async |
| Backend Tensor Get and Sync | ggml_backend_tensor_get_and_sync |
| Get tensor data asynchronously | ggml_backend_tensor_get_async |
| Get Tensor Data via Backend | ggml_backend_tensor_get_data |
| Get First Float from Backend Tensor | ggml_backend_tensor_get_f32_first |
| Set tensor data asynchronously | ggml_backend_tensor_set_async |
| Set Tensor Data via Backend | ggml_backend_tensor_set_data |
| Unload backend | ggml_backend_unload |
| Create a Batch Normalization Layer Object | ggml_batch_norm |
| Get Block Size | ggml_blck_size |
| Build forward expand | ggml_build_forward_expand |
| Early stopping callback | ggml_callback_early_stopping |
| Check If Tensor Can Be Repeated | ggml_can_repeat |
| Ceiling (Graph) | ggml_ceil |
| Ceiling In-place (Graph) | ggml_ceil_inplace |
| Clamp (Graph) | ggml_clamp |
| Compile a Sequential Model | ggml_compile ggml_compile.ggml_functional_model ggml_compile.ggml_sequential_model |
| Concatenate Tensors (Graph) | ggml_concat |
| Make Contiguous (Graph) | ggml_cont |
| 1D Convolution (Graph) | ggml_conv_1d |
| Depthwise 1D Convolution (Graph) | ggml_conv_1d_dw |
| 2D Convolution (Graph) | ggml_conv_2d |
| Direct 2D Convolution (Graph) | ggml_conv_2d_direct |
| Depthwise 2D Convolution (Graph) | ggml_conv_2d_dw |
| Depthwise 2D Convolution, direct (Graph) | ggml_conv_2d_dw_direct |
| Transposed 1D Convolution (Graph) | ggml_conv_transpose_1d |
| Transposed 2D Convolution, zero padding (Graph) | ggml_conv_transpose_2d_p0 |
| Cosine (Graph) | ggml_cos |
| Count Equal Elements (Graph) | ggml_count_equal |
| Element-wise Addition (CPU Direct) | ggml_cpu_add |
| Get All CPU Features | ggml_cpu_features |
| Get RISC-V Vector Length | ggml_cpu_get_rvv_vlen |
| Get SVE Vector Length (ARM) | ggml_cpu_get_sve_cnt |
| CPU Feature Detection - AMX INT8 | ggml_cpu_has_amx_int8 |
| CPU Feature Detection - ARM FMA | ggml_cpu_has_arm_fma |
| CPU Feature Detection - AVX | ggml_cpu_has_avx |
| CPU Feature Detection - AVX-VNNI | ggml_cpu_has_avx_vnni |
| CPU Feature Detection - AVX2 | ggml_cpu_has_avx2 |
| CPU Feature Detection - AVX-512 | ggml_cpu_has_avx512 |
| CPU Feature Detection - AVX-512 BF16 | ggml_cpu_has_avx512_bf16 |
| CPU Feature Detection - AVX-512 VBMI | ggml_cpu_has_avx512_vbmi |
| CPU Feature Detection - AVX-512 VNNI | ggml_cpu_has_avx512_vnni |
| CPU Feature Detection - BMI2 | ggml_cpu_has_bmi2 |
| CPU Feature Detection - Dot Product (ARM) | ggml_cpu_has_dotprod |
| CPU Feature Detection - F16C | ggml_cpu_has_f16c |
| CPU Feature Detection - FMA | ggml_cpu_has_fma |
| CPU Feature Detection - FP16 Vector Arithmetic (ARM) | ggml_cpu_has_fp16_va |
| CPU Feature Detection - Llamafile | ggml_cpu_has_llamafile |
| CPU Feature Detection - INT8 Matrix Multiply (ARM) | ggml_cpu_has_matmul_int8 |
| CPU Feature Detection - NEON (ARM) | ggml_cpu_has_neon |
| CPU Feature Detection - RISC-V Vector | ggml_cpu_has_riscv_v |
| CPU Feature Detection - SME (ARM) | ggml_cpu_has_sme |
| CPU Feature Detection - SSE3 | ggml_cpu_has_sse3 |
| CPU Feature Detection - SSSE3 | ggml_cpu_has_ssse3 |
| CPU Feature Detection - SVE (ARM) | ggml_cpu_has_sve |
| CPU Feature Detection - VSX (PowerPC) | ggml_cpu_has_vsx |
| CPU Feature Detection - VXE (IBM z/Architecture) | ggml_cpu_has_vxe |
| CPU Feature Detection - WebAssembly SIMD | ggml_cpu_has_wasm_simd |
| Element-wise Multiplication (CPU Direct) | ggml_cpu_mul |
| Copy Tensor with Type Conversion (Graph) | ggml_cpy |
| Get CPU Cycles | ggml_cycles |
| Get CPU Cycles per Millisecond | ggml_cycles_per_ms |
| Default MLP builder for classification and regression | ggml_default_mlp |
| Create a Dense Layer Object | ggml_dense |
| Diagonal Matrix (Graph) | ggml_diag |
| Diagonal Mask with -Inf (Graph) | ggml_diag_mask_inf |
| Diagonal Mask with -Inf In-place (Graph) | ggml_diag_mask_inf_inplace |
| Diagonal Mask with Zero (Graph) | ggml_diag_mask_zero |
| Element-wise Division (Graph) | ggml_div |
| Element-wise Division In-place (Graph) | ggml_div_inplace |
| Duplicate Tensor (Graph) | ggml_dup |
| Duplicate Tensor In-place (Graph) | ggml_dup_inplace |
| Duplicate Tensor | ggml_dup_tensor |
| Get Element Size | ggml_element_size |
| ELU Activation (Graph) | ggml_elu |
| ELU Activation In-place (Graph) | ggml_elu_inplace |
| Create an Embedding Layer Object | ggml_embedding |
| Estimate Required Memory | ggml_estimate_memory |
| Evaluate a Trained Model | ggml_evaluate ggml_evaluate.ggml_functional_model ggml_evaluate.ggml_sequential_model |
| Exponential (Graph) | ggml_exp |
| Exponential In-place (Graph) | ggml_exp_inplace |
| Extract a feature-by-cell matrix from a single-cell container | ggml_extract ggml_extract.dgCMatrix ggml_extract.matrix ggml_extract.Seurat ggml_extract.SingleCellExperiment |
| Fit model with R-side epoch loop and callbacks | ggml_fit_opt |
| Train a Model (dispatcher) | ggml_fit ggml_fit.default ggml_fit.ggml_functional_model ggml_fit.ggml_sequential_model |
| Flash Attention Backward (Graph) | ggml_flash_attn_back |
| Flash Attention (Graph) | ggml_flash_attn_ext |
| Floor (Graph) | ggml_floor |
| Floor In-place (Graph) | ggml_floor_inplace |
| Free GGML context | ggml_free |
| Freeze Layer Weights | ggml_freeze_weights |
| Convert ftype to ggml_type | ggml_ftype_to_ggml_type |
| Allocate Memory for Graph | ggml_gallocr_alloc_graph |
| Free Graph Allocator | ggml_gallocr_free |
| Get Graph Allocator Buffer Size | ggml_gallocr_get_buffer_size |
| Create Graph Allocator | ggml_gallocr_new |
| Reserve Memory for Graph | ggml_gallocr_reserve |
| GeGLU (GELU Gated Linear Unit) (Graph) | ggml_geglu |
| GeGLU Quick (Fast GeGLU) (Graph) | ggml_geglu_quick |
| GeGLU Split (Graph) | ggml_geglu_split |
| GELU Activation (Graph) | ggml_gelu |
| Exact GELU Activation (Graph) | ggml_gelu_erf |
| GELU Activation In-place (Graph) | ggml_gelu_inplace |
| GELU Quick Activation (Graph) | ggml_gelu_quick |
| Get F32 data | ggml_get_f32 |
| Get Single Float Value by N-D Index | ggml_get_f32_nd |
| Get First Tensor from Context | ggml_get_first_tensor |
| Get I32 Data | ggml_get_i32 |
| Get Single Int32 Value by N-D Index | ggml_get_i32_nd |
| Get a Layer from a Sequential Model | ggml_get_layer |
| Get Maximum Tensor Size | ggml_get_max_tensor_size |
| Get Context Memory Size | ggml_get_mem_size |
| Get Number of Threads | ggml_get_n_threads |
| Get Tensor Name | ggml_get_name |
| Get Next Tensor from Context | ggml_get_next_tensor |
| Get No Allocation Mode | ggml_get_no_alloc |
| Get Tensor Operation Parameters | ggml_get_op_params |
| Get Float Op Parameter | ggml_get_op_params_f32 |
| Get Integer Op Parameter | ggml_get_op_params_i32 |
| Get Relative Position (Graph) | ggml_get_rel_pos |
| Get Rows by Indices (Graph) | ggml_get_rows |
| Get Rows Backward (Graph) | ggml_get_rows_back |
| Get Unary Operation from Tensor | ggml_get_unary_op |
| Generic GLU (Gated Linear Unit) (Graph) | ggml_glu |
| GLU Operation Types | GGML_GLU_OP_GEGLU GGML_GLU_OP_GEGLU_ERF GGML_GLU_OP_GEGLU_QUICK GGML_GLU_OP_REGLU GGML_GLU_OP_SWIGLU GGML_GLU_OP_SWIGLU_OAI |
| Generic GLU Split (Graph) | ggml_glu_split |
| Compute graph | ggml_graph_compute |
| Compute Graph with Context (Alternative Method) | ggml_graph_compute_with_ctx |
| Export Graph to DOT Format | ggml_graph_dump_dot |
| Get Tensor from Graph by Name | ggml_graph_get_tensor |
| Get Number of Nodes in Graph | ggml_graph_n_nodes |
| Get Graph Node | ggml_graph_node |
| Get Graph Overhead | ggml_graph_overhead |
| Print Graph Information | ggml_graph_print |
| Reset Graph (for backpropagation) | ggml_graph_reset |
| Create a View of a Subgraph | ggml_graph_view |
| Group Normalization (Graph) | ggml_group_norm |
| Group Normalization In-place (Graph) | ggml_group_norm_inplace |
| Create a GRU Layer Object | ggml_gru |
| Hard Sigmoid Activation (Graph) | ggml_hardsigmoid |
| Hard Swish Activation (Graph) | ggml_hardswish |
| Image to Column (Graph) | ggml_im2col |
| Initialize GGML context | ggml_init |
| Create Context with Auto-sizing | ggml_init_auto |
| Inject a single-cell result back into its container | ggml_inject ggml_inject.Seurat ggml_inject.SingleCellExperiment |
| Declare a Functional API Input Tensor | ggml_input |
| Check if GGML is available | ggml_is_available |
| Check if Tensor is Contiguous | ggml_is_contiguous |
| Check Tensor Contiguity (Dimension 0) | ggml_is_contiguous_0 |
| Check Tensor Contiguity (Dimensions >= 1) | ggml_is_contiguous_1 |
| Check Tensor Contiguity (Dimensions >= 2) | ggml_is_contiguous_2 |
| Check Channel-wise Contiguity | ggml_is_contiguous_channels |
| Check Row-wise Contiguity | ggml_is_contiguous_rows |
| Check If Tensor is Contiguously Allocated | ggml_is_contiguously_allocated |
| Check if Tensor is Permuted | ggml_is_permuted |
| Check If Type is Quantized | ggml_is_quantized |
| Check if Tensor is Transposed | ggml_is_transposed |
| L2 Normalization (Graph) | ggml_l2_norm |
| L2 Normalization In-place (Graph) | ggml_l2_norm_inplace |
| Element-wise Addition of Two Tensor Nodes | ggml_layer_add |
| Add Batch Normalization Layer | ggml_layer_batch_norm |
| Concatenate Tensor Nodes Along an Axis | ggml_layer_concatenate |
| Create a Conv1D Layer Object | ggml_layer_conv_1d |
| Create a Conv2D Layer Object | ggml_layer_conv_2d |
| Add Dense (Fully Connected) Layer | ggml_layer_dense |
| Add Dropout Layer | ggml_layer_dropout |
| Add Embedding Layer | ggml_layer_embedding |
| Add Flatten Layer | ggml_layer_flatten |
| Global Average Pooling for 2D Feature Maps | ggml_layer_global_average_pooling_2d |
| Global Max Pooling for 2D Feature Maps | ggml_layer_global_max_pooling_2d |
| Add a GRU Layer | ggml_layer_gru |
| Add an LSTM Layer | ggml_layer_lstm |
| Add 2D Max Pooling Layer | ggml_layer_max_pooling_2d |
| Leaky ReLU Activation (Graph) | ggml_leaky_relu |
| Load a Full Model (Architecture + Weights) | ggml_load_model |
| Load Model Weights from File | ggml_load_weights |
| Natural Logarithm (Graph) | ggml_log |
| Natural Logarithm In-place (Graph) | ggml_log_inplace |
| Check if R Logging is Enabled | ggml_log_is_r_enabled |
| Restore Default GGML Logging | ggml_log_set_default |
| Enable R-compatible GGML Logging | ggml_log_set_r |
| Create an LSTM Layer Object | ggml_lstm |
| Marshal a ggmlR model to an in-memory container | ggml_marshal_model |
| Mean (Graph) | ggml_mean |
| Create a Functional Model | ggml_model |
| Backend a fitted ggml model actually ran on | ggml_model_backend |
| Create a Sequential Neural Network Model | ggml_model_sequential |
| Multiply tensors | ggml_mul |
| Element-wise Multiplication In-place (Graph) | ggml_mul_inplace |
| Matrix Multiplication (Graph) | ggml_mul_mat |
| Matrix Multiplication with Expert Selection (Graph) | ggml_mul_mat_id |
| Get Number of Dimensions | ggml_n_dims |
| Get number of bytes | ggml_nbytes |
| Negation (Graph) | ggml_neg |
| Negation In-place (Graph) | ggml_neg_inplace |
| Get number of elements | ggml_nelements |
| Create Scalar F32 Tensor | ggml_new_f32 |
| Create Scalar I32 Tensor | ggml_new_i32 |
| Create Tensor with Arbitrary Dimensions | ggml_new_tensor |
| Create 1D tensor | ggml_new_tensor_1d |
| Create 2D tensor | ggml_new_tensor_2d |
| Create 3D Tensor | ggml_new_tensor_3d |
| Create 4D Tensor | ggml_new_tensor_4d |
| Layer Normalization (Graph) | ggml_norm |
| Layer Normalization In-place (Graph) | ggml_norm_inplace |
| Get Number of Rows | ggml_nrows |
| Check if Operation Can Be Done In-place | ggml_op_can_inplace |
| Get Operation Description from Tensor | ggml_op_desc |
| Get Operation Name | ggml_op_name |
| Get Operation Symbol | ggml_op_symbol |
| Supported single-cell operations | ggml_ops_registry |
| Allocate graph for evaluation | ggml_opt_alloc |
| Get optimizer type from context | ggml_opt_context_optimizer_type |
| Get data tensor from dataset | ggml_opt_dataset_data |
| Free optimization dataset | ggml_opt_dataset_free |
| Get batch from dataset | ggml_opt_dataset_get_batch |
| Create a new optimization dataset | ggml_opt_dataset_init |
| Get labels tensor from dataset | ggml_opt_dataset_labels |
| Get number of datapoints in dataset | ggml_opt_dataset_ndata |
| Shuffle dataset | ggml_opt_dataset_shuffle |
| Get dataset per-datapoint weights tensor | ggml_opt_dataset_weights |
| Get default optimizer parameters | ggml_opt_default_params |
| Run one training epoch | ggml_opt_epoch |
| Evaluate model | ggml_opt_eval |
| Fit model to dataset | ggml_opt_fit |
| Free optimizer context | ggml_opt_free |
| Get current learning rate from optimizer context | ggml_opt_get_lr |
| Get gradient accumulator for a tensor | ggml_opt_grad_acc |
| Initialize optimizer context | ggml_opt_init |
| Initialize optimizer context for R-side epoch loop | ggml_opt_init_for_fit |
| Get inputs tensor from optimizer context | ggml_opt_inputs |
| Get labels tensor from optimizer context | ggml_opt_labels |
| Get loss tensor from optimizer context | ggml_opt_loss |
| Loss type: Cross Entropy | ggml_opt_loss_type_cross_entropy |
| Loss type: Mean | ggml_opt_loss_type_mean |
| Loss type: Mean Squared Error | ggml_opt_loss_type_mse |
| Loss type: Sum | ggml_opt_loss_type_sum |
| Loss type: Weighted Mean Squared Error | ggml_opt_loss_type_weighted_mse |
| Get number of correct predictions tensor | ggml_opt_ncorrect |
| Get optimizer name | ggml_opt_optimizer_name |
| Optimizer type: AdamW | ggml_opt_optimizer_type_adamw |
| Optimizer type: SGD | ggml_opt_optimizer_type_sgd |
| Get outputs tensor from optimizer context | ggml_opt_outputs |
| Get predictions tensor from optimizer context | ggml_opt_pred |
| Prepare allocation for non-static graphs | ggml_opt_prepare_alloc |
| Reset optimizer context | ggml_opt_reset |
| Get accuracy from result | ggml_opt_result_accuracy |
| Free optimization result | ggml_opt_result_free |
| Initialize optimization result | ggml_opt_result_init |
| Get loss from result | ggml_opt_result_loss |
| Get number of datapoints from result | ggml_opt_result_ndata |
| Get predictions from result | ggml_opt_result_pred |
| Reset optimization result | ggml_opt_result_reset |
| Set learning rate in optimizer context | ggml_opt_set_lr |
| Check if using static graphs | ggml_opt_static_graphs |
| Outer Product (Graph) | ggml_out_prod |
| Pad Tensor with Zeros (Graph) | ggml_pad |
| Reflective 1D Padding (Graph) | ggml_pad_reflect_1d |
| Permute Tensor Dimensions (Graph) | ggml_permute |
| 1D Pooling (Graph) | GGML_OP_POOL_AVG GGML_OP_POOL_MAX ggml_pool_1d |
| 2D Pooling (Graph) | ggml_pool_2d |
| Remove the Last Layer from a Sequential Model | ggml_pop_layer |
| Predict Classes from a Trained Model | ggml_predict_classes |
| Get Predictions from a Trained Model | ggml_predict ggml_predict.ggml_functional_model ggml_predict.ggml_sequential_model |
| Print Context Memory Status | ggml_print_mem_status |
| Print Objects in Context | ggml_print_objects |
| Get Quantization Block Info | ggml_quant_block_info |
| Quantize Data Chunk | ggml_quantize_chunk |
| Free Quantization Resources | ggml_quantize_free |
| Initialize Quantization Tables | ggml_quantize_init |
| Check if Quantization Requires Importance Matrix | ggml_quantize_requires_imatrix |
| ReGLU (ReLU Gated Linear Unit) (Graph) | ggml_reglu |
| ReGLU Split (Graph) | ggml_reglu_split |
| ReLU Activation (Graph) | ggml_relu |
| ReLU Activation In-place (Graph) | ggml_relu_inplace |
| Repeat (Graph) | ggml_repeat |
| Repeat Backward (Graph) | ggml_repeat_back |
| Reset GGML Context | ggml_reset |
| Reshape to 1D (Graph) | ggml_reshape_1d |
| Reshape to 2D (Graph) | ggml_reshape_2d |
| Reshape to 3D (Graph) | ggml_reshape_3d |
| Reshape to 4D (Graph) | ggml_reshape_4d |
| Construct a single-cell result | ggml_result |
| RMS Normalization (Graph) | ggml_rms_norm |
| RMS Norm Backward (Graph) | ggml_rms_norm_back |
| RMS Normalization In-place (Graph) | ggml_rms_norm_inplace |
| Roll (Graph) | ggml_roll |
| Rotary Position Embedding (Graph) | ggml_rope |
| Extended RoPE with Frequency Scaling (Graph) | ggml_rope_ext |
| RoPE Extended Backward (Graph) | ggml_rope_ext_back |
| Extended RoPE Inplace (Graph) | ggml_rope_ext_inplace |
| Rotary Position Embedding In-place (Graph) | ggml_rope_inplace |
| Multi-RoPE for Vision Models (Graph) | ggml_rope_multi |
| Multi-RoPE Inplace (Graph) | ggml_rope_multi_inplace |
| Round (Graph) | ggml_round |
| Round In-place (Graph) | ggml_round_inplace |
| Run a single-cell task on the GGML backend | ggml_run |
| Save a Full Model (Architecture + Weights) | ggml_save_model |
| Save Model Weights to File | ggml_save_weights |
| Scale (Graph) | ggml_scale |
| Scale Tensor In-place (Graph) | ggml_scale_inplace |
| Cosine annealing LR scheduler | ggml_schedule_cosine_decay |
| Reduce on plateau LR scheduler | ggml_schedule_reduce_on_plateau |
| Step decay LR scheduler | ggml_schedule_step_decay |
| Set Tensor Region (Graph) | ggml_set |
| Set 1D Tensor Region (Graph) | ggml_set_1d |
| Set 2D Tensor Region (Graph) | ggml_set_2d |
| Restore Default Abort Behavior | ggml_set_abort_callback_default |
| Enable R-compatible Abort Handling | ggml_set_abort_callback_r |
| Set F32 data | ggml_set_f32 |
| Set Single Float Value by N-D Index | ggml_set_f32_nd |
| Set I32 Data | ggml_set_i32 |
| Set Single Int32 Value by N-D Index | ggml_set_i32_nd |
| Mark Tensor as Input | ggml_set_input |
| Set Number of Threads | ggml_set_n_threads |
| Set Tensor Name | ggml_set_name |
| Set No Allocation Mode | ggml_set_no_alloc |
| Set OpenMP Thread Count | ggml_set_omp_threads |
| Set Tensor Operation Parameters | ggml_set_op_params |
| Set Float Op Parameter | ggml_set_op_params_f32 |
| Set Integer Op Parameter | ggml_set_op_params_i32 |
| Mark Tensor as Output | ggml_set_output |
| Set Tensor as Trainable Parameter | ggml_set_param |
| Set the random seed for reproducible ggmlR runs | ggml_set_seed |
| Set Tensor to Zero | ggml_set_zero |
| Sign Function (Graph) | ggml_sgn |
| Sigmoid Activation (Graph) | ggml_sigmoid |
| Sigmoid Activation In-place (Graph) | ggml_sigmoid_inplace |
| SiLU Activation (Graph) | ggml_silu |
| SiLU Backward (Graph) | ggml_silu_back |
| SiLU Activation In-place (Graph) | ggml_silu_inplace |
| Sine (Graph) | ggml_sin |
| Softmax (Graph) | ggml_soft_max |
| Extended Softmax with Masking and Scaling (Graph) | ggml_soft_max_ext |
| Softmax Backward Extended (Graph) | ggml_soft_max_ext_back |
| Extended Softmax Backward Inplace (Graph) | ggml_soft_max_ext_back_inplace |
| Extended Softmax Inplace (Graph) | ggml_soft_max_ext_inplace |
| Softmax In-place (Graph) | ggml_soft_max_inplace |
| Softplus Activation (Graph) | ggml_softplus |
| Softplus Activation In-place (Graph) | ggml_softplus_inplace |
| Sort Order Constants | GGML_SORT_ORDER_ASC GGML_SORT_ORDER_DESC |
| Square (Graph) | ggml_sqr |
| Square In-place (Graph) | ggml_sqr_inplace |
| Square Root (Graph) | ggml_sqrt |
| Square Root In-place (Graph) | ggml_sqrt_inplace |
| Step Function (Graph) | ggml_step |
| Element-wise Subtraction (Graph) | ggml_sub |
| Element-wise Subtraction In-place (Graph) | ggml_sub_inplace |
| Sum (Graph) | ggml_sum |
| Sum Rows (Graph) | ggml_sum_rows |
| SwiGLU (Swish/SiLU Gated Linear Unit) (Graph) | ggml_swiglu |
| SwiGLU Split (Graph) | ggml_swiglu_split |
| Tanh Activation (Graph) | ggml_tanh |
| Tanh Activation In-place (Graph) | ggml_tanh_inplace |
| Construct a single-cell compute task | ggml_task |
| Copy Tensor Data | ggml_tensor_copy |
| Get Tensor Strides (nb) | ggml_tensor_nb |
| Count Tensors in Context | ggml_tensor_num |
| Get Tensor Overhead | ggml_tensor_overhead |
| Fill Tensor with Scalar | ggml_tensor_set_f32_scalar |
| Get Tensor Shape | ggml_tensor_shape |
| Get Tensor Type | ggml_tensor_type |
| Test GGML | ggml_test |
| Initialize GGML Timer | ggml_time_init |
| Get Time in Milliseconds | ggml_time_ms |
| Get Time in Microseconds | ggml_time_us |
| Timestep Embedding (Graph Operation) | ggml_timestep_embedding |
| Top-K Indices (Graph) | ggml_top_k |
| Training history of a fitted ggml model | ggml_training_history |
| Transpose (Graph) | ggml_transpose |
| GGML Data Types | GGML_TYPE_BF16 GGML_TYPE_F16 GGML_TYPE_F32 GGML_TYPE_I32 GGML_TYPE_Q2_K GGML_TYPE_Q3_K GGML_TYPE_Q4_0 GGML_TYPE_Q4_1 GGML_TYPE_Q4_K GGML_TYPE_Q5_K GGML_TYPE_Q6_K GGML_TYPE_Q8_0 |
| Get Type Name | ggml_type_name |
| Get Type Size in Bytes | ggml_type_size |
| Get Type Size as Float | ggml_type_sizef |
| Get Unary Operation Name | ggml_unary_op_name |
| Unfreeze Layer Weights | ggml_unfreeze_weights |
| Unmarshal a ggmlR model from an in-memory container | ggml_unmarshal_model |
| Upscale Tensor (Graph) | GGML_SCALE_MODE_BICUBIC GGML_SCALE_MODE_BILINEAR GGML_SCALE_MODE_NEAREST ggml_upscale |
| Get Used Memory | ggml_used_mem |
| Get GGML version | ggml_version |
| 1D View with Byte Offset (Graph) | ggml_view_1d |
| 2D View with Byte Offset (Graph) | ggml_view_2d |
| 3D View with Byte Offset (Graph) | ggml_view_3d |
| 4D View with Byte Offset (Graph) | ggml_view_4d |
| View Tensor | ggml_view_tensor |
| Check if Vulkan support is available | ggml_vulkan_available |
| Get Vulkan backend name | ggml_vulkan_backend_name |
| Get Vulkan device capabilities | ggml_vulkan_device_caps |
| Get number of Vulkan devices | ggml_vulkan_device_count |
| Get Vulkan device description | ggml_vulkan_device_description |
| Get Vulkan device memory | ggml_vulkan_device_memory |
| Free Vulkan backend | ggml_vulkan_free |
| Initialize Vulkan backend | ggml_vulkan_init |
| Check if backend is Vulkan | ggml_vulkan_is_backend |
| List all Vulkan devices | ggml_vulkan_list_devices |
| Print Vulkan status | ggml_vulkan_status |
| Window Partition (Graph) | ggml_win_part |
| Window Un-partition (Graph) | ggml_win_unpart |
| Execute with Temporary Context | ggml_with_temp_ctx |
| Free GGUF Resources | gguf_free |
| Load a GGUF File | gguf_load |
| Get GGUF Metadata | gguf_metadata |
| Extract Tensor Data | gguf_tensor_data |
| Get Tensor Info | gguf_tensor_info |
| List Tensor Names in a GGUF File | gguf_tensor_names |
| One-row summary of a fitted ggml parsnip model | glance.ggmlr_parsnip_model |
| Free IQ2 Quantization Tables | iq2xs_free_impl |
| Initialize IQ2 Quantization Tables | iq2xs_init_impl |
| Free IQ3 Quantization Tables | iq3xs_free_impl |
| Initialize IQ3 Quantization Tables | iq3xs_init_impl |
| Cosine-annealing learning rate scheduler | lr_scheduler_cosine |
| Step-decay learning rate scheduler | lr_scheduler_step |
| Topologically sort nodes reachable from output nodes | nn_topo_sort |
| ONNX model device/scheduler diagnostics | onnx_device_info |
| List ONNX model inputs | onnx_inputs |
| Load an ONNX model | onnx_load |
| Run ONNX model inference | onnx_run |
| ONNX model summary | onnx_summary |
| Create an Adam optimizer | optimizer_adam |
| Create an SGD optimizer | optimizer_sgd |
| Plot training history | plot.ggml_history |
| Predict with a Trained Model | predict.ggml_functional_model predict.ggml_sequential_model |
| Print method for ag_tensor | print.ag_tensor |
| Print method for ggml_functional_model | print.ggml_functional_model |
| Print method for ggml_history | print.ggml_history |
| Print method for ggml_sequential_model | print.ggml_sequential_model |
| Print ONNX model summary | print.onnx_model |
| Quantize Data (IQ) | quantize_iq1_m quantize_iq1_s quantize_iq2_s quantize_iq2_xs quantize_iq2_xxs quantize_iq3_s quantize_iq3_xxs quantize_iq4_nl quantize_iq4_xs |
| Quantize Data (MXFP4) | quantize_mxfp4 |
| Quantize Data (NVFP4) | quantize_nvfp4 |
| Quantize Data (Q1_0) | quantize_q1_0 |
| Quantize Data (K-quants) | quantize_q2_K quantize_q3_K quantize_q4_K quantize_q5_K quantize_q6_K |
| Quantize Data (Q4_0) | quantize_q4_0 quantize_q4_1 quantize_q5_0 quantize_q5_1 quantize_q8_0 |
| Quantize Row Reference (IQ) | quantize_row_iq2_s_ref quantize_row_iq3_s_ref quantize_row_iq3_xxs_ref quantize_row_iq4_nl_ref quantize_row_iq4_xs_ref |
| Quantize Row Reference (MXFP4) | quantize_row_mxfp4_ref |
| Quantize Row Reference (K-quants) | quantize_row_q2_K_ref quantize_row_q3_K_ref quantize_row_q4_K_ref quantize_row_q5_K_ref quantize_row_q6_K_ref quantize_row_q8_K_ref |
| Quantize Row Reference (Basic) | quantize_row_q4_0_ref quantize_row_q4_1_ref quantize_row_q5_0_ref quantize_row_q5_1_ref quantize_row_q8_0_ref quantize_row_q8_1_ref |
| Quantize Row Reference (Ternary) | quantize_row_tq1_0_ref quantize_row_tq2_0_ref |
| Quantize Data (Ternary) | quantize_tq1_0 quantize_tq2_0 |
| RoPE Mode Constants | GGML_ROPE_TYPE_MROPE GGML_ROPE_TYPE_NEOX GGML_ROPE_TYPE_NORM GGML_ROPE_TYPE_VISION rope_types |
| Run a GGML GPU operation on a Seurat object | RunGGML RunGGML.default RunGGML.Seurat RunGGML.SingleCellExperiment |
| Summary method for ggml_sequential_model | summary.ggml_sequential_model |
| Tidy a fitted ggml parsnip model into a per-layer table | tidy.ggmlr_parsnip_model |
| Run code with gradient tape enabled | with_grad_tape |