Package: ggmlR 0.7.9

ggmlR: 'GGML' Tensor Operations for Machine Learning

Provides 'R' bindings to the 'GGML' tensor library for machine learning, optimized for 'Vulkan' GPU acceleration with a transparent CPU fallback. The package features a 'Keras'-like sequential API and a 'PyTorch'-style 'autograd' engine for building, training, and deploying neural networks. Key capabilities include high-performance 5D tensor operations, 'f16' precision, and efficient quantization. It supports native 'ONNX' model import (50+ operators) and 'GGUF' weight loading from the 'llama.cpp' and 'Hugging Face' ecosystems. Designed for zero-overhead inference via dedicated weight buffering, it integrates seamlessly as a 'parsnip' engine for 'tidymodels' and provides first-class learners for the 'mlr3' framework. See <https://github.com/ggml-org/ggml> for more information about the underlying library.

Authors:Yuri Baramykov [aut, cre], Georgi Gerganov [ctb, cph], Jeffrey Quesnelle [ctb, cph], Bowen Peng [ctb, cph], Mozilla Foundation [ctb, cph]

ggmlR_0.7.9.tar.gz
ggmlR_0.7.9.zip(r-4.7)ggmlR_0.7.9.zip(r-4.6)ggmlR_0.7.9.zip(r-4.5)
ggmlR_0.7.9.tgz(r-4.6-x86_64)ggmlR_0.7.9.tgz(r-4.6-arm64)ggmlR_0.7.9.tgz(r-4.5-x86_64)ggmlR_0.7.9.tgz(r-4.5-arm64)
ggmlR_0.7.9.tar.gz(r-4.7-arm64)ggmlR_0.7.9.tar.gz(r-4.7-x86_64)ggmlR_0.7.9.tar.gz(r-4.6-arm64)ggmlR_0.7.9.tar.gz(r-4.6-x86_64)
ggmlR_0.7.9.tgz(r-4.6-emscripten)
manual.pdf |manual.html
card.svg |card.png
ggmlR/json (API)
NEWS

# Install 'ggmlR' in R:
install.packages('ggmlR', repos = c('https://zabis13.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/zabis13/ggmlr/issues

Uses libs:
  • c++– GNU Standard C++ Library v3
  • openmp– GCC OpenMP (GOMP) support library

On CRAN:

Conda:

cppopenmp

8.61 score 13 stars 2 packages 69 scripts 738 downloads 652 exports 2 dependencies

Last updated from:3aac7fb9e7. Checks:8 ERROR, 1 OK, 3 FAIL. Indexed: yes.

TargetResultTimeFilesSyslog
linux-devel-arm64ERROR250
linux-devel-x86_64ERROR306
source / vignettesOK368
linux-release-arm64ERROR272
linux-release-x86_64ERROR273
macos-release-arm64ERROR139
macos-release-x86_64ERROR330
macos-oldrel-arm64ERROR188
macos-oldrel-x86_64ERROR453
windows-develFAIL113
windows-releaseFAIL82
windows-oldrelFAIL79

Exports:ag_addag_batch_normag_clampag_cross_entropy_lossag_dataloaderag_default_deviceag_default_dtypeag_deviceag_dropoutag_dtypeag_embeddingag_evalag_expag_gradcheckag_linearag_load_modelag_logag_matmulag_meanag_mse_lossag_mulag_multihead_attentionag_paramag_powag_reluag_reshapeag_save_modelag_scaleag_sequentialag_sigmoidag_softmaxag_softmax_cross_entropy_lossag_subag_sumag_tanhag_tensorag_to_deviceag_trainag_transposebackwardclip_grad_normcompiledequantize_row_iq1_mdequantize_row_iq1_sdequantize_row_iq2_sdequantize_row_iq2_xsdequantize_row_iq2_xxsdequantize_row_iq3_sdequantize_row_iq3_xxsdequantize_row_iq4_nldequantize_row_iq4_xsdequantize_row_mxfp4dequantize_row_nvfp4dequantize_row_q1_0dequantize_row_q2_Kdequantize_row_q3_Kdequantize_row_q4_0dequantize_row_q4_1dequantize_row_q4_Kdequantize_row_q5_0dequantize_row_q5_1dequantize_row_q5_Kdequantize_row_q6_Kdequantize_row_q8_0dequantize_row_q8_Kdequantize_row_tq1_0dequantize_row_tq2_0dp_trainevaluatefitggml_abort_is_r_enabledggml_absggml_abs_inplaceggml_addggml_add_inplaceggml_add_rel_posggml_add1ggml_applyggml_arangeggml_are_same_layoutggml_are_same_shapeggml_are_same_strideggml_argmaxggml_argsortggml_backend_alloc_ctx_tensorsggml_backend_buffer_clearggml_backend_buffer_freeggml_backend_buffer_get_sizeggml_backend_buffer_get_usageggml_backend_buffer_is_hostggml_backend_buffer_is_multi_bufferggml_backend_buffer_nameggml_backend_buffer_resetggml_backend_buffer_set_usageggml_backend_buffer_usage_anyggml_backend_buffer_usage_computeggml_backend_buffer_usage_weightsggml_backend_cpu_initggml_backend_cpu_set_n_threadsggml_backend_dev_by_nameggml_backend_dev_by_typeggml_backend_dev_countggml_backend_dev_descriptionggml_backend_dev_getggml_backend_dev_get_propsggml_backend_dev_initggml_backend_dev_memoryggml_backend_dev_nameggml_backend_dev_offload_opggml_backend_dev_supports_buftggml_backend_dev_supports_opggml_backend_dev_typeggml_backend_device_registerggml_backend_device_type_accelggml_backend_device_type_cpuggml_backend_device_type_gpuggml_backend_device_type_igpuggml_backend_event_freeggml_backend_event_newggml_backend_event_recordggml_backend_event_synchronizeggml_backend_event_waitggml_backend_freeggml_backend_get_deviceggml_backend_graph_computeggml_backend_graph_compute_asyncggml_backend_graph_plan_computeggml_backend_graph_plan_createggml_backend_graph_plan_freeggml_backend_init_bestggml_backend_init_by_nameggml_backend_init_by_typeggml_backend_loadggml_backend_load_allggml_backend_meta_deviceggml_backend_multi_buffer_alloc_bufferggml_backend_multi_buffer_set_usageggml_backend_nameggml_backend_reg_by_nameggml_backend_reg_countggml_backend_reg_dev_countggml_backend_reg_dev_getggml_backend_reg_getggml_backend_reg_nameggml_backend_registerggml_backend_sched_alloc_graphggml_backend_sched_freeggml_backend_sched_get_backendggml_backend_sched_get_n_backendsggml_backend_sched_get_n_copiesggml_backend_sched_get_n_splitsggml_backend_sched_get_tensor_backendggml_backend_sched_graph_computeggml_backend_sched_graph_compute_asyncggml_backend_sched_newggml_backend_sched_reserveggml_backend_sched_resetggml_backend_sched_set_tensor_backendggml_backend_sched_synchronizeggml_backend_synchronizeggml_backend_tensor_copy_asyncggml_backend_tensor_get_and_syncggml_backend_tensor_get_asyncggml_backend_tensor_get_dataggml_backend_tensor_get_f32_firstggml_backend_tensor_set_asyncggml_backend_tensor_set_dataggml_backend_unloadggml_batch_normggml_blck_sizeggml_build_forward_expandggml_callback_early_stoppingggml_can_repeatggml_ceilggml_ceil_inplaceggml_clampggml_compileggml_concatggml_contggml_conv_1dggml_conv_1d_dwggml_conv_2dggml_conv_2d_directggml_conv_2d_dwggml_conv_2d_dw_directggml_conv_transpose_1dggml_conv_transpose_2d_p0ggml_cosggml_count_equalggml_cpu_addggml_cpu_featuresggml_cpu_get_rvv_vlenggml_cpu_get_sve_cntggml_cpu_has_amx_int8ggml_cpu_has_arm_fmaggml_cpu_has_avxggml_cpu_has_avx_vnniggml_cpu_has_avx2ggml_cpu_has_avx512ggml_cpu_has_avx512_bf16ggml_cpu_has_avx512_vbmiggml_cpu_has_avx512_vnniggml_cpu_has_bmi2ggml_cpu_has_dotprodggml_cpu_has_f16cggml_cpu_has_fmaggml_cpu_has_fp16_vaggml_cpu_has_llamafileggml_cpu_has_matmul_int8ggml_cpu_has_neonggml_cpu_has_riscv_vggml_cpu_has_smeggml_cpu_has_sse3ggml_cpu_has_ssse3ggml_cpu_has_sveggml_cpu_has_vsxggml_cpu_has_vxeggml_cpu_has_wasm_simdggml_cpu_mulggml_cpyggml_cyclesggml_cycles_per_msggml_default_mlpggml_denseggml_diagggml_diag_mask_infggml_diag_mask_inf_inplaceggml_diag_mask_zeroggml_divggml_div_inplaceggml_dupggml_dup_inplaceggml_dup_tensorggml_element_sizeggml_eluggml_elu_inplaceggml_embeddingggml_estimate_memoryggml_evaluateggml_expggml_exp_inplaceggml_extractggml_fitggml_fit_optggml_flash_attn_backggml_flash_attn_extggml_floorggml_floor_inplaceggml_freeggml_freeze_weightsggml_ftype_to_ggml_typeggml_gallocr_alloc_graphggml_gallocr_freeggml_gallocr_get_buffer_sizeggml_gallocr_newggml_gallocr_reserveggml_gegluggml_geglu_quickggml_geglu_splitggml_geluggml_gelu_erfggml_gelu_inplaceggml_gelu_quickggml_get_f32ggml_get_f32_ndggml_get_first_tensorggml_get_i32ggml_get_i32_ndggml_get_layerggml_get_max_tensor_sizeggml_get_mem_sizeggml_get_n_threadsggml_get_nameggml_get_next_tensorggml_get_no_allocggml_get_op_paramsggml_get_op_params_f32ggml_get_op_params_i32ggml_get_rel_posggml_get_rowsggml_get_rows_backggml_get_unary_opggml_gluGGML_GLU_OP_GEGLUGGML_GLU_OP_GEGLU_ERFGGML_GLU_OP_GEGLU_QUICKGGML_GLU_OP_REGLUGGML_GLU_OP_SWIGLUGGML_GLU_OP_SWIGLU_OAIggml_glu_splitggml_graph_computeggml_graph_compute_with_ctxggml_graph_dump_dotggml_graph_get_tensorggml_graph_n_nodesggml_graph_nodeggml_graph_overheadggml_graph_printggml_graph_resetggml_graph_viewggml_group_normggml_group_norm_inplaceggml_gruggml_hardsigmoidggml_hardswishggml_im2colggml_initggml_init_autoggml_injectggml_inputggml_is_availableggml_is_contiguousggml_is_contiguous_0ggml_is_contiguous_1ggml_is_contiguous_2ggml_is_contiguous_channelsggml_is_contiguous_rowsggml_is_contiguously_allocatedggml_is_permutedggml_is_quantizedggml_is_transposedggml_l2_normggml_l2_norm_inplaceggml_layer_addggml_layer_batch_normggml_layer_concatenateggml_layer_conv_1dggml_layer_conv_2dggml_layer_denseggml_layer_dropoutggml_layer_embeddingggml_layer_flattenggml_layer_global_average_pooling_2dggml_layer_global_max_pooling_2dggml_layer_gruggml_layer_lstmggml_layer_max_pooling_2dggml_leaky_reluggml_load_modelggml_load_weightsggml_logggml_log_inplaceggml_log_is_r_enabledggml_log_set_defaultggml_log_set_rggml_lstmggml_marshal_modelggml_meanggml_modelggml_model_backendggml_model_sequentialggml_mulggml_mul_inplaceggml_mul_matggml_mul_mat_idggml_n_dimsggml_nbytesggml_negggml_neg_inplaceggml_nelementsggml_new_f32ggml_new_i32ggml_new_tensorggml_new_tensor_1dggml_new_tensor_2dggml_new_tensor_3dggml_new_tensor_4dggml_normggml_norm_inplaceggml_nrowsggml_op_can_inplaceggml_op_descggml_op_nameGGML_OP_POOL_AVGGGML_OP_POOL_MAXggml_op_symbolggml_ops_registryggml_opt_allocggml_opt_context_optimizer_typeggml_opt_dataset_dataggml_opt_dataset_freeggml_opt_dataset_get_batchggml_opt_dataset_initggml_opt_dataset_labelsggml_opt_dataset_ndataggml_opt_dataset_shuffleggml_opt_dataset_weightsggml_opt_default_paramsggml_opt_epochggml_opt_evalggml_opt_fitggml_opt_freeggml_opt_get_lrggml_opt_grad_accggml_opt_initggml_opt_init_for_fitggml_opt_inputsggml_opt_labelsggml_opt_lossggml_opt_loss_type_cross_entropyggml_opt_loss_type_meanggml_opt_loss_type_mseggml_opt_loss_type_sumggml_opt_loss_type_weighted_mseggml_opt_ncorrectggml_opt_optimizer_nameggml_opt_optimizer_type_adamwggml_opt_optimizer_type_sgdggml_opt_outputsggml_opt_predggml_opt_prepare_allocggml_opt_resetggml_opt_result_accuracyggml_opt_result_freeggml_opt_result_initggml_opt_result_lossggml_opt_result_ndataggml_opt_result_predggml_opt_result_resetggml_opt_set_lrggml_opt_static_graphsggml_out_prodggml_padggml_pad_reflect_1dggml_permuteggml_pool_1dggml_pool_2dggml_pop_layerggml_predictggml_predict_classesggml_print_mem_statusggml_print_objectsggml_quant_block_infoggml_quantize_chunkggml_quantize_freeggml_quantize_initggml_quantize_requires_imatrixggml_regluggml_reglu_splitggml_reluggml_relu_inplaceggml_repeatggml_repeat_backggml_resetggml_reshape_1dggml_reshape_2dggml_reshape_3dggml_reshape_4dggml_resultggml_rms_normggml_rms_norm_backggml_rms_norm_inplaceggml_rollggml_ropeggml_rope_extggml_rope_ext_backggml_rope_ext_inplaceggml_rope_inplaceggml_rope_multiggml_rope_multi_inplaceGGML_ROPE_TYPE_MROPEGGML_ROPE_TYPE_NEOXGGML_ROPE_TYPE_NORMGGML_ROPE_TYPE_VISIONggml_roundggml_round_inplaceggml_runggml_save_modelggml_save_weightsggml_scaleggml_scale_inplaceGGML_SCALE_MODE_BICUBICGGML_SCALE_MODE_BILINEARGGML_SCALE_MODE_NEARESTggml_schedule_cosine_decayggml_schedule_reduce_on_plateauggml_schedule_step_decayggml_setggml_set_1dggml_set_2dggml_set_abort_callback_defaultggml_set_abort_callback_rggml_set_f32ggml_set_f32_ndggml_set_i32ggml_set_i32_ndggml_set_inputggml_set_n_threadsggml_set_nameggml_set_no_allocggml_set_omp_threadsggml_set_op_paramsggml_set_op_params_f32ggml_set_op_params_i32ggml_set_outputggml_set_paramggml_set_seedggml_set_zeroggml_sgnggml_sigmoidggml_sigmoid_inplaceggml_siluggml_silu_backggml_silu_inplaceggml_singgml_soft_maxggml_soft_max_extggml_soft_max_ext_backggml_soft_max_ext_back_inplaceggml_soft_max_ext_inplaceggml_soft_max_inplaceggml_softplusggml_softplus_inplaceGGML_SORT_ORDER_ASCGGML_SORT_ORDER_DESCggml_sqrggml_sqr_inplaceggml_sqrtggml_sqrt_inplaceggml_stepggml_subggml_sub_inplaceggml_sumggml_sum_rowsggml_swigluggml_swiglu_splitggml_tanhggml_tanh_inplaceggml_taskggml_tensor_copyggml_tensor_nbggml_tensor_numggml_tensor_overheadggml_tensor_set_f32_scalarggml_tensor_shapeggml_tensor_typeggml_testggml_time_initggml_time_msggml_time_usggml_timestep_embeddingggml_top_kggml_training_historyggml_transposeGGML_TYPE_BF16GGML_TYPE_F16GGML_TYPE_F32GGML_TYPE_I32ggml_type_nameGGML_TYPE_Q2_KGGML_TYPE_Q3_KGGML_TYPE_Q4_0GGML_TYPE_Q4_1GGML_TYPE_Q4_KGGML_TYPE_Q5_KGGML_TYPE_Q6_KGGML_TYPE_Q8_0ggml_type_sizeggml_type_sizefggml_unary_op_nameggml_unfreeze_weightsggml_unmarshal_modelggml_upscaleggml_used_memggml_versionggml_view_1dggml_view_2dggml_view_3dggml_view_4dggml_view_tensorggml_vulkan_availableggml_vulkan_backend_nameggml_vulkan_device_capsggml_vulkan_device_countggml_vulkan_device_descriptionggml_vulkan_device_memoryggml_vulkan_freeggml_vulkan_initggml_vulkan_is_backendggml_vulkan_list_devicesggml_vulkan_statusggml_win_partggml_win_unpartggml_with_temp_ctxggmlr_parsnip_fit_classifggmlr_parsnip_fit_regrgguf_freegguf_loadgguf_metadatagguf_tensor_datagguf_tensor_infogguf_tensor_namesiq2xs_free_impliq2xs_init_impliq3xs_free_impliq3xs_init_impllr_scheduler_cosinelr_scheduler_stepnn_topo_sortonnx_device_infoonnx_inputsonnx_loadonnx_runonnx_summaryoptimizer_adamoptimizer_sgdquantize_iq1_mquantize_iq1_squantize_iq2_squantize_iq2_xsquantize_iq2_xxsquantize_iq3_squantize_iq3_xxsquantize_iq4_nlquantize_iq4_xsquantize_mxfp4quantize_nvfp4quantize_q1_0quantize_q2_Kquantize_q3_Kquantize_q4_0quantize_q4_1quantize_q4_Kquantize_q5_0quantize_q5_1quantize_q5_Kquantize_q6_Kquantize_q8_0quantize_row_iq2_s_refquantize_row_iq3_s_refquantize_row_iq3_xxs_refquantize_row_iq4_nl_refquantize_row_iq4_xs_refquantize_row_mxfp4_refquantize_row_q2_K_refquantize_row_q3_K_refquantize_row_q4_0_refquantize_row_q4_1_refquantize_row_q4_K_refquantize_row_q5_0_refquantize_row_q5_1_refquantize_row_q5_K_refquantize_row_q6_K_refquantize_row_q8_0_refquantize_row_q8_1_refquantize_row_q8_K_refquantize_row_tq1_0_refquantize_row_tq2_0_refquantize_tq1_0quantize_tq2_0RunGGMLwith_grad_tape

Dependencies:genericsR6

Quickstart: from data to prediction in ~10 lines

Rendered fromquickstart.Rmdusingknitr::rmarkdownon Jun 14 2026.

Last update: 2026-06-08
Started: 2026-06-08

Keras-like API in ggmlR

Rendered fromkeras-like-api.Rmdusingknitr::rmarkdownon Jun 14 2026.

Last update: 2026-06-08
Started: 2026-04-13

Autograd Engine

Rendered fromautograd-engine.Rmdusingknitr::rmarkdownon Jun 14 2026.

Last update: 2026-06-08
Started: 2026-04-13

tidymodels / parsnip Integration

Rendered fromtidymodels-integration.Rmdusingknitr::rmarkdownon Jun 14 2026.

Last update: 2026-06-08
Started: 2026-04-13

mlr3 Integration

Rendered frommlr3-integration.Rmdusingknitr::rmarkdownon Jun 14 2026.

Last update: 2026-06-08
Started: 2026-04-13

GPU / Vulkan Backend

Rendered fromgpu-vulkan.Rmdusingknitr::rmarkdownon Jun 14 2026.

Last update: 2026-06-08
Started: 2026-04-13

Quantization

Rendered fromquantization.Rmdusingknitr::rmarkdownon Jun 14 2026.

Last update: 2026-06-08
Started: 2026-02-06

Data-Parallel Training

Rendered fromdata-parallel-training.Rmdusingknitr::rmarkdownon Jun 14 2026.

Last update: 2026-06-08
Started: 2026-04-13

ONNX Model Import

Rendered fromonnx-import.Rmdusingknitr::rmarkdownon Jun 14 2026.

Last update: 2026-06-08
Started: 2026-04-13

Using ggmlR as a Backend in Your Package

Rendered fromembedding-ggmlR.Rmdusingknitr::rmarkdownon Jun 14 2026.

Last update: 2026-06-08
Started: 2026-04-13

Readme and manuals

Help Manual

Help pageTopics
Element-wise addition with broadcastingag_add
Create a Batch Normalisation layerag_batch_norm
Element-wise clampag_clamp
Categorical Cross-Entropy lossag_cross_entropy_loss
Create a mini-batch data loaderag_dataloader
Return the current default compute deviceag_default_device
Return the current default dtype for GPU operationsag_default_dtype
Set the default compute device for ag_* operationsag_device
Create a Dropout layerag_dropout
Set the default floating-point precision for ag_* GPU operationsag_dtype
Create an Embedding layerag_embedding
Switch a layer or sequential model to eval modeag_eval
Element-wise exponentialag_exp
Numerical gradient check (like torch.autograd.gradcheck)ag_gradcheck
Create a dense layer with learnable parametersag_linear
Load an autograd module from a saved stateag_load_model
Element-wise natural logarithmag_log
Matrix multiplicationag_matmul
Mean of elements (or along a dim)ag_mean
Mean Squared Error lossag_mse_loss
Element-wise multiplicationag_mul
Create a Multi-Head Attention layerag_multihead_attention
Create a parameter tensor (gradient tracked)ag_param
Element-wise powerag_pow
ReLU activationag_relu
Reshape tensorag_reshape
Save an autograd module's state to diskag_save_model
Scale tensor by a scalar constantag_scale
Create a sequential container of layersag_sequential
Sigmoid activationag_sigmoid
Softmax activation (column-wise)ag_softmax
Fused softmax + cross-entropy loss (numerically stable)ag_softmax_cross_entropy_loss
Element-wise subtractionag_sub
Sum all elements (or along a dim): out = sum(x)ag_sum
Tanh activationag_tanh
Create a dynamic tensor (no gradient tracking)ag_tensor
Move a tensor to the specified deviceag_to_device
Switch a layer or sequential model to training modeag_train
Transpose a tensorag_transpose
Augment new data with predictions from a fitted ggml parsnip modelaugment.ggmlr_parsnip_model
Run backward pass from a scalar loss tensorbackward
Clip gradients by global L2 normclip_grad_norm
Compile a Modelcompile.ggml_functional_model compile.ggml_sequential_model
Dequantize Row (IQ)dequantize_row_iq1_m dequantize_row_iq1_s dequantize_row_iq2_s dequantize_row_iq2_xs dequantize_row_iq2_xxs dequantize_row_iq3_s dequantize_row_iq3_xxs dequantize_row_iq4_nl dequantize_row_iq4_xs
Dequantize Row (MXFP4)dequantize_row_mxfp4
Dequantize NVFP4 Datadequantize_row_nvfp4
Dequantize Q1_0 Datadequantize_row_q1_0
Dequantize Row (K-quants)dequantize_row_q2_K dequantize_row_q3_K dequantize_row_q4_K dequantize_row_q5_K dequantize_row_q6_K dequantize_row_q8_K
Dequantize Row (Q4_0)dequantize_row_q4_0 dequantize_row_q4_1 dequantize_row_q5_0 dequantize_row_q5_1 dequantize_row_q8_0
Dequantize Row (Ternary)dequantize_row_tq1_0 dequantize_row_tq2_0
Data-parallel training across multiple GPUsdp_train
Evaluate a Modelevaluate.ggml_functional_model evaluate.ggml_sequential_model
Train a Modelfit.ggml_functional_model fit.ggml_sequential_model
Check if R Abort Handler is Enabledggml_abort_is_r_enabled
Absolute Value (Graph)ggml_abs
Absolute Value In-place (Graph)ggml_abs_inplace
Add tensorsggml_add
Element-wise Addition In-place (Graph)ggml_add_inplace
Add Relative Position Bias (Graph)ggml_add_rel_pos
Add Scalar to Tensor (Graph)ggml_add1
Apply a Layer Object to a Tensor Nodeggml_apply
Arange (Graph)ggml_arange
Check if Two Tensors Have the Same Layoutggml_are_same_layout
Compare Tensor Shapesggml_are_same_shape
Compare Tensor Stridesggml_are_same_stride
Argmax (Graph)ggml_argmax
Argsort - Get Sorting Indices (Graph)ggml_argsort
Allocate Context Tensors to Backendggml_backend_alloc_ctx_tensors
Clear buffer memoryggml_backend_buffer_clear
Free Backend Bufferggml_backend_buffer_free
Get Backend Buffer Sizeggml_backend_buffer_get_size
Get buffer usageggml_backend_buffer_get_usage
Check if buffer is host memoryggml_backend_buffer_is_host
Check if buffer is a multi-bufferggml_backend_buffer_is_multi_buffer
Get Backend Buffer Nameggml_backend_buffer_name
Reset bufferggml_backend_buffer_reset
Set buffer usage hintggml_backend_buffer_set_usage
Buffer usage: Anyggml_backend_buffer_usage_any
Buffer usage: Computeggml_backend_buffer_usage_compute
Buffer usage: Weightsggml_backend_buffer_usage_weights
Initialize CPU Backendggml_backend_cpu_init
Set CPU Backend Threadsggml_backend_cpu_set_n_threads
Get device by nameggml_backend_dev_by_name
Get device by typeggml_backend_dev_by_type
Get number of available devicesggml_backend_dev_count
Get device descriptionggml_backend_dev_description
Get device by indexggml_backend_dev_get
Get device propertiesggml_backend_dev_get_props
Initialize backend from deviceggml_backend_dev_init
Get device memoryggml_backend_dev_memory
Get device nameggml_backend_dev_name
Check if device should offload operationggml_backend_dev_offload_op
Check if device supports buffer typeggml_backend_dev_supports_buft
Check if device supports operationggml_backend_dev_supports_op
Get device typeggml_backend_dev_type
Register a deviceggml_backend_device_register
Device type: Acceleratorggml_backend_device_type_accel
Device type: CPUggml_backend_device_type_cpu
Device type: GPUggml_backend_device_type_gpu
Device type: Integrated GPUggml_backend_device_type_igpu
Free eventggml_backend_event_free
Create new eventggml_backend_event_new
Record eventggml_backend_event_record
Synchronize eventggml_backend_event_synchronize
Wait for eventggml_backend_event_wait
Free Backendggml_backend_free
Get device from backendggml_backend_get_device
Compute Graph with Backendggml_backend_graph_compute
Compute graph asynchronouslyggml_backend_graph_compute_async
Execute graph planggml_backend_graph_plan_compute
Create graph execution planggml_backend_graph_plan_create
Free graph execution planggml_backend_graph_plan_free
Initialize best available backendggml_backend_init_best
Initialize backend by nameggml_backend_init_by_name
Initialize backend by typeggml_backend_init_by_type
Load backend from dynamic libraryggml_backend_load
Load all available backendsggml_backend_load_all
Create a Meta Backend Deviceggml_backend_meta_device
Allocate multi-bufferggml_backend_multi_buffer_alloc_buffer
Set usage for all buffers in a multi-bufferggml_backend_multi_buffer_set_usage
Get Backend Nameggml_backend_name
Get backend registry by nameggml_backend_reg_by_name
Get number of registered backendsggml_backend_reg_count
Get number of devices in registryggml_backend_reg_dev_count
Get device from registryggml_backend_reg_dev_get
Get backend registry by indexggml_backend_reg_get
Get registry nameggml_backend_reg_name
Register a backendggml_backend_register
Allocate graph on schedulerggml_backend_sched_alloc_graph
Free backend schedulerggml_backend_sched_free
Get backend from schedulerggml_backend_sched_get_backend
Get number of backends in schedulerggml_backend_sched_get_n_backends
Get number of tensor copiesggml_backend_sched_get_n_copies
Get number of graph splitsggml_backend_sched_get_n_splits
Get tensor backend assignmentggml_backend_sched_get_tensor_backend
Compute graph using schedulerggml_backend_sched_graph_compute
Compute graph asynchronouslyggml_backend_sched_graph_compute_async
Create a new backend schedulerggml_backend_sched_new
Reserve memory for schedulerggml_backend_sched_reserve
Reset schedulerggml_backend_sched_reset
Set tensor backend assignmentggml_backend_sched_set_tensor_backend
Synchronize schedulerggml_backend_sched_synchronize
Synchronize backendggml_backend_synchronize
Copy tensor asynchronously between backendsggml_backend_tensor_copy_async
Backend Tensor Get and Syncggml_backend_tensor_get_and_sync
Get tensor data asynchronouslyggml_backend_tensor_get_async
Get Tensor Data via Backendggml_backend_tensor_get_data
Get First Float from Backend Tensorggml_backend_tensor_get_f32_first
Set tensor data asynchronouslyggml_backend_tensor_set_async
Set Tensor Data via Backendggml_backend_tensor_set_data
Unload backendggml_backend_unload
Create a Batch Normalization Layer Objectggml_batch_norm
Get Block Sizeggml_blck_size
Build forward expandggml_build_forward_expand
Early stopping callbackggml_callback_early_stopping
Check If Tensor Can Be Repeatedggml_can_repeat
Ceiling (Graph)ggml_ceil
Ceiling In-place (Graph)ggml_ceil_inplace
Clamp (Graph)ggml_clamp
Compile a Sequential Modelggml_compile ggml_compile.ggml_functional_model ggml_compile.ggml_sequential_model
Concatenate Tensors (Graph)ggml_concat
Make Contiguous (Graph)ggml_cont
1D Convolution (Graph)ggml_conv_1d
Depthwise 1D Convolution (Graph)ggml_conv_1d_dw
2D Convolution (Graph)ggml_conv_2d
Direct 2D Convolution (Graph)ggml_conv_2d_direct
Depthwise 2D Convolution (Graph)ggml_conv_2d_dw
Depthwise 2D Convolution, direct (Graph)ggml_conv_2d_dw_direct
Transposed 1D Convolution (Graph)ggml_conv_transpose_1d
Transposed 2D Convolution, zero padding (Graph)ggml_conv_transpose_2d_p0
Cosine (Graph)ggml_cos
Count Equal Elements (Graph)ggml_count_equal
Element-wise Addition (CPU Direct)ggml_cpu_add
Get All CPU Featuresggml_cpu_features
Get RISC-V Vector Lengthggml_cpu_get_rvv_vlen
Get SVE Vector Length (ARM)ggml_cpu_get_sve_cnt
CPU Feature Detection - AMX INT8ggml_cpu_has_amx_int8
CPU Feature Detection - ARM FMAggml_cpu_has_arm_fma
CPU Feature Detection - AVXggml_cpu_has_avx
CPU Feature Detection - AVX-VNNIggml_cpu_has_avx_vnni
CPU Feature Detection - AVX2ggml_cpu_has_avx2
CPU Feature Detection - AVX-512ggml_cpu_has_avx512
CPU Feature Detection - AVX-512 BF16ggml_cpu_has_avx512_bf16
CPU Feature Detection - AVX-512 VBMIggml_cpu_has_avx512_vbmi
CPU Feature Detection - AVX-512 VNNIggml_cpu_has_avx512_vnni
CPU Feature Detection - BMI2ggml_cpu_has_bmi2
CPU Feature Detection - Dot Product (ARM)ggml_cpu_has_dotprod
CPU Feature Detection - F16Cggml_cpu_has_f16c
CPU Feature Detection - FMAggml_cpu_has_fma
CPU Feature Detection - FP16 Vector Arithmetic (ARM)ggml_cpu_has_fp16_va
CPU Feature Detection - Llamafileggml_cpu_has_llamafile
CPU Feature Detection - INT8 Matrix Multiply (ARM)ggml_cpu_has_matmul_int8
CPU Feature Detection - NEON (ARM)ggml_cpu_has_neon
CPU Feature Detection - RISC-V Vectorggml_cpu_has_riscv_v
CPU Feature Detection - SME (ARM)ggml_cpu_has_sme
CPU Feature Detection - SSE3ggml_cpu_has_sse3
CPU Feature Detection - SSSE3ggml_cpu_has_ssse3
CPU Feature Detection - SVE (ARM)ggml_cpu_has_sve
CPU Feature Detection - VSX (PowerPC)ggml_cpu_has_vsx
CPU Feature Detection - VXE (IBM z/Architecture)ggml_cpu_has_vxe
CPU Feature Detection - WebAssembly SIMDggml_cpu_has_wasm_simd
Element-wise Multiplication (CPU Direct)ggml_cpu_mul
Copy Tensor with Type Conversion (Graph)ggml_cpy
Get CPU Cyclesggml_cycles
Get CPU Cycles per Millisecondggml_cycles_per_ms
Default MLP builder for classification and regressionggml_default_mlp
Create a Dense Layer Objectggml_dense
Diagonal Matrix (Graph)ggml_diag
Diagonal Mask with -Inf (Graph)ggml_diag_mask_inf
Diagonal Mask with -Inf In-place (Graph)ggml_diag_mask_inf_inplace
Diagonal Mask with Zero (Graph)ggml_diag_mask_zero
Element-wise Division (Graph)ggml_div
Element-wise Division In-place (Graph)ggml_div_inplace
Duplicate Tensor (Graph)ggml_dup
Duplicate Tensor In-place (Graph)ggml_dup_inplace
Duplicate Tensorggml_dup_tensor
Get Element Sizeggml_element_size
ELU Activation (Graph)ggml_elu
ELU Activation In-place (Graph)ggml_elu_inplace
Create an Embedding Layer Objectggml_embedding
Estimate Required Memoryggml_estimate_memory
Evaluate a Trained Modelggml_evaluate ggml_evaluate.ggml_functional_model ggml_evaluate.ggml_sequential_model
Exponential (Graph)ggml_exp
Exponential In-place (Graph)ggml_exp_inplace
Extract a feature-by-cell matrix from a single-cell containerggml_extract ggml_extract.dgCMatrix ggml_extract.matrix ggml_extract.Seurat
Fit model with R-side epoch loop and callbacksggml_fit_opt
Train a Model (dispatcher)ggml_fit ggml_fit.default ggml_fit.ggml_functional_model ggml_fit.ggml_sequential_model
Flash Attention Backward (Graph)ggml_flash_attn_back
Flash Attention (Graph)ggml_flash_attn_ext
Floor (Graph)ggml_floor
Floor In-place (Graph)ggml_floor_inplace
Free GGML contextggml_free
Freeze Layer Weightsggml_freeze_weights
Convert ftype to ggml_typeggml_ftype_to_ggml_type
Allocate Memory for Graphggml_gallocr_alloc_graph
Free Graph Allocatorggml_gallocr_free
Get Graph Allocator Buffer Sizeggml_gallocr_get_buffer_size
Create Graph Allocatorggml_gallocr_new
Reserve Memory for Graphggml_gallocr_reserve
GeGLU (GELU Gated Linear Unit) (Graph)ggml_geglu
GeGLU Quick (Fast GeGLU) (Graph)ggml_geglu_quick
GeGLU Split (Graph)ggml_geglu_split
GELU Activation (Graph)ggml_gelu
Exact GELU Activation (Graph)ggml_gelu_erf
GELU Activation In-place (Graph)ggml_gelu_inplace
GELU Quick Activation (Graph)ggml_gelu_quick
Get F32 dataggml_get_f32
Get Single Float Value by N-D Indexggml_get_f32_nd
Get First Tensor from Contextggml_get_first_tensor
Get I32 Dataggml_get_i32
Get Single Int32 Value by N-D Indexggml_get_i32_nd
Get a Layer from a Sequential Modelggml_get_layer
Get Maximum Tensor Sizeggml_get_max_tensor_size
Get Context Memory Sizeggml_get_mem_size
Get Number of Threadsggml_get_n_threads
Get Tensor Nameggml_get_name
Get Next Tensor from Contextggml_get_next_tensor
Get No Allocation Modeggml_get_no_alloc
Get Tensor Operation Parametersggml_get_op_params
Get Float Op Parameterggml_get_op_params_f32
Get Integer Op Parameterggml_get_op_params_i32
Get Relative Position (Graph)ggml_get_rel_pos
Get Rows by Indices (Graph)ggml_get_rows
Get Rows Backward (Graph)ggml_get_rows_back
Get Unary Operation from Tensorggml_get_unary_op
Generic GLU (Gated Linear Unit) (Graph)ggml_glu
GLU Operation TypesGGML_GLU_OP_GEGLU GGML_GLU_OP_GEGLU_ERF GGML_GLU_OP_GEGLU_QUICK GGML_GLU_OP_REGLU GGML_GLU_OP_SWIGLU GGML_GLU_OP_SWIGLU_OAI
Generic GLU Split (Graph)ggml_glu_split
Compute graphggml_graph_compute
Compute Graph with Context (Alternative Method)ggml_graph_compute_with_ctx
Export Graph to DOT Formatggml_graph_dump_dot
Get Tensor from Graph by Nameggml_graph_get_tensor
Get Number of Nodes in Graphggml_graph_n_nodes
Get Graph Nodeggml_graph_node
Get Graph Overheadggml_graph_overhead
Print Graph Informationggml_graph_print
Reset Graph (for backpropagation)ggml_graph_reset
Create a View of a Subgraphggml_graph_view
Group Normalization (Graph)ggml_group_norm
Group Normalization In-place (Graph)ggml_group_norm_inplace
Create a GRU Layer Objectggml_gru
Hard Sigmoid Activation (Graph)ggml_hardsigmoid
Hard Swish Activation (Graph)ggml_hardswish
Image to Column (Graph)ggml_im2col
Initialize GGML contextggml_init
Create Context with Auto-sizingggml_init_auto
Inject a single-cell result back into its containerggml_inject ggml_inject.Seurat
Declare a Functional API Input Tensorggml_input
Check if GGML is availableggml_is_available
Check if Tensor is Contiguousggml_is_contiguous
Check Tensor Contiguity (Dimension 0)ggml_is_contiguous_0
Check Tensor Contiguity (Dimensions >= 1)ggml_is_contiguous_1
Check Tensor Contiguity (Dimensions >= 2)ggml_is_contiguous_2
Check Channel-wise Contiguityggml_is_contiguous_channels
Check Row-wise Contiguityggml_is_contiguous_rows
Check If Tensor is Contiguously Allocatedggml_is_contiguously_allocated
Check if Tensor is Permutedggml_is_permuted
Check If Type is Quantizedggml_is_quantized
Check if Tensor is Transposedggml_is_transposed
L2 Normalization (Graph)ggml_l2_norm
L2 Normalization In-place (Graph)ggml_l2_norm_inplace
Element-wise Addition of Two Tensor Nodesggml_layer_add
Add Batch Normalization Layerggml_layer_batch_norm
Concatenate Tensor Nodes Along an Axisggml_layer_concatenate
Create a Conv1D Layer Objectggml_layer_conv_1d
Create a Conv2D Layer Objectggml_layer_conv_2d
Add Dense (Fully Connected) Layerggml_layer_dense
Add Dropout Layerggml_layer_dropout
Add Embedding Layerggml_layer_embedding
Add Flatten Layerggml_layer_flatten
Global Average Pooling for 2D Feature Mapsggml_layer_global_average_pooling_2d
Global Max Pooling for 2D Feature Mapsggml_layer_global_max_pooling_2d
Add a GRU Layerggml_layer_gru
Add an LSTM Layerggml_layer_lstm
Add 2D Max Pooling Layerggml_layer_max_pooling_2d
Leaky ReLU Activation (Graph)ggml_leaky_relu
Load a Full Model (Architecture + Weights)ggml_load_model
Load Model Weights from Fileggml_load_weights
Natural Logarithm (Graph)ggml_log
Natural Logarithm In-place (Graph)ggml_log_inplace
Check if R Logging is Enabledggml_log_is_r_enabled
Restore Default GGML Loggingggml_log_set_default
Enable R-compatible GGML Loggingggml_log_set_r
Create an LSTM Layer Objectggml_lstm
Marshal a ggmlR model to an in-memory containerggml_marshal_model
Mean (Graph)ggml_mean
Create a Functional Modelggml_model
Backend a fitted ggml model actually ran onggml_model_backend
Create a Sequential Neural Network Modelggml_model_sequential
Multiply tensorsggml_mul
Element-wise Multiplication In-place (Graph)ggml_mul_inplace
Matrix Multiplication (Graph)ggml_mul_mat
Matrix Multiplication with Expert Selection (Graph)ggml_mul_mat_id
Get Number of Dimensionsggml_n_dims
Get number of bytesggml_nbytes
Negation (Graph)ggml_neg
Negation In-place (Graph)ggml_neg_inplace
Get number of elementsggml_nelements
Create Scalar F32 Tensorggml_new_f32
Create Scalar I32 Tensorggml_new_i32
Create Tensor with Arbitrary Dimensionsggml_new_tensor
Create 1D tensorggml_new_tensor_1d
Create 2D tensorggml_new_tensor_2d
Create 3D Tensorggml_new_tensor_3d
Create 4D Tensorggml_new_tensor_4d
Layer Normalization (Graph)ggml_norm
Layer Normalization In-place (Graph)ggml_norm_inplace
Get Number of Rowsggml_nrows
Check if Operation Can Be Done In-placeggml_op_can_inplace
Get Operation Description from Tensorggml_op_desc
Get Operation Nameggml_op_name
Get Operation Symbolggml_op_symbol
Supported single-cell operationsggml_ops_registry
Allocate graph for evaluationggml_opt_alloc
Get optimizer type from contextggml_opt_context_optimizer_type
Get data tensor from datasetggml_opt_dataset_data
Free optimization datasetggml_opt_dataset_free
Get batch from datasetggml_opt_dataset_get_batch
Create a new optimization datasetggml_opt_dataset_init
Get labels tensor from datasetggml_opt_dataset_labels
Get number of datapoints in datasetggml_opt_dataset_ndata
Shuffle datasetggml_opt_dataset_shuffle
Get dataset per-datapoint weights tensorggml_opt_dataset_weights
Get default optimizer parametersggml_opt_default_params
Run one training epochggml_opt_epoch
Evaluate modelggml_opt_eval
Fit model to datasetggml_opt_fit
Free optimizer contextggml_opt_free
Get current learning rate from optimizer contextggml_opt_get_lr
Get gradient accumulator for a tensorggml_opt_grad_acc
Initialize optimizer contextggml_opt_init
Initialize optimizer context for R-side epoch loopggml_opt_init_for_fit
Get inputs tensor from optimizer contextggml_opt_inputs
Get labels tensor from optimizer contextggml_opt_labels
Get loss tensor from optimizer contextggml_opt_loss
Loss type: Cross Entropyggml_opt_loss_type_cross_entropy
Loss type: Meanggml_opt_loss_type_mean
Loss type: Mean Squared Errorggml_opt_loss_type_mse
Loss type: Sumggml_opt_loss_type_sum
Loss type: Weighted Mean Squared Errorggml_opt_loss_type_weighted_mse
Get number of correct predictions tensorggml_opt_ncorrect
Get optimizer nameggml_opt_optimizer_name
Optimizer type: AdamWggml_opt_optimizer_type_adamw
Optimizer type: SGDggml_opt_optimizer_type_sgd
Get outputs tensor from optimizer contextggml_opt_outputs
Get predictions tensor from optimizer contextggml_opt_pred
Prepare allocation for non-static graphsggml_opt_prepare_alloc
Reset optimizer contextggml_opt_reset
Get accuracy from resultggml_opt_result_accuracy
Free optimization resultggml_opt_result_free
Initialize optimization resultggml_opt_result_init
Get loss from resultggml_opt_result_loss
Get number of datapoints from resultggml_opt_result_ndata
Get predictions from resultggml_opt_result_pred
Reset optimization resultggml_opt_result_reset
Set learning rate in optimizer contextggml_opt_set_lr
Check if using static graphsggml_opt_static_graphs
Outer Product (Graph)ggml_out_prod
Pad Tensor with Zeros (Graph)ggml_pad
Reflective 1D Padding (Graph)ggml_pad_reflect_1d
Permute Tensor Dimensions (Graph)ggml_permute
1D Pooling (Graph)GGML_OP_POOL_AVG GGML_OP_POOL_MAX ggml_pool_1d
2D Pooling (Graph)ggml_pool_2d
Remove the Last Layer from a Sequential Modelggml_pop_layer
Predict Classes from a Trained Modelggml_predict_classes
Get Predictions from a Trained Modelggml_predict ggml_predict.ggml_functional_model ggml_predict.ggml_sequential_model
Print Context Memory Statusggml_print_mem_status
Print Objects in Contextggml_print_objects
Get Quantization Block Infoggml_quant_block_info
Quantize Data Chunkggml_quantize_chunk
Free Quantization Resourcesggml_quantize_free
Initialize Quantization Tablesggml_quantize_init
Check if Quantization Requires Importance Matrixggml_quantize_requires_imatrix
ReGLU (ReLU Gated Linear Unit) (Graph)ggml_reglu
ReGLU Split (Graph)ggml_reglu_split
ReLU Activation (Graph)ggml_relu
ReLU Activation In-place (Graph)ggml_relu_inplace
Repeat (Graph)ggml_repeat
Repeat Backward (Graph)ggml_repeat_back
Reset GGML Contextggml_reset
Reshape to 1D (Graph)ggml_reshape_1d
Reshape to 2D (Graph)ggml_reshape_2d
Reshape to 3D (Graph)ggml_reshape_3d
Reshape to 4D (Graph)ggml_reshape_4d
Construct a single-cell resultggml_result
RMS Normalization (Graph)ggml_rms_norm
RMS Norm Backward (Graph)ggml_rms_norm_back
RMS Normalization In-place (Graph)ggml_rms_norm_inplace
Roll (Graph)ggml_roll
Rotary Position Embedding (Graph)ggml_rope
Extended RoPE with Frequency Scaling (Graph)ggml_rope_ext
RoPE Extended Backward (Graph)ggml_rope_ext_back
Extended RoPE Inplace (Graph)ggml_rope_ext_inplace
Rotary Position Embedding In-place (Graph)ggml_rope_inplace
Multi-RoPE for Vision Models (Graph)ggml_rope_multi
Multi-RoPE Inplace (Graph)ggml_rope_multi_inplace
Round (Graph)ggml_round
Round In-place (Graph)ggml_round_inplace
Run a single-cell task on the GGML backendggml_run
Save a Full Model (Architecture + Weights)ggml_save_model
Save Model Weights to Fileggml_save_weights
Scale (Graph)ggml_scale
Scale Tensor In-place (Graph)ggml_scale_inplace
Cosine annealing LR schedulerggml_schedule_cosine_decay
Reduce on plateau LR schedulerggml_schedule_reduce_on_plateau
Step decay LR schedulerggml_schedule_step_decay
Set Tensor Region (Graph)ggml_set
Set 1D Tensor Region (Graph)ggml_set_1d
Set 2D Tensor Region (Graph)ggml_set_2d
Restore Default Abort Behaviorggml_set_abort_callback_default
Enable R-compatible Abort Handlingggml_set_abort_callback_r
Set F32 dataggml_set_f32
Set Single Float Value by N-D Indexggml_set_f32_nd
Set I32 Dataggml_set_i32
Set Single Int32 Value by N-D Indexggml_set_i32_nd
Mark Tensor as Inputggml_set_input
Set Number of Threadsggml_set_n_threads
Set Tensor Nameggml_set_name
Set No Allocation Modeggml_set_no_alloc
Set OpenMP Thread Countggml_set_omp_threads
Set Tensor Operation Parametersggml_set_op_params
Set Float Op Parameterggml_set_op_params_f32
Set Integer Op Parameterggml_set_op_params_i32
Mark Tensor as Outputggml_set_output
Set Tensor as Trainable Parameterggml_set_param
Set the random seed for reproducible ggmlR runsggml_set_seed
Set Tensor to Zeroggml_set_zero
Sign Function (Graph)ggml_sgn
Sigmoid Activation (Graph)ggml_sigmoid
Sigmoid Activation In-place (Graph)ggml_sigmoid_inplace
SiLU Activation (Graph)ggml_silu
SiLU Backward (Graph)ggml_silu_back
SiLU Activation In-place (Graph)ggml_silu_inplace
Sine (Graph)ggml_sin
Softmax (Graph)ggml_soft_max
Extended Softmax with Masking and Scaling (Graph)ggml_soft_max_ext
Softmax Backward Extended (Graph)ggml_soft_max_ext_back
Extended Softmax Backward Inplace (Graph)ggml_soft_max_ext_back_inplace
Extended Softmax Inplace (Graph)ggml_soft_max_ext_inplace
Softmax In-place (Graph)ggml_soft_max_inplace
Softplus Activation (Graph)ggml_softplus
Softplus Activation In-place (Graph)ggml_softplus_inplace
Sort Order ConstantsGGML_SORT_ORDER_ASC GGML_SORT_ORDER_DESC
Square (Graph)ggml_sqr
Square In-place (Graph)ggml_sqr_inplace
Square Root (Graph)ggml_sqrt
Square Root In-place (Graph)ggml_sqrt_inplace
Step Function (Graph)ggml_step
Element-wise Subtraction (Graph)ggml_sub
Element-wise Subtraction In-place (Graph)ggml_sub_inplace
Sum (Graph)ggml_sum
Sum Rows (Graph)ggml_sum_rows
SwiGLU (Swish/SiLU Gated Linear Unit) (Graph)ggml_swiglu
SwiGLU Split (Graph)ggml_swiglu_split
Tanh Activation (Graph)ggml_tanh
Tanh Activation In-place (Graph)ggml_tanh_inplace
Construct a single-cell compute taskggml_task
Copy Tensor Dataggml_tensor_copy
Get Tensor Strides (nb)ggml_tensor_nb
Count Tensors in Contextggml_tensor_num
Get Tensor Overheadggml_tensor_overhead
Fill Tensor with Scalarggml_tensor_set_f32_scalar
Get Tensor Shapeggml_tensor_shape
Get Tensor Typeggml_tensor_type
Test GGMLggml_test
Initialize GGML Timerggml_time_init
Get Time in Millisecondsggml_time_ms
Get Time in Microsecondsggml_time_us
Timestep Embedding (Graph Operation)ggml_timestep_embedding
Top-K Indices (Graph)ggml_top_k
Training history of a fitted ggml modelggml_training_history
Transpose (Graph)ggml_transpose
GGML Data TypesGGML_TYPE_BF16 GGML_TYPE_F16 GGML_TYPE_F32 GGML_TYPE_I32 GGML_TYPE_Q2_K GGML_TYPE_Q3_K GGML_TYPE_Q4_0 GGML_TYPE_Q4_1 GGML_TYPE_Q4_K GGML_TYPE_Q5_K GGML_TYPE_Q6_K GGML_TYPE_Q8_0
Get Type Nameggml_type_name
Get Type Size in Bytesggml_type_size
Get Type Size as Floatggml_type_sizef
Get Unary Operation Nameggml_unary_op_name
Unfreeze Layer Weightsggml_unfreeze_weights
Unmarshal a ggmlR model from an in-memory containerggml_unmarshal_model
Upscale Tensor (Graph)GGML_SCALE_MODE_BICUBIC GGML_SCALE_MODE_BILINEAR GGML_SCALE_MODE_NEAREST ggml_upscale
Get Used Memoryggml_used_mem
Get GGML versionggml_version
1D View with Byte Offset (Graph)ggml_view_1d
2D View with Byte Offset (Graph)ggml_view_2d
3D View with Byte Offset (Graph)ggml_view_3d
4D View with Byte Offset (Graph)ggml_view_4d
View Tensorggml_view_tensor
Check if Vulkan support is availableggml_vulkan_available
Get Vulkan backend nameggml_vulkan_backend_name
Get Vulkan device capabilitiesggml_vulkan_device_caps
Get number of Vulkan devicesggml_vulkan_device_count
Get Vulkan device descriptionggml_vulkan_device_description
Get Vulkan device memoryggml_vulkan_device_memory
Free Vulkan backendggml_vulkan_free
Initialize Vulkan backendggml_vulkan_init
Check if backend is Vulkanggml_vulkan_is_backend
List all Vulkan devicesggml_vulkan_list_devices
Print Vulkan statusggml_vulkan_status
Window Partition (Graph)ggml_win_part
Window Un-partition (Graph)ggml_win_unpart
Execute with Temporary Contextggml_with_temp_ctx
Free GGUF Resourcesgguf_free
Load a GGUF Filegguf_load
Get GGUF Metadatagguf_metadata
Extract Tensor Datagguf_tensor_data
Get Tensor Infogguf_tensor_info
List Tensor Names in a GGUF Filegguf_tensor_names
One-row summary of a fitted ggml parsnip modelglance.ggmlr_parsnip_model
Free IQ2 Quantization Tablesiq2xs_free_impl
Initialize IQ2 Quantization Tablesiq2xs_init_impl
Free IQ3 Quantization Tablesiq3xs_free_impl
Initialize IQ3 Quantization Tablesiq3xs_init_impl
Cosine-annealing learning rate schedulerlr_scheduler_cosine
Step-decay learning rate schedulerlr_scheduler_step
Topologically sort nodes reachable from output nodesnn_topo_sort
ONNX model device/scheduler diagnosticsonnx_device_info
List ONNX model inputsonnx_inputs
Load an ONNX modelonnx_load
Run ONNX model inferenceonnx_run
ONNX model summaryonnx_summary
Create an Adam optimizeroptimizer_adam
Create an SGD optimizeroptimizer_sgd
Plot training historyplot.ggml_history
Predict with a Trained Modelpredict.ggml_functional_model predict.ggml_sequential_model
Print method for ag_tensorprint.ag_tensor
Print method for ggml_functional_modelprint.ggml_functional_model
Print method for ggml_historyprint.ggml_history
Print method for ggml_sequential_modelprint.ggml_sequential_model
Print ONNX model summaryprint.onnx_model
Quantize Data (IQ)quantize_iq1_m quantize_iq1_s quantize_iq2_s quantize_iq2_xs quantize_iq2_xxs quantize_iq3_s quantize_iq3_xxs quantize_iq4_nl quantize_iq4_xs
Quantize Data (MXFP4)quantize_mxfp4
Quantize Data (NVFP4)quantize_nvfp4
Quantize Data (Q1_0)quantize_q1_0
Quantize Data (K-quants)quantize_q2_K quantize_q3_K quantize_q4_K quantize_q5_K quantize_q6_K
Quantize Data (Q4_0)quantize_q4_0 quantize_q4_1 quantize_q5_0 quantize_q5_1 quantize_q8_0
Quantize Row Reference (IQ)quantize_row_iq2_s_ref quantize_row_iq3_s_ref quantize_row_iq3_xxs_ref quantize_row_iq4_nl_ref quantize_row_iq4_xs_ref
Quantize Row Reference (MXFP4)quantize_row_mxfp4_ref
Quantize Row Reference (K-quants)quantize_row_q2_K_ref quantize_row_q3_K_ref quantize_row_q4_K_ref quantize_row_q5_K_ref quantize_row_q6_K_ref quantize_row_q8_K_ref
Quantize Row Reference (Basic)quantize_row_q4_0_ref quantize_row_q4_1_ref quantize_row_q5_0_ref quantize_row_q5_1_ref quantize_row_q8_0_ref quantize_row_q8_1_ref
Quantize Row Reference (Ternary)quantize_row_tq1_0_ref quantize_row_tq2_0_ref
Quantize Data (Ternary)quantize_tq1_0 quantize_tq2_0
RoPE Mode ConstantsGGML_ROPE_TYPE_MROPE GGML_ROPE_TYPE_NEOX GGML_ROPE_TYPE_NORM GGML_ROPE_TYPE_VISION rope_types
Run a GGML GPU operation on a Seurat objectRunGGML RunGGML.default RunGGML.Seurat
Summary method for ggml_sequential_modelsummary.ggml_sequential_model
Tidy a fitted ggml parsnip model into a per-layer tabletidy.ggmlr_parsnip_model
Run code with gradient tape enabledwith_grad_tape