]> git.djapps.eu Git - pkg/ggml/sources/llama.cpp/commitdiff
convert : use n_groups instead of hardcoded values in reshape (#18929)
authorDaniel Bevenius <redacted>
Tue, 20 Jan 2026 05:55:24 +0000 (06:55 +0100)
committerGitHub <redacted>
Tue, 20 Jan 2026 05:55:24 +0000 (06:55 +0100)
* convert : use n_groups instead of hardcoded values in reshape

This commit modifies the conversion script for NemotronHModel to use
the 'n_groups' hyperparameter, and allow Python to calculate the the
last dimension, using -1, when reshaping the 'mixer.norm.weight' tensor.

* use self.n_group instead of self.hparams["n_groups"]

convert_hf_to_gguf.py

index becbad046d7a522994091a0fedb5fdd36cb86c28..ab015dd2c3a2d30125490ea52606496828cfdb0c 100755 (executable)
@@ -9212,7 +9212,7 @@ class NemotronHModel(GraniteHybridModel):
                 return [(mapped_name, reshaped_data)]
 
             if name.endswith("mixer.norm.weight"):
-                reshaped_data = data_torch.reshape(8, 512)
+                reshaped_data = data_torch.reshape(self.n_group, -1)
                 mapped_name = self.map_tensor_name(name)
                 return [(mapped_name, reshaped_data)]