Pruning
BMPrune
Here is an example configuration for BMPrune:
"pruning": {
    "is_pruning": true,
    "pruning_mask_path": "prune_mask.bin",
    "pruned_module": ["ffn.ffn.w_in.w.weight", "ffn.ffn.w_out.weight", "input_embedding"],
    "mask_method": "m4n2_1d"
}
Practitioners turn on pruning by setting is_pruning to true. The pruning mask is stored at the path given by pruning_mask_path. The modules to prune are listed in pruned_module; to keep the list short, practitioners need only provide the suffix of each module name. The mask_method field selects the algorithm used to compute the pruning mask.
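To make the roles of pruned_module and mask_method more concrete, here is a minimal sketch in plain Python. It is not BMPrune's implementation. It assumes that suffix matching is a plain string-suffix test on parameter names, and that "m4n2_1d" denotes a 1-D pattern that keeps the 2 largest-magnitude values in every group of 4 consecutive weights. The helper names select_pruned_params and mask_2_of_4, and the parameter names in the example, are hypothetical.

```python
import json

# Same keys and values as the example configuration above.
CONFIG_TEXT = """
{
  "pruning": {
    "is_pruning": true,
    "pruning_mask_path": "prune_mask.bin",
    "pruned_module": ["ffn.ffn.w_in.w.weight", "ffn.ffn.w_out.weight", "input_embedding"],
    "mask_method": "m4n2_1d"
  }
}
"""


def select_pruned_params(param_names, suffixes):
    """Return the names that end with any configured suffix (assumed matching rule)."""
    return [name for name in param_names if name.endswith(tuple(suffixes))]


def mask_2_of_4(weights):
    """Build a 0/1 mask keeping the 2 largest magnitudes in each group of 4.

    This is the assumed meaning of "m4n2_1d"; the real algorithm may differ.
    """
    if len(weights) % 4 != 0:
        raise ValueError("length must be a multiple of 4")
    mask = []
    for start in range(0, len(weights), 4):
        group = weights[start:start + 4]
        # Indices of the two largest-magnitude entries within this group.
        keep = sorted(range(4), key=lambda i: abs(group[i]), reverse=True)[:2]
        mask.extend(1 if i in keep else 0 for i in range(4))
    return mask


config = json.loads(CONFIG_TEXT)["pruning"]

# Hypothetical parameter names, for illustration only.
names = [
    "encoder.layers.0.ffn.ffn.w_in.w.weight",
    "encoder.layers.0.ffn.ffn.w_out.weight",
    "encoder.layers.0.self_att.project_q.weight",
]

# Only parameters whose names end with a configured suffix are selected.
selected = select_pruned_params(names, config["pruned_module"]) if config["is_pruning"] else []
print(selected)

# Each group of four weights keeps exactly two entries.
print(mask_2_of_4([0.1, -0.9, 0.5, 0.0, 1.0, 2.0, -3.0, 0.5]))
```

In this sketch the attention projection is left untouched because its name matches none of the suffixes, which is the convenience the suffix list is meant to provide: one entry can cover the same weight in every layer.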