Contents
MLP
A unit-scaled implementation of an MLP layer using SwiGLU.
hidden_size (int) – the hidden dimension size of the input.
expansion_factor (int) – the factor by which the MLP’s intermediate size increases relative to hidden_size.