Rough example representing Kolmogrov-Arnold Network (KAN) as a Block Diagonal Multilayer-Perceptron Network (MLP)
Format from awesome-kan : Awesome KAN(Kolmogorov-Arnold Network) | )
Examples are based on FastKAN : Very Fast Calculation of Kolmogorov-Arnold Networks (KAN) |
I used the Gaussian as the activation function, which I think reduces to a Gaussian Radial Basis Function KAN with variable grid points. See ChebyKAN : Kolmogorov-Arnold Networks (KAN) using Chebyshev polynomials instead of B-splines. |
Example for one layer, 2 inputs, 3 basis curves, and 3 outputs. For each output, you need to stack the block diagonal matrices.
The first linear layer output is then
Apply activation function
Lastly, do