license: bsd-3-clause
A Deep Learning Architecture Utilizing A Global Dynamic Hyperbolic Tangent Function for Token Mixing Operation