Mistral is a small language model originally released in September 2023.
Notable Architectural Features
Sliding Window Attention
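Sliding window attention is a trade-off in how self-attention is applied. Each token attends directly only to a fixed-size window of the tokens nearest to it in the input stream, which lowers memory use because far fewer token pairs need to be scored and stored. Tokens outside the window are not cut off entirely: their information can still reach a distant token indirectly as it is passed along through successive layers, so their influence is reduced rather than removed.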
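
To make the windowing concrete, here is a minimal single-head sketch in plain Python. It is not Mistral's actual implementation: the function names, the causal (look-back only) window, and the tiny window size of 3 are all assumptions chosen for illustration.

```python
import math


def sliding_window_mask(seq_len, window):
    """Return mask[i][j] == True when token i may attend directly to token j.

    Assumption for this sketch: the window is causal, so token i sees
    itself and the (window - 1) tokens immediately before it.
    """
    return [
        [(j <= i) and (i - j < window) for j in range(seq_len)]
        for i in range(seq_len)
    ]


def sliding_window_attention(q, k, v, window):
    """Single-head attention over lists of vectors, restricted to a window.

    q, k, v are lists of equal length; each element is a list of floats.
    Only keys inside the window are ever scored, which is where the
    memory saving comes from.
    """
    d = len(q[0])
    out = []
    for i in range(len(q)):
        start = max(0, i - window + 1)
        positions = range(start, i + 1)
        # Scaled dot-product scores against in-window keys only.
        scores = [
            sum(a * b for a, b in zip(q[i], k[j])) / math.sqrt(d)
            for j in positions
        ]
        # Numerically stable softmax over the window.
        top = max(scores)
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        # Weighted sum of the in-window value vectors.
        out.append([
            sum(e / total * v[j][c] for e, j in zip(exps, positions))
            for c in range(len(v[0]))
        ])
    return out


if __name__ == "__main__":
    # Print the mask for 5 tokens and a window of 3 (1 = may attend).
    for row in sliding_window_mask(5, 3):
        print(" ".join("1" if allowed else "0" for allowed in row))
```

In a single layer like this one, a token more than `window - 1` positions back has no direct effect on the output. The indirect influence described above only appears once several such layers are stacked, since each layer lets information travel up to one further window back.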