OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework
Machine Learning Research at Apple
APRIL 23, 2024
The reproducibility and transparency of large language models are crucial for advancing open research, ensuring the trustworthiness of results, and enabling investigations into data and model biases, as well as potential risks. To this end, we release OpenELM, a state-of-the-art open language model. OpenELM uses a layer-wise scaling strategy to efficiently allocate parameters within each layer of the transformer model, leading to enhanced accuracy.
Let's personalize your content