Meta feels pretty good about the launch of Llama 3
Just one week after Meta Lama 3 was released to developers, the models have been downloaded more than 1.2 million times and developers are sharing hundreds of derived models.
Meta seems quite happy with the launch of Llama 3: Since its release a week ago, the models have been downloaded more than 1.2 million times, according to Meta, and more than 600 model variations have been shared on Hugging Face.
The Llama 3 GitHub repository already has over 17,000 stars. Llama 3 70B Instruct is tied for first place in English language ratings and sixth place overall on the LMSYS Chatbot Arena Leaderboard. This makes it the highest rated open-source model, just behind closed, proprietary models.
The response from the community has been “awesome”, the company writes. Developers have doubled Llama 3’s (tiny) context window and are working on quantization, web navigation, tool usage, low-precision fine-tuning, and local usage. However, one group has shown that Llama 3’s safety can be easily bypassed.
Ad
Ad
Prof. Dr. med. Llama 3
Partners in the chip, hardware and cloud computing industries have begun to adapt Llama 3 for their applications, Meta says.
At the Yale School of Medicine, teams collaborated with the EPFL School of Computer and Communication Sciences to fine-tune Meta Llama 3 within 24 hours of its release and introduced Llama-3[8B]-MeditronV1.0 – the first fine-tuned Llama-3 8B model for medicine. This new model outperforms all available state-of-the-art models in its parameter class on standard tests such as MedQA and MedMCQA.
Meta emphasizes that this is just the beginning for Llama 3. In the coming months, the company plans to release models with new features, including multimodality, the ability to communicate in multiple languages, a longer context window, and improved overall capabilities. A 400 billion GPT-4 Opus competitor is still in training.
With the success of Llama 3, Meta CEO Mark Zuckerberg is getting closer to his goal of making the company’s own AI model the standard for AI applications and reaping financial benefits in the medium to long term, such as lower development and innovation costs.