Skip to main content

Meta unveils Llama 3.1, its biggest and best open source model yet

llama 3.1 logo
Meta

Facebook parent company Meta announced the release of its Llama 3.1 open source large language model on Tuesday. The new LLM will be available in three sizes — 8B, 70B, and 405B parameters — the latter being the largest open-source AI built to date, which Meta CEO Mark Zuckerberg describes as “the first frontier-level open source AI model.”

“Last year, Llama 2 was only comparable to an older generation of models behind the frontier,” Zuckerberg wrote in a blog post Tuesday. “This year, Llama 3 is competitive with the most advanced models and leading in some areas. Starting next year, we expect future Llama models to become the most advanced in the industry.”

llama 3.1-405B benchmarks
Meta

Trained on 15 trillion tokens using 16,000 H100 GPUs, Meta claims that the 405B model is significantly larger than its Llama 3 predecessor. It reportedly rivals today’s top closed source models, such as OpenAI’s GPT-4o, Google’s Gemini 1.5, or Anthropic’s Claude 3.5 in “general knowledge, math, tool use, and multilingual translation. Zuckerberg predicted on Instagram on Tuesday that Meta AI would surpass ChatGPT as the most widely used AI assistant by the end of the year.

Recommended Videos

The company notes that all three versions of Llama 3.1 will enjoy expanded prompt lengths of 128k tokens, enabling users to provide added context and up to a book’s worth of supporting documentation. They’ll also support eight languages at launch. What’s more, Meta has amended its license agreement to allow developers to use Llama 3.1 outputs to train other models.

Meta also announced that it is partnering with more than a dozen other companies in the industry to further develop the Llama ecosystem. Amazon, Databricks, and Nvidia will launch full-service software suites to help developers fine-tune their own models based off Llama, while the startup Groq has “built low-latency, low-cost inference serving” for the new family of 3.1 models, Zuckerberg wrote.

Being open-source, Llama 3.1 will be available on all the major cloud services including AWS, Google Cloud, and Azure.

Andrew Tarantola
Andrew Tarantola is a journalist with more than a decade reporting on emerging technologies ranging from robotics and machine…
How you can try OpenAI’s new o1-preview model for yourself
The openAI o1 logo

Despite months of rumored development, OpenAI's release of its Project Strawberry last week came as something of a surprise, with many analysts believing the model wouldn't be ready for weeks at least, if not later in the fall.

The new o1-preview model, and its o1-mini counterpart, are already available for use and evaluation, here's how to get access for yourself.

Read more
A new definition of ‘open source’ could spell trouble for Big AI
Meta AI can generate images within a chat in about five seconds.

The Open Source Initiative (OSI), self-proclaimed steward of the open source definition, the most widely used standard for open-source software, announced an update to what constitutes an "open source AI" on Thursday. The new wording could now exclude models from industry heavyweights like Meta and Google.

"Open Source has demonstrated that massive benefits accrue to everyone after removing the barriers to learning, using, sharing, and improving software systems," the OSI wrote in a recent blog post. "For AI, society needs the same essential freedoms of Open Source to enable AI developers, deployers, and end users to enjoy those same benefits."

Read more
Meta’s new AI model can turn text into 3D images in under a minute
an array of 3D generated images made by Meta 3D Gen

Meta's latest foray into AI image generation is a quick one. The company introduced its new "3D Gen" model on Tuesday, a "state-of-the-art, fast pipeline" for transforming input text into high-fidelity 3D images that can output them in under a minute.

What's more, the system is reportedly able to apply new textures and skins to both generated and artist-produced images using text prompts.

Read more