Meta quietly releases Llama 2 Lengthy AI mannequin

[ad_1]

VentureBeat presents: AI Unleashed – An unique government occasion for enterprise knowledge leaders. Community and be taught with business friends. Be taught Extra

Meta Platforms confirmed off a bevy of recent AI options for its consumer-facing companies Fb, Instagram and WhatsApp at its annual Meta Join convention in Menlo Park, California, this week.

However the greatest information from Mark Zuckerberg’s firm might have truly come within the type of a pc science paper revealed with out fanfare by Meta researchers on the open entry and non-peer reviewed web site arXiv.org.

The paper introduces Llama 2 Lengthy, a brand new AI mannequin based mostly on Meta’s open supply Llama 2 launched in the summertime, however that has undergone “continuous pretraining from Llama 2 with longer coaching sequences and on a dataset the place lengthy texts are upsampled,” in accordance with the researcher-authors of the paper.

On account of this, Meta’s newly elongated AI mannequin outperforms a number of the main competitors in producing responses to lengthy (larger character rely) consumer prompts, together with OpenAI’s GPT-3.5 Turbo with 16,000-character context window, in addition to Claude 2 with its 100,000-character context window.

Occasion

AI Unleashed

An unique invite-only night of insights and networking, designed for senior enterprise executives overseeing knowledge stacks and methods.

Be taught Extra

Graph of Llama 2 Lengthy outcomes taken from the paper “Efficient Lengthy-Context Scaling of Basis Fashions,” dated September 27, 2023.

[ad_2]

Meta quietly releases Llama 2 Lengthy AI mannequin

Occasion

How LLama 2 Lengthy got here to be

Leave a comment Cancel reply