is the 4k context length of llama2 for real?

actually-a-cat@sh.itjust.works · 1 year ago

is the 4k context length of llama2 for real?

creolestudios@sh.itjust.works · 5 months ago

Yes, the 4k context length of Llama2 is indeed real. Llama2 is a cutting-edge language model developed by OpenAI, and its impressive capability to understand and generate text with such a lengthy context is one of its remarkable features. If you’re interested in leveraging advanced AI models like Llama2 for chatbot development or other applications, you may consider reaching out to an AI chatbot development company for assistance in harnessing this technology effectively.

Sims@lemmy.ml · 1 year ago

No experience, but just adding that long context models have a tendency of ‘forgetting’ whats in the middle of the text. Worth noting if you work on long texts I assume. I can’t remember the paper tho. There’s so many…

flamdragparadiddle@sh.itjust.works · 1 year ago

Lost in the middle: https://arxiv.org/abs/2307.03172

Happens for all models, not just Llama and it is really frustrating to deal with.

Sims@lemmy.ml · 1 year ago

I was unaware that the smaller context models exhibited the same effect. It does seem logical that broad important information and conclusions is naturally put at the ends of a sentence by us. I haven’t read the paper yet, but wonder if the training set - our communication - also contains more information at the ends, so the effect isn’t caused by the algorithm, but by the data. I’ll give the paper a read, thx…