What is the length of the prompt considered by BLOOM?

#236

by akratz - opened Apr 15, 2023

Apr 15, 2023

•

edited Apr 15, 2023

See subject line. How long can a prompt be and still be considered in its entirety? What happens if it exceeds that length: is the beginning cut off?

christopher

BigScience Workshop org Apr 17, 2023

•

edited Apr 19, 2023 by

julien-c

BLOOM was trained with sequences of 2048 tokens, but it uses ALiBi (https://arxiv.org/abs/2108.12409), which biases attention scores by token distance instead of using learned position embeddings, so it can be run on longer sequences.
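To make the ALiBi point concrete, here is a minimal NumPy sketch of the bias described in the linked paper. It is an illustration, not BLOOM's actual implementation: the function names `alibi_slopes` and `alibi_bias` are hypothetical, and the slope formula is assumed to apply only when the number of heads is a power of two (the paper handles other head counts with an extra interpolation step that is omitted here).

```python
import numpy as np

def alibi_slopes(n_heads: int) -> np.ndarray:
    """Per-head slopes: a geometric sequence starting at 2^(-8/n_heads).

    Assumption: n_heads is a power of two. Other head counts need the
    paper's interpolation scheme, which this sketch does not implement.
    """
    if n_heads <= 0 or n_heads & (n_heads - 1):
        raise ValueError("this sketch only supports power-of-two head counts")
    start = 2.0 ** (-8.0 / n_heads)
    return start ** np.arange(1, n_heads + 1)

def alibi_bias(n_heads: int, seq_len: int) -> np.ndarray:
    """Bias added to attention scores, shape (n_heads, seq_len, seq_len).

    Entry [h, i, j] is -slope_h * (i - j) for keys j at or before query i.
    Future positions get 0 here; a causal mask would handle them separately.
    """
    positions = np.arange(seq_len)
    distance = positions[None, :] - positions[:, None]  # j - i
    distance = np.minimum(distance, 0)                  # zero out future keys
    return alibi_slopes(n_heads)[:, None, None] * distance[None, :, :]

# Nothing in the computation is tied to the training length of 2048:
bias = alibi_bias(n_heads=8, seq_len=16)
print(bias.shape)
```

Because the bias depends only on the distance between query and key, the same function works for any `seq_len`. That is why there is no hard cutoff at 2048, although how well output quality holds up beyond the training length remains an empirical question.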

akratz

Apr 17, 2023

•

edited Apr 17, 2023

So what input length does ALiBi allow to be considered? You write that it allows for “longer sequences”, but that only tells me it is more than 2048…

nomadrp

Feb 26, 2024

Any update on this? @cakiki

christopher

BigScience Workshop org Jun 30, 2024

I'd refer you to both the paper I linked to and empirical experimentation to answer that question. In practice, your hardware will likely become the limit before any theoretical one does.
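As a rough illustration of the hardware point, the sketch below estimates how the key/value cache grows with prompt length. The helper `kv_cache_bytes` is hypothetical, and the figures of 70 layers and a hidden size of 14336 are assumed values for the largest BLOOM checkpoint; check the model's `config.json` before relying on them. The estimate covers only cached keys and values in 16-bit precision and ignores model weights, activations, and attention score buffers.

```python
def kv_cache_bytes(seq_len: int, n_layers: int, hidden_size: int,
                   bytes_per_value: int = 2, batch_size: int = 1) -> int:
    """Approximate key/value cache size in bytes.

    Each layer stores one key tensor and one value tensor, each holding
    seq_len * hidden_size values per sequence in the batch.
    """
    return 2 * n_layers * seq_len * hidden_size * bytes_per_value * batch_size

# Assumed configuration (not confirmed in this thread): 70 layers, hidden size 14336.
for seq_len in (2048, 8192, 32768):
    gib = kv_cache_bytes(seq_len, n_layers=70, hidden_size=14336) / 2**30
    print(f"{seq_len:>6} tokens: {gib:.1f} GiB of KV cache")
```

The cache grows linearly with sequence length, so under these assumptions memory runs out well before any positional limit would be reached.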

christopher changed discussion status to closed Jun 30, 2024
