How to use with batch size > 1?

#9
by RylanSchaeffer - opened

The demo code uses a batch size of 1. When I instead pass a list of strings to the tokenizer, I receive this error:

{ValueError}ValueError("Unable to create tensor, you should probably activate truncation and/or padding with 'padding=True' 'truncation=Tr...h. Perhaps your features (`input_ids` in this case) have excessive nesting (inputs type `list` where type `int` is expected).")

How do I fix this?

What is an acceptable padding token to choose?
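For reference, here is the pattern that usually resolves this error, sketched with a hypothetical model id (`"gpt2"`; substitute the model from the demo). The assumption is that this model's tokenizer, like many GPT-style tokenizers, ships without a pad token, in which case reusing the EOS token for padding is a common choice:

```python
from transformers import AutoTokenizer

# Hypothetical model id; substitute the checkpoint used in the demo.
tokenizer = AutoTokenizer.from_pretrained("gpt2")

# Many causal-LM tokenizers define no pad token; reusing EOS is a
# common workaround (assumption: this model behaves the same way).
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token

batch = tokenizer(
    ["first prompt", "a somewhat longer second prompt"],
    padding=True,        # pad shorter sequences to the longest in the batch
    truncation=True,     # truncate anything beyond the model max length
    return_tensors="pt", # ragged lists can't become a tensor without padding
)
print(batch["input_ids"].shape)       # (2, longest_sequence_in_batch)
print(batch["attention_mask"].shape)  # same shape; 0 marks padding positions
```

The original error occurs because without `padding=True` the two prompts tokenize to different lengths, and the ragged list of lists cannot be packed into a single tensor. The attention mask returned alongside `input_ids` tells the model to ignore the padded positions.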
