Why is the "add_bos_token" set to True in tokenizer_config.json?
#17
by
heya5
- opened
Should I keep the bos_token_id in my input_ids for downstream tasks?
Hey @heyaa,
OPT uses a GPT2Tokenizer but prepends every prompt with a BOS TOKEN (e.g. <s> Hello there instead of Hello there)