Discrepency in template

#22
by edmond - opened

Using pipe and the example we get those tokens
[32006, 887, 526, 263, 8444, 319, 29902, 20255, 29889, 32007,
32010, 1815, 366, 3867, 5837, 304, 17545, 18240, 310, 9892,
16397, 322, 8338, 265, 29888, 21211, 29973, 32007, 32001, 18585,
29991, 2266, 526, 777, 5837, 304, 17545, 9892, 16397, 322,
8338, 265, 29888, 21211, 4208, 29901, 29871, 29896, 29889, 10765,
1648, 322, 8338, 265, 29888, 9216, 10597, 347, 29901, 3164,
355, 9892, 16397, 322, 8338, 265, 29888, 21211, 4208, 411,
777, 27274, 322, 298, 4992, 29889, 29871, 29906, 29889, 10765,
1648, 322, 8338, 265, 29888, 9216, 4497, 328, 29901, 23478,
269, 506, 287, 9892, 16397, 322, 8338, 265, 29888, 21211,
4208, 411, 777, 454, 3712, 3623, 625, 322, 298, 4992,
29889, 32007, 32010, 1724, 1048, 17069, 385, 29871, 29906, 29916,
718, 29871, 29941, 353, 29871, 29955, 6306, 29973, 32007, 32001],
which correspond to
'<|system|> You are a helpful AI assistant.<|end|><|user|> Can you provide ways to eat combinations of bananas and dragonfruits?<|end|><|assistant|> Sure! Here are some ways to eat ...'.
This doesn't correspond to the template given in the webpage where \n are present.
What actual template was used for training ? With or without jumping lines ?

Microsoft org

Thanks for your interest in Phi! The actual string the model sees is without the newline.

edmond changed discussion status to closed

Sign up or log in to comment