• Martin Iglesias Goyanes's avatar
    Fixing frequency penalty (#1811) · 9192de57
    Martin Iglesias Goyanes authored
    Thank you so much for the work you are doing, this is my little
    contribution to this great thing you have built. I hope it is useful and
    helpful, please don't hesitate to discuss any matters that are not
    clear!
    
    I am basing my implementation of frequency penalty on OpenAI's
    implementation:
    https://platform.openai.com/docs/guides/text-generation/parameter-details
    
    The problem I see with TGI's current implementation is that is not
    taking into account the frequency of tokens which have already been
    sampled in the current generation stream. Also, the scaling is of the
    adjusted token logits is done differently for positive and negative
    logits. While in OpenAI's implementation token frequency is taking into
    account and the scaling is always done with a subtraction (if penalty is
    positive) or add operation (if penalty is negative).
    
    This leads to corrupt generations as I mentioned in issue #1810 .
    Moreover, after my tests, other issues are also gone like the one...
    9192de57
.gitignore 356 Bytes