1. 01 May, 2024 1 commit
    • Nicolas Patry's avatar
      Adding scripts to prepare load data. (#1841) · 0038e602
      Nicolas Patry authored
      # What does this PR do?
      
      <!--
      Congratulations! You've made it this far! You're not quite done yet
      though.
      
      Once merged, your PR is going to appear in the release notes with the
      title you set, so make sure it's a great title that fully reflects the
      extent of your awesome contribution.
      
      Then, please replace this with a description of the change and which
      issue is fixed (if applicable). Please also include relevant motivation
      and context. List any dependencies (if any) that are required for this
      change.
      
      Once you're done, someone will review your PR shortly (see the section
      "Who can review?" below to tag some potential reviewers). They may
      suggest changes to make the code even better. If no one reviewed your PR
      after a week has passed, don't hesitate to post a new comment
      @-mentioning the same persons---sometimes notifications get lost.
      -->
      
      <!-- Remove if not applicable -->
      
      Fixes # (issue)
      
      
      ## Before submitting
      - [ ] This PR fixes a typo or improves the docs (you can dismiss the
      other checks if that's the case).
      - [ ] Did you read the [contributor
      guideline](https://github.com/huggingface/transformers/blob/main/CONTRIBUTING.md#start-contributing-pull-requests),
            Pull Request section?
      - [ ] Was this discussed/approved via a Github issue or the
      [forum](https://discuss.huggingface.co/)? Please add a link
            to it if that's the case.
      - [ ] Did you make sure to update the documentation with your changes?
      Here are the
      [documentation
      guidelines](https://github.com/huggingface/transformers/tree/main/docs),
      and
      [here are tips on formatting
      docstrings](https://github.com/huggingface/transformers/tree/main/docs#writing-source-documentation).
      - [ ] Did you write any new necessary tests?
      
      
      ## Who can review?
      
      Anyone in the community is free to review the PR once the tests have
      passed. Feel free to tag
      members/contributors who may be interested in your PR.
      
      <!-- Your PR will be replied to more quickly if you can figure out the
      right person to tag with @
      
      
      @OlivierDehaene OR @Narsil
      
       -->
      0038e602
  2. 30 Apr, 2024 1 commit
    • Martin Iglesias Goyanes's avatar
      Fixing frequency penalty (#1811) · 9192de57
      Martin Iglesias Goyanes authored
      Thank you so much for the work you are doing, this is my little
      contribution to this great thing you have built. I hope it is useful and
      helpful, please don't hesitate to discuss any matters that are not
      clear!
      
      I am basing my implementation of frequency penalty on OpenAI's
      implementation:
      https://platform.openai.com/docs/guides/text-generation/parameter-details
      
      The problem I see with TGI's current implementation is that is not
      taking into account the frequency of tokens which have already been
      sampled in the current generation stream. Also, the scaling is of the
      adjusted token logits is done differently for positive and negative
      logits. While in OpenAI's implementation token frequency is taking into
      account and the scaling is always done with a subtraction (if penalty is
      positive) or add operation (if penalty is negative).
      
      This leads to corrupt generations as I mentioned in issue #1810 .
      Moreover, after my tests, other issues are also gone like the one...
      9192de57
  3. 16 Feb, 2024 1 commit
  4. 26 Jan, 2024 1 commit
  5. 08 Jun, 2023 1 commit
  6. 25 Apr, 2023 1 commit
  7. 20 Oct, 2022 1 commit
  8. 11 Oct, 2022 1 commit