1. 09 Dec, 2024 1 commit
  2. 06 Dec, 2024 1 commit
  3. 05 Dec, 2024 1 commit
    • [tokenizer] feat: support tokenizers whose pad_token_id is none (#36) · b4a3d6b9
      * [tokenizer] feat: support tokenizers whose pad_token_id is none
      
      * add test to ci
      
      * install test version
      
      * update ci
      
      * dont use gemma for testing
      
      * dont use gemma for testing
      
      * add proxy
      
      * revert dataset test
      
      * add back tests
      
      * fix format
      
      * fix format
      
      * fix deps
      
      * use git clone instead of https download
      
      * fix path
      
      * revert and use one yaml for gpu instead
      
      * fix path
      
      * cleanup
      
      * limit pyarrow version
      
      * Revert "limit pyarrow version"
      
      This reverts commit b924f79a79088c21636269d11a4ec3095af10c09.
      
      * lfs
      
      * try lfs
      
      * do not clone if exist
      HL committed
  4. 03 Dec, 2024 2 commits
  5. 02 Dec, 2024 2 commits
  6. 01 Dec, 2024 1 commit
  7. 30 Nov, 2024 1 commit
  8. 28 Nov, 2024 1 commit
  9. 27 Nov, 2024 1 commit
  10. 25 Nov, 2024 1 commit
  11. 22 Nov, 2024 1 commit
  12. 21 Nov, 2024 1 commit
  13. 11 Nov, 2024 1 commit
  14. 01 Nov, 2024 2 commits
  15. 31 Oct, 2024 3 commits