-
Notifications
You must be signed in to change notification settings - Fork 226
Issues: dvmazur/mixtral-offloading
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Mixtral Instruct tokenizer from Colab notebook doesn't work.
#38
opened Jul 8, 2024 by
jmuntaner-smd
How to split the model parameter safetensors file into multiple small files
#34
opened Apr 18, 2024 by
YLSnowy
Implementation of benchmarks (C4 perplexity, Wikitext perplexity)
#33
opened Apr 14, 2024 by
ChengSashankh
a strange issue with default parameters " RuntimeError about memory"
#26
opened Mar 24, 2024 by
a1564740774
need mixtral offload for NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO
#19
opened Jan 16, 2024 by
githubpradeep
Enhancing the Efficacy of MoE Offloading with Speculative Prefetching Strategies
#10
opened Jan 2, 2024 by
yihong1120
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.