Can we use any CausalLM model from HuggingFace in 4bit in Spark-NLP? #13864
🙋🏻‍♂️ Hi there!
JSL has an amazing collection of models, but I would like to use instruction-tuned CausalLM models from HuggingFace.
😕 The issue
In our program, we want to use as few resources as possible, so it would be wonderful if this library could load any such model with 4-bit quantization. Could you please help me with this?
Replies: 1 comment
Hi,
Not currently. We only have a TensorFlow engine, and it is not compatible with 4-bit quantization. We also don't have any annotators that accept CausalLM models such as Falcon or MPT.
We are working on adding ONNX to the mix, and after that we can start adding support for CausalLM.