Space: somosnlp-hackathon-2023/PodcastNER-GPTJ

DavidFM43 committed on Apr 10, 2023

Commit e8e65ad, 1 parent: 5fdf6e2

Set half revision from base model weights

Files changed (1): app.py

@@ -10,6 +10,8 @@ model = AutoModelForCausalLM.from_pretrained(
     return_dict=True,
     load_in_8bit=True,
     device_map="auto",
+    revision="half",
+    # low_cpu_mem_usage=True
 )
 tokenizer = AutoTokenizer.from_pretrained(peft_model_id)
 # Load the Lora model
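
For context, here is a minimal sketch of the load call as it might look after this commit. The helper names `build_load_kwargs` and `load_base_model` are hypothetical and do not appear in app.py. The base model id is left as a parameter because the diff does not show it. The sketch also assumes that the base model repository has a branch named `half` holding half-precision weights, as the commit message suggests. `revision` is the standard `from_pretrained` argument for selecting a branch, tag, or commit on the Hub.

```python
def build_load_kwargs(revision="half", use_8bit=True):
    """Collect the keyword arguments the diff passes to from_pretrained.

    Hypothetical helper: app.py passes these arguments inline.
    """
    kwargs = {
        "return_dict": True,
        "load_in_8bit": use_8bit,  # 8-bit loading, as in the original call
        "device_map": "auto",      # let the library place layers on devices
    }
    if revision is not None:
        # Assumption: a "half" branch exists in the base model repo.
        kwargs["revision"] = revision
    return kwargs


def load_base_model(base_model_id, **overrides):
    """Load the base causal LM with the arguments from this commit.

    base_model_id is a placeholder; the diff does not show its value.
    """
    # Imported lazily so the kwargs helper can be used without transformers.
    from transformers import AutoModelForCausalLM

    kwargs = build_load_kwargs()
    kwargs.update(overrides)
    return AutoModelForCausalLM.from_pretrained(base_model_id, **kwargs)
```

Calling `build_load_kwargs(revision=None)` reproduces the arguments from before this commit, which makes it easy to compare the two configurations. The commented-out `low_cpu_mem_usage=True` from the diff could be supplied through `overrides` if it turns out to be needed.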