成人手机视频在线观看,国产一级毛片高清视频完整版,国产91精品不卡在线

!pip install accelerate==0.18.0
!pip install appdirs==1.4.4
!pip install bitsandbytes==0.37.2
!pip install datasets==2.10.1
!pip install fire==0.5.0
!pip install git+https://github.com/huggingface/peft.git
!pip install git+https://github.com/huggingface/transformers.git
!pip install torch==2.0.0
!pip install sentencepiece==0.1.97
!pip install tensorboardX==2.6
!pip install gradio==3.23.0

安裝依賴項(xiàng)后，我們將繼續(xù)導(dǎo)入所有必要的庫并配置 matplotlib 繪圖的各項(xiàng)設(shè)置：

import transformers

import textwrap

from transformers import LlamaTokenizer, LlamaForCausalLM

import os

import sys

from typing import List



from peft import (

    LoraConfig,

    get_peft_model,

    get_peft_model_state_dict,

    prepare_model_for_int8_training,

)



import fire

import torch

from datasets import load_dataset

import pandas as pd



import matplotlib.pyplot as plt

import matplotlib as mpl

import seaborn as sns

from pylab import rcParams



%matplotlib inline

sns.set(rc={'figure.figsize':(10, 7)})

sns.set(rc={'figure.dpi':100})

sns.set(style='white', palette='muted', font_scale=1.2)



DEVICE = "cuda" if torch.cuda.is_available() else "cpu"

DEVICE

數(shù)據(jù)

我們將使用Kaggle上提供的BTC Tweets Sentiment數(shù)據(jù)集，該數(shù)據(jù)集包含約50,000條與比特幣相關(guān)的推文。為了清理數(shù)據(jù)，我刪除了所有以“RT”開頭或包含鏈接的推文?，F(xiàn)在讓我們開始下載數(shù)據(jù)集：

!gdown 1xQ89cpZCnafsW5T3G3ZQWvR7q682t2BN

我們可以使用 Pandas 來加載 CSV：

df = pd.read_csv("bitcoin-sentiment-tweets.csv")

df.head()

	日期	推文內(nèi)容	情感傾向
0	2018 年 3 月 23 日星期五 00：40：40 +0000	@p0nd3ea 比特幣不是為了在交易所生存而創(chuàng)建的。	正面（1）
1	2018 年 3 月 23 日星期五 00：40：40 +0000	@historyinflicks 伙計，如果我患有 Bannon 那種19世紀(jì)的一連串疾病，我也想成為比特幣。	正面（1）
2	2018 年 3 月 23 日星期五 00：40：42 +0000	@eatBCH @Bitcoin @signalapp @myWickr @Samsung @tipprbot耐心確實(shí)是一種美德	負(fù)面（1）
3	2018 年 3 月 23 日星期五 00：41：04 +0000	@aantonop 即使比特幣明天早上就崩盤，它的技術(shù)仍然是革命性的。一種簡化的方式。#我必須成為其中的一部分	負(fù)面（0）
4	2018 年 3 月 23 日星期五 00：41：07 +0000	我正在試驗(yàn)我是否可以只用捐贈的比特幣生活。	正面（1）

我們的數(shù)據(jù)集包含了約1900條推文，這些推文的情緒標(biāo)簽用數(shù)字表示：負(fù)數(shù)情緒為-1，中性情緒為0，積極情緒為1。

讓我們來看看它們的分布情況：

df.sentiment.value_counts()

 0.0    860

 1.0    779

-1.0    258

Name: sentiment, dtype: int64

df.sentiment.value_counts().plot(kind='bar');

負(fù)面情緒的分布相對較低，這一點(diǎn)在評估微調(diào)模型的性能時應(yīng)予以考慮。

構(gòu)建 JSON 數(shù)據(jù)集

原始Alpaca倉庫中的數(shù)據(jù)集格式為一個JSON文件，該文件包含一個對象列表，這些對象具有instruction、input和output字符串屬性。

現(xiàn)在，讓我們將Pandas數(shù)據(jù)幀轉(zhuǎn)換為符合原始Alpaca倉庫格式的JSON文件：

def sentiment_score_to_name(score: float):

    if score > 0:

        return "Positive"

    elif score < 0:

        return "Negative"

    return "Neutral"



dataset_data = [

    {

        "instruction": "Detect the sentiment of the tweet.",

        "input": row_dict["tweet"],

        "output": sentiment_score_to_name(row_dict["sentiment"])

    }

    for row_dict in df.to_dict(orient="records")

]



dataset_data[0]

{

  "instruction": "Detect the sentiment of the tweet.",

  "input": "@p0nd3ea Bitcoin wasn't built to live on exchanges.",

  "output": "Positive"

}

最后，我們將保存生成的 JSON 文件用于訓(xùn)練模型：

import json

with open("alpaca-bitcoin-sentiment-dataset.json", "w") as f:

   json.dump(dataset_data, f)

模型權(quán)重

盡管原始的Llama模型權(quán)重?zé)o法獲取，但它們已被泄露并隨后被適配用于HuggingFace Transformers庫。我們將使用decapoda-research提供的權(quán)重：

BASE_MODEL = "decapoda-research/llama-7b-hf"



model = LlamaForCausalLM.from_pretrained(

    BASE_MODEL,

    load_in_8bit=True,

    torch_dtype=torch.float16,

    device_map="auto",

)



tokenizer = LlamaTokenizer.from_pretrained(BASE_MODEL)



tokenizer.pad_token_id = (

    0  # unk. we want this to be different from the eos token

)

tokenizer.padding_side = "left"

這段代碼使用Hugging Face Transformers庫中的類來加載預(yù)訓(xùn)練的Llama模型。參數(shù)load_in_8bit=True表示以8位量化的方式加載模型，以減少內(nèi)存使用并提高推理速度。

代碼還使用相同的類加載了該Llama模型的分詞器，并為填充令牌設(shè)置了一些附加屬性。具體來說，它將pad_token_id設(shè)置為0來表示未知令牌，并將padding_side設(shè)置為”left”以在左側(cè)填充序列。

數(shù)據(jù)

現(xiàn)在我們已經(jīng)加載了模型和分詞器，接下來可以使用HuggingFace datasets庫中的load_dataset()函數(shù)來加載我們之前保存的JSON文件。

data = load_dataset("json", data_files="alpaca-bitcoin-sentiment-dataset.json")data["train"]

Dataset({

    features: ['instruction', 'input', 'output'],

    num_rows: 1897

})

接下來，我們需要從加載的數(shù)據(jù)集中創(chuàng)建提示并對其進(jìn)行標(biāo)記：

def generate_prompt(data_point):

    return f"""Below is an instruction that describes a task, paired with an input that provides further context. Write a response that appropriately completes the request.  # noqa: E501

### Instruction:

{data_point["instruction"]}

### Input:

{data_point["input"]}

### Response:

{data_point["output"]}"""



def tokenize(prompt, add_eos_token=True):

    result = tokenizer(

        prompt,

        truncation=True,

        max_length=CUTOFF_LEN,

        padding=False,

        return_tensors=None,

    )

    if (

        result["input_ids"][-1] != tokenizer.eos_token_id

        and len(result["input_ids"]) < CUTOFF_LEN

        and add_eos_token

    ):

        result["input_ids"].append(tokenizer.eos_token_id)

        result["attention_mask"].append(1)



    result["labels"] = result["input_ids"].copy()



    return result



def generate_and_tokenize_prompt(data_point):

    full_prompt = generate_prompt(data_point)

    tokenized_full_prompt = tokenize(full_prompt)

    return tokenized_full_prompt

第一個函數(shù)從數(shù)據(jù)集中取出一個數(shù)據(jù)點(diǎn)，通過結(jié)合 INSTRUCTION、INPUT 和 OUTPUT 來生成一個提示。第二個函數(shù)接收生成的提示并使用之前定義的分詞器對其進(jìn)行分詞。它還會在輸入序列中添加一個序列結(jié)束令牌，并將標(biāo)簽設(shè)置為與輸入序列相同。第三個函數(shù)將前兩個函數(shù)結(jié)合，以一步完成提示的生成和分詞。generate_prompt、tokenize和generate_and_tokenize_prompt分別表示這三個函數(shù)。

數(shù)據(jù)準(zhǔn)備的最后一步是將數(shù)據(jù)集拆分為獨(dú)立的訓(xùn)練集和驗(yàn)證集：

train_val = data["train"].train_test_split(

    test_size=200, shuffle=True, seed=42

)

train_data = (

    train_val["train"].map(generate_and_tokenize_prompt)

)

val_data = (

    train_val["test"].map(generate_and_tokenize_prompt)

)

我們需要 200 個驗(yàn)證集樣本，并對數(shù)據(jù)進(jìn)行隨機(jī)排序。對于訓(xùn)練和驗(yàn)證集中的每個樣本，我們都會應(yīng)用一個函數(shù)來生成并標(biāo)記提示（tokenize prompts），該函數(shù)名為generate_and_tokenize_prompt()。

訓(xùn)練

訓(xùn)練過程需要多個參數(shù)，這些參數(shù)大多來源于原始存儲庫中的微調(diào)腳本：

LORA_R = 8

LORA_ALPHA = 16

LORA_DROPOUT= 0.05

LORA_TARGET_MODULES = [

    "q_proj",

    "v_proj",

]



BATCH_SIZE = 128

MICRO_BATCH_SIZE = 4

GRADIENT_ACCUMULATION_STEPS = BATCH_SIZE // MICRO_BATCH_SIZE

LEARNING_RATE = 3e-4

TRAIN_STEPS = 300

OUTPUT_DIR = "experiments"

我們現(xiàn)在可以準(zhǔn)備用于訓(xùn)練的模型：

model = prepare_model_for_int8_training(model)

config = LoraConfig(

    r=LORA_R,

    lora_alpha=LORA_ALPHA,

    target_modules=LORA_TARGET_MODULES,

    lora_dropout=LORA_DROPOUT,

    bias="none",

    task_type="CAUSAL_LM",

)

model = get_peft_model(model, config)

model.print_trainable_parameters()

trainable params: 4194304 || all params: 6742609920 || trainable%: 0.06220594176090199

我們使用LORA算法初始化以準(zhǔn)備模型訓(xùn)練，LORA算法是一種量化形式，可以在不顯著損失精度的情況下減小模型大小和內(nèi)存使用量。

LoraConfig是一個類，用于指定LORA算法的超參數(shù)，如正則化強(qiáng)度（lora_alpha）、丟棄概率（lora_dropout）以及要壓縮的目標(biāo)模塊（target_modules）。

在訓(xùn)練過程中，我們將使用Hugging Face Transformers庫中的Trainer類。

training_arguments = transformers.TrainingArguments(

    per_device_train_batch_size=MICRO_BATCH_SIZE,

    gradient_accumulation_steps=GRADIENT_ACCUMULATION_STEPS,

    warmup_steps=100,

    max_steps=TRAIN_STEPS,

    learning_rate=LEARNING_RATE,

    fp16=True,

    logging_steps=10,

    optim="adamw_torch",

    evaluation_strategy="steps",

    save_strategy="steps",

    eval_steps=50,

    save_steps=50,

    output_dir=OUTPUT_DIR,

    save_total_limit=3,

    load_best_model_at_end=True,

    report_to="tensorboard"

)

這段代碼創(chuàng)建了一個TrainingArguments對象，該對象指定了訓(xùn)練模型時的各種設(shè)置和超參數(shù)。這些設(shè)置包括：

gradient_accumulation_steps：在進(jìn)行反向傳播/更新之前，累積梯度的更新步驟數(shù)。
warmup_steps：優(yōu)化器的預(yù)熱步驟數(shù)。
max_steps：要執(zhí)行的總訓(xùn)練步驟數(shù)。
learning_rate：優(yōu)化器的學(xué)習(xí)率。
fp16：是否使用16位精度進(jìn)行訓(xùn)練。

data_collator = transformers.DataCollatorForSeq2Seq(

    tokenizer, pad_to_multiple_of=8, return_tensors="pt", padding=True

)

DataCollatorForSeq2Seq是 Transformers 庫中的一個類，用于為序列到序列（seq2seq）模型創(chuàng)建輸入/輸出序列的批次。在這段代碼中，我們實(shí)例化了一個對象，并使用了以下參數(shù)：

pad_to_multiple_of：一個整數(shù)，表示最大序列長度，會向上取整到該值的最近倍數(shù)。
padding：一個布爾值，指示是否將序列填充到指定的最大長度。

現(xiàn)在我們已經(jīng)有了所有必要的組件，可以繼續(xù)進(jìn)行模型的訓(xùn)練了。

trainer = transformers.Trainer(

    model=model,

    train_dataset=train_data,

    eval_dataset=val_data,

    args=training_arguments,

    data_collator=data_collator

)

model.config.use_cache = False

old_state_dict = model.state_dict

model.state_dict = (

    lambda self, *_, **__: get_peft_model_state_dict(

        self, old_state_dict()

    )

).__get__(model, type(model))



model = torch.compile(model)



trainer.train()

model.save_pretrained(OUTPUT_DIR)

在實(shí)例化Trainer之后，代碼將模型配置中的use_cache設(shè)置為False，并使用get_peft_model_state_dict()函數(shù)為模型創(chuàng)建一個狀態(tài)字典，該函數(shù)通過使用低精度算術(shù)來準(zhǔn)備模型進(jìn)行訓(xùn)練。

然后，在模型上調(diào)用torch.compile()函數(shù)，該函數(shù)會編譯 model 的計算圖，并使用 PyTorch 2 準(zhǔn)備進(jìn)行訓(xùn)練。

在A100上，訓(xùn)練過程大約持續(xù)了2個小時。讓我們在Tensorboard上查看結(jié)果：

訓(xùn)練損失和評估損失似乎都在穩(wěn)步下降。而且這還只是第一次嘗試！

我們將把訓(xùn)練好的模型上傳到Hugging Face Model Hub，以便輕松復(fù)用：

from huggingface_hub import notebook_login



notebook_login()



model.push_to_hub("curiousily/alpaca-bitcoin-tweets-sentiment", use_auth_token=True)

推斷

我們將首先復(fù)制該存儲庫，然后使用腳本對模型進(jìn)行測試：generate.py

!git clone https://github.com/tloen/alpaca-lora.git

%cd alpaca-lora

!git checkout a48d947

該腳本啟動的Gradio應(yīng)用程序?qū)⑹刮覀兡軌蚶梦覀兡Ｐ偷臋?quán)重：

!python generate.py \

    --load_8bit \

    --base_model 'decapoda-research/llama-7b-hf' \

    --lora_weights 'curiousily/alpaca-bitcoin-tweets-sentiment' \

    --share_gradio

以下是我們的應(yīng)用程序：

讓我們一起來測試模型吧！

結(jié)論

總之，我們已經(jīng)成功使用Alpaca LoRa方法對Llama模型進(jìn)行了微調(diào)，使其能夠檢測比特幣推文中的情緒。在這個過程中，我們借助了Hugging Face的Transformers庫和datasets庫來加載并預(yù)處理數(shù)據(jù)，同時利用Transformers訓(xùn)練器完成了模型的訓(xùn)練。最后，我們將模型部署到了Hugging Face模型庫，并展示了如何在Gradio應(yīng)用程序中使用它。

原文鏈接：https://www.mlexpert.io/blog/alpaca-fine-tuning