91亚洲精品国产自在现线,国产欧美日韩免费不卡,国产成人免费片在线视频观看

import numpy as np
import matplotlib.pyplot as plt
plt.rcParams['font.sans-serif'] = 'SimHei'
plt.rcParams['axes.unicode_minus'] = False
from sklearn.model_selection import train_test_split
from sklearn.ensemble import RandomForestClassifier

# 讀取數據集
df = pd.read_csv('Dataset.csv')

# 劃分特征和目標變量
X = df.drop(['target'], axis=1)
y = df['target']

# 劃分訓練集和測試集
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=42, stratify=df['target'])

# 創建隨機森林分類器實例
rf_classifier = RandomForestClassifier(
n_estimators=100,
criterion='gini',
max_depth=None,
min_samples_split=2,
min_samples_leaf=1,
min_weight_fraction_leaf=0.0,
random_state=42,
max_leaf_nodes=None,
min_impurity_decrease=0.0
)

# 訓練模型
rf_classifier.fit(X_train, y_train)

代碼加載了UCI上的Heart Disease數據集，劃分了訓練集和測試集，并使用隨機森林分類器對訓練數據進行訓練，以構建一個預測心臟病患病風險的分類模型，該模型的目的是通過分析患者的特征來判斷其是否患有心臟病，從而為早期診斷和治療提供支持

模型單樣本解釋

from lime.lime_tabular import LimeTabularExplainer

import warnings

warnings.filterwarnings("ignore", message=".*does not have valid feature names.*")



# 創建 LIME 解釋器對象

explainer = LimeTabularExplainer(

    training_data=X_train.values,

    feature_names=X.columns,

    class_names=['False', 'True'],  # 根據你的具體分類任務調整類名

    mode='classification'

)



# 選擇一個要解釋的測試集樣本（例如，選擇第一個樣本）

sample_index = 0

sample = X_test.iloc[sample_index].values.reshape(1, -1)



# 生成解釋

exp = explainer.explain_instance(

    data_row=sample.flatten(),

    predict_fn=rf_classifier.predict_proba

)



# 顯示解釋結果

exp.show_in_notebook(show_table=True)

使用 LIME 解釋了隨機森林分類器對 UCI Heart Disease 數據集中第一個測試樣本的預測結果，在解釋中，類別設置為 False 表示未患病，True 表示患病，以便更直觀地理解模型的決策過程

結果解讀

根據 LIME 的解釋結果，模型預測該樣本未患病的概率為 96%，患病的概率為 4%。以下是對特征貢獻度的詳細解釋：

對“未患病” (False) 預測結果有正向貢獻的特征：

cp：值為 2.0，低于 3.0，強烈支持“未患病”的預測
ca：值為 0，表明沒有阻塞的主要血管，這也是“未患病”的強支持因素
thal：值為 0.0，支持“未患病”
slope：值為 1.0，屬于較平緩的斜率，支持“未患病”
oldpeak：值為 0.0，表示沒有顯著的 ST 段抑制，支持“未患病”
chol：值為 157.0，處于正常范圍，也支持“未患病”

其余對“未患病” (False) 預測結果有正向貢獻的特征類似解讀，這里不一一給出

對“患病” (True) 預測結果有正向貢獻的特征：

sex：值為 1.0，表示男性，男性通常患心臟病的風險更高，因此略微傾向于“患病”

總體來說，模型認為該樣本中的大多數特征指向“未患病”，這些特征的組合使得模型更傾向于預測該樣本為“未患病”，只有性別這個特征對“患病”有一定的貢獻，但不足以改變整體的預測結果

同樣本shap力圖解釋

import shap

# 構建 shap解釋器

explainer = shap.TreeExplainer(rf_classifier)

# 計算測試集的shap值

shap_values = explainer.shap_values(X_test)



# 繪制單個樣本的SHAP解釋（Force Plot）

sample_index = 0  # 選擇一個樣本索引進行解釋

shap.force_plot(explainer.expected_value[0], shap_values[:, :, 0][sample_index], X_test.iloc[sample_index], matplotlib=True)

shap.force_plot(explainer.expected_value[1], shap_values[:, :, 1][sample_index], X_test.iloc[sample_index], matplotlib=True)