Auditing Model Bias with Balanced Datasets with Mimesis

KDnuggets

KDnuggets2026年5月25日

Auditing Model Bias with Balanced Datasets with Mimesis

8.5Score

TL;DR · AI Summary

通过使用 Mimesis 库生成平衡数据集，可以有效审计机器学习模型中的性别偏见。

Key Takeaways

Mimesis 库可以帮助生成平衡的数据集以测试模型偏见。
在贷款审批模型中，男性比女性更容易获得批准。
通过创建具有相同财务背景但不同性别的测试用户，可以检测模型的性别歧视。

Outline

Jump quickly between sections.

§引言
介绍机器学习模型可能存在的偏见问题及其审计的重要性。
·Mimesis 库简介
介绍 Mimesis 库的功能及其在生成平衡数据集中的应用。
·数据集生成
描述如何使用 Mimesis 生成包含性别和收入特征的不平衡数据集。
·模型训练
展示如何训练一个简单的决策树分类器来处理生成的数据集。
·反事实数据生成
说明如何使用 Mimesis 创建反事实数据集以测试模型偏见。
·模型审计
解释如何通过比较反事实数据集的结果来检测模型的性别偏见。

Mindmap

See how the topics connect at a glance.

查看大纲文本（无障碍 / 无 JS 友好）

模型偏见审计

Highlights

Key sentences worth saving and sharing.

Mimesis 是一个开源库，可以帮助生成平衡的、反事实的数据集。
— 第 2 段
⬇︎ 下载 PNG 𝕏 分享到 X
在生成的数据集中，男性比女性更容易获得贷款批准。
— 第 4 段
⬇︎ 下载 PNG 𝕏 分享到 X
通过创建具有相同财务背景但不同性别的测试用户，可以检测模型的性别歧视。
— 第 6 段
⬇︎ 下载 PNG 𝕏 分享到 X

#Mimesis#机器学习#模型审计#性别偏见#数据集

Open original article

Auditing Model Bias with Balanced Datasets with Mimesis - KDnuggets

Image 1: Auditing Model Bias with Balanced Datasets with Mimesis

Introduction

Whether they are well-established classifiers or state-of-the-art massive models like large language models (LLMs), building machine learning solutions often involves a risk: algorithms might silently adopt biases present in the historical training datasets they were trained on. However, in high-stakes scenarios or when dealing with sensitive data, how can we audit whether a model is biased without compromising real-world information?

This hands-on article guides you through training a simple classification model for "loan approval" on biased data. Using [Mimesis](https://mimesis.name/en/master/), an open-source library that helps generate a perfectly balanced, _counterfactual_ dataset, you'll be able to test "fake" users with identical financial backgrounds but different demographic characteristics, thereby determining whether the model discriminates against certain groups.

Step-by-Step Guide

Start by installing the Mimesis library if you are new to using it, or if you are working in a cloud notebook environment like Colab:

pip install mimesis

Before auditing a model, we need to have one! In this example, we will synthetically generate a dataset of 1,000 bank customers with just two features: gender and income. These features are categorical and numerical, respectively. The data creation will be intentionally manipulated so that the gender attribute unfairly influences the binary outcome: loan approval. Specifically, for labeling the dataset, we will consider a scenario in which men are generally approved, whereas women are only approved when they have remarkably high income.

The process to create this clearly biased dataset and train a decision tree classifier on it is shown below:

code

import pandas as pd
import numpy as np
from sklearn.tree import DecisionTreeClassifier

# 1. Simulating biased historical data (1000 instances)
np.random.seed(42)
n_train = 1000
genders = np.random.choice(['Male', 'Female'], n_train)
incomes = np.random.randint(30000, 120000, n_train)

approvals = []
for gender, income in zip(genders, incomes):
    if gender == 'Male':
        # Historically, males are approved
        approvals.append(1)
    else:
        # Only females with high income are approved
        approvals.append(1 if income > 80000 else 0)

train_df = pd.DataFrame({'Gender': genders, 'Income': incomes, 'Approved': approvals})

# Converting categories to numbers for the machine learning model
train_df['Gender_Code'] = train_df['Gender'].map({'Male': 1, 'Female': 0})

# 2. Training a Decision Tree classifier
model = DecisionTreeClassifier(max_depth=3)
model.fit(train_df[['Gender_Code', 'Income']], train_df['Approved'])

The next step demonstrates Mimesis in action. We will use this library to generate a small set of test subjects using the Generic class. This will be done by defining three base financial profiles that contain random UUIDs (universally unique identifiers) and a moderate income ranging between 40K and 70K. Note that these profiles do not yet include gender information:

code

from mimesis import Generic

generic = Generic('en')

# Generating 3 base financial profiles
base_profiles = []
for _ in range(3):
    profile = {
        'Applicant_ID': generic.cryptographic.uuid(),
        'Income': generic.random.randint(40000, 70000) # Moderate income
    }
    base_profiles.append(profile)

For example, the three newly created profiles may look something like:

code

[{'Applicant_ID': '1f1721e1-19af-4bd1-8488-6abf01404ef9', 'Income': 44815},
 {'Applicant_ID': '5c862597-7f55-43f4-9d6e-ac9cc0b9083e', 'Income': 47436},
 {'Applicant_ID': '3479d4cf-0d9b-4f06-9c43-1c3b7e787830', 'Income': 58194}]

Let's finish building our counterfactual set of examples, which forms the core of our auditing process! For each of the three base profiles, we will create two cloned counterfactual instances: one male and one female. For each pair of test customers, their application ID and income will be identical, with the only difference being gender: any difference in how our trained decision tree model treats them will undoubtedly be evidence of gender bias.

code

counterfactual_data = []

for profile in base_profiles:
    # Version A: Male Counterfactual
    counterfactual_data.append({
        'Applicant_ID': profile['Applicant_ID'], 
        'Gender': 'Male', 
        'Gender_Code': 1, 
        'Income': profile['Income']
    })
    
    # Version B: Female Counterfactual
    counterfactual_data.append({
        'Applicant_ID': profile['Applicant_ID'], 
        'Gender': 'Female', 
        'Gender_Code': 0, 
        'Income': profile['Income']
    })

audit_df = pd.DataFrame(counterfactual_data)

This is what the three pairs of customers may look like:

code

1f1721e1-19af-4bd1-8488-6abf01404ef9	Male	1	44815
1	1f1721e1-19af-4bd1-8488-6abf01404ef9	Female	0	44815
2	5c862597-7f55-43f4-9d6e-ac9cc0b9083e	Male	1	47436
3	5c862597-7f55-43f4-9d6e-ac9cc0b9083e	Female	0	47436
4	3479d4cf-0d9b-4f06-9c43-1c3b7e787830	Male	1	58194
5	3479d4cf-0d9b-4f06-9c43-1c3b7e787830	Female	0	58194

A key point to emphasize here: we have just used Mimesis to instantly create perfectly matched "clones" of loan applicants with identical income but different genders. This highlights the library's value in providing total statistical control, isolating a protected attribute.

Now it's time to probe the model and see what it reveals.

Asking the model to predict approval for our counterfactuals

audit_df['Predicted_Approval'] = model.predict(audit_df[['Gender_Code', 'Income']])

Formatting the output for readability (1 = Approved, 0 = Denied)

audit_df['Predicted_Approval'] = audit_df['Predicted_Approval'].map({1: 'Approved', 0: 'Denied'})

print("\n--- Model Audit Results ---") print(audit_df[['Applicant_ID', 'Gender', 'Income', 'Predicted_Approval']].sort_values('Applicant_ID'))

The decision-making results yielded by our model could not be clearer:

code

--- Model Audit Results ---
                           Applicant_ID  Gender  Income Predicted_Approval
0  1f1721e1-19af-4bd1-8488-6abf01404ef9    Male   44815           Approved
1  1f1721e1-19af-4bd1-8488-6abf01404ef9  Female   44815             Denied
4  3479d4cf-0d9b-4f06-9c43-1c3b7e787830    Male   58194           Approved
5  3479d4cf-0d9b-4f06-9c43-1c3b7e787830  Female   58194             Denied
2  5c862597-7f55-43f4-9d6e-ac9cc0b9083e    Male   47436           Approved
3  5c862597-7f55-43f4-9d6e-ac9cc0b9083e  Female   47436             Denied

Notice that for the exact same Applicant_ID and Income, male clones are approved for the loan. Meanwhile, female clones with such moderate income are generally denied. The Mimesis functionalities we used based on profiles helped us hold all other variables constant, thereby successfully isolating and exposing the model's discriminatory decision-making.

#Wrapping Up

Throughout this hands-on article, we have shown how Mimesis can be used to generate balanced, counterfactual data examples — without privacy or sensitive data constraints — that can help audit a model's behavior and identify whether the model is behaving in a biased manner or not. Next steps to take if your model is biased may include:

Augmenting your training data with more balanced profiles to correct historical skewness or bias.
Depending on the model type, using model re-weighting strategies.
Utilizing open-source toolkits for fairness — for instance, [AI Fairness 360](https://ai-fairness-360.org/) — which are helpful for bias mitigation in machine learning pipelines.

[](https://www.linkedin.com/in/ivanpc/)**[Iván Palomares Carrascosa](https://www.linkedin.com/in/ivanpc/)** is a leader, writer, speaker, and adviser in AI, machine learning, deep learning & LLMs. He trains and guides others in harnessing AI in the real world.