用MobileNetV1复现FaceNet人脸识别：从PyTorch代码到CASIA-WebFace数据集实战

张

张建站

2026/6/2 16:00:05

10分钟阅读

用MobileNetV1复现FaceNet人脸识别：从PyTorch代码到CASIA-WebFace数据集实战

基于MobileNetV1的FaceNet人脸识别实战从零构建PyTorch模型到CASIA-WebFace训练全流程人脸识别技术正逐渐渗透到日常生活的各个角落从手机解锁到门禁系统其核心在于如何将人脸图像转化为具有判别性的特征向量。本文将带你完整实现一个基于MobileNetV1的FaceNet模型使用PyTorch框架和CASIA-WebFace数据集从环境搭建到模型训练逐步解析每个关键环节。1. 环境准备与数据预处理1.1 基础环境配置首先需要准备Python 3.8环境和必要的库。推荐使用conda创建虚拟环境conda create -n facenet python3.8 conda activate facenet pip install torch torchvision torchaudio pip install opencv-python pillow matplotlib tqdm对于GPU加速建议安装对应CUDA版本的PyTorch。可以通过以下命令检查GPU是否可用import torch print(torch.cuda.is_available()) # 应输出True print(torch.__version__) # 建议1.8.01.2 CASIA-WebFace数据集处理CASIA-WebFace包含10,575个身份的494,414张人脸图像。下载后需要按以下步骤预处理目录结构调整确保每个身份对应一个独立文件夹人脸检测与对齐使用MTCNN进行人脸检测和关键点对齐from facenet_pytorch import MTCNN import cv2 import os mtcnn MTCNN(keep_allTrue) def align_face(image_path, save_path): img cv2.imread(image_path) img_rgb cv2.cvtColor(img, cv2.COLOR_BGR2RGB) boxes, probs mtcnn.detect(img_rgb) if boxes is not None: box boxes[0] cv2.rectangle(img, (int(box[0]), int(box[1])), (int(box[2]), int(box[3])), (0,255,0), 2) cv2.imwrite(save_path, img)注意对齐后的人脸图像建议统一调整为160×160像素这是MobileNetV1的标准输入尺寸2. MobileNetV1骨干网络实现2.1 深度可分离卷积实现MobileNetV1的核心是深度可分离卷积它将标准卷积分解为深度卷积和点卷积import torch.nn as nn class DepthwiseSeparableConv(nn.Module): def __init__(self, in_channels, out_channels, stride): super().__init__() self.depthwise nn.Sequential( nn.Conv2d(in_channels, in_channels, 3, stride, 1, groupsin_channels, biasFalse), nn.BatchNorm2d(in_channels), nn.ReLU6(inplaceTrue) ) self.pointwise nn.Sequential( nn.Conv2d(in_channels, out_channels, 1, 1, 0, biasFalse), nn.BatchNorm2d(out_channels), nn.ReLU6(inplaceTrue) ) def forward(self, x): x self.depthwise(x) x self.pointwise(x) return x2.2 完整MobileNetV1架构基于上述模块构建完整的MobileNetV1class MobileNetV1(nn.Module): def __init__(self): super().__init__() self.features nn.Sequential( nn.Conv2d(3, 32, 3, 2, 1, biasFalse), nn.BatchNorm2d(32), nn.ReLU6(inplaceTrue), DepthwiseSeparableConv(32, 64, 1), DepthwiseSeparableConv(64, 128, 2), DepthwiseSeparableConv(128, 128, 1), DepthwiseSeparableConv(128, 256, 2), DepthwiseSeparableConv(256, 256, 1), DepthwiseSeparableConv(256, 512, 2), # 重复6次512通道的深度可分离卷积 *[DepthwiseSeparableConv(512, 512, 1) for _ in range(6)], DepthwiseSeparableConv(512, 1024, 2), DepthwiseSeparableConv(1024, 1024, 1) ) self.avg_pool nn.AdaptiveAvgPool2d((1,1)) def forward(self, x): x self.features(x) x self.avg_pool(x) return x.view(x.size(0), -1)提示MobileNetV1相比原始FaceNet使用的Inception-ResNet体积缩小约10倍适合移动端部署3. FaceNet模型构建与损失函数3.1 特征嵌入层实现在MobileNetV1基础上添加嵌入层和L2标准化class FaceNet(nn.Module): def __init__(self, embedding_size128, num_classesNone): super().__init__() self.backbone MobileNetV1() self.bottleneck nn.Linear(1024, embedding_size, biasFalse) self.bn nn.BatchNorm1d(embedding_size, eps0.001, momentum0.1) if num_classes is not None: self.classifier nn.Linear(embedding_size, num_classes) def forward(self, x): x self.backbone(x) x self.bottleneck(x) x self.bn(x) x nn.functional.normalize(x, p2, dim1) return x3.2 Triplet Loss与交叉熵组合实现三元组损失并配合交叉熵辅助训练class TripletLoss(nn.Module): def __init__(self, margin0.2): super().__init__() self.margin margin def forward(self, anchor, positive, negative): pos_dist torch.sum((anchor - positive)**2, dim1) neg_dist torch.sum((anchor - negative)**2, dim1) losses torch.relu(pos_dist - neg_dist self.margin) return losses.mean() class CombinedLoss(nn.Module): def __init__(self, triplet_weight1.0, ce_weight0.5): super().__init__() self.triplet TripletLoss() self.ce nn.CrossEntropyLoss() self.triplet_weight triplet_weight self.ce_weight ce_weight def forward(self, anchor, positive, negative, logits, labels): triplet_loss self.triplet(anchor, positive, negative) ce_loss self.ce(logits, labels) return self.triplet_weight*triplet_loss self.ce_weight*ce_loss4. 训练流程与调优技巧4.1 数据加载与三元组采样高效的三元组采样对训练至关重要from torch.utils.data import Dataset import random class TripletDataset(Dataset): def __init__(self, root_dir, transformNone): self.classes os.listdir(root_dir) self.class_to_idx {c:i for i,c in enumerate(self.classes)} self.samples [] for c in self.classes: class_dir os.path.join(root_dir, c) for img_name in os.listdir(class_dir): self.samples.append((os.path.join(class_dir, img_name), self.class_to_idx[c])) def __getitem__(self, index): anchor_path, anchor_label self.samples[index] # 随机选择正样本 positive_samples [s for s in self.samples if s[1]anchor_label] positive_path, _ random.choice(positive_samples) # 随机选择负样本 negative_samples [s for s in self.samples if s[1]!anchor_label] negative_path, _ random.choice(negative_samples) anchor load_and_transform(anchor_path) positive load_and_transform(positive_path) negative load_and_transform(negative_path) return anchor, positive, negative, anchor_label def __len__(self): return len(self.samples)4.2 训练循环实现完整的训练流程包含以下关键步骤学习率调度使用余弦退火策略模型保存基于验证集准确率保存最佳模型混合精度训练加速训练过程from torch.optim import Adam from torch.optim.lr_scheduler import CosineAnnealingLR def train_model(model, train_loader, val_loader, epochs50): device torch.device(cuda if torch.cuda.is_available() else cpu) model model.to(device) optimizer Adam(model.parameters(), lr0.001) scheduler CosineAnnealingLR(optimizer, T_maxepochs) criterion CombinedLoss() best_acc 0.0 for epoch in range(epochs): model.train() for batch in train_loader: anchor, pos, neg, labels [x.to(device) for x in batch] optimizer.zero_grad() anchor_emb model(anchor) pos_emb model(pos) neg_emb model(neg) logits model.classifier(anchor_emb) loss criterion(anchor_emb, pos_emb, neg_emb, logits, labels) loss.backward() optimizer.step() scheduler.step() # 验证集评估 val_acc evaluate(model, val_loader) if val_acc best_acc: best_acc val_acc torch.save(model.state_dict(), best_model.pth) print(fEpoch {epoch1}/{epochs} | Loss: {loss.item():.4f} | Val Acc: {val_acc:.2f}%)4.3 常见问题与解决方案在实际训练中可能会遇到以下典型问题问题现象可能原因解决方案损失不下降学习率过大/过小尝试0.0001-0.01范围内的学习率准确率波动大批次内三元组质量差增加批次大小或使用在线困难样本挖掘过拟合明显模型容量过大增加Dropout或减少嵌入维度训练速度慢未使用GPU加速检查CUDA配置或减小输入尺寸提示对于小型数据集可以冻结MobileNetV1的部分层只训练顶层参数5. 模型评估与应用部署5.1 人脸验证评估在LFW数据集上评估模型性能from sklearn.metrics import roc_curve, auc def evaluate_verification(model, pairs_path): model.eval() distances, labels [], [] with open(pairs_path) as f: for line in f: img1, img2, same line.strip().split() emb1 model(load_image(img1).unsqueeze(0)) emb2 model(load_image(img2).unsqueeze(0)) dist torch.norm(emb1 - emb2, p2).item() distances.append(dist) labels.append(int(same)) fpr, tpr, _ roc_curve(labels, distances) roc_auc auc(fpr, tpr) return roc_auc5.2 实时人脸识别部署使用OpenCV实现实时人脸识别import cv2 from PIL import Image def live_recognition(model, threshold0.7): cap cv2.VideoCapture(0) mtcnn MTCNN() known_faces {} # 预存已知人脸特征 while True: ret, frame cap.read() if not ret: break frame_rgb cv2.cvtColor(frame, cv2.COLOR_BGR2RGB) boxes, _ mtcnn.detect(frame_rgb) if boxes is not None: for box in boxes: x1, y1, x2, y2 map(int, box) face frame_rgb[y1:y2, x1:x2] face Image.fromarray(face).resize((160,160)) emb model(transform(face).unsqueeze(0)) # 计算与已知人脸的相似度 match False for name, known_emb in known_faces.items(): dist torch.norm(emb - known_emb, p2).item() if dist threshold: cv2.putText(frame, name, (x1,y1-10), cv2.FONT_HERSHEY_SIMPLEX, 0.9, (0,255,0), 2) match True break if not match: cv2.putText(frame, Unknown, (x1,y1-10), cv2.FONT_HERSHEY_SIMPLEX, 0.9, (0,0,255), 2) cv2.rectangle(frame, (x1,y1), (x2,y2), (0,255,0), 2) cv2.imshow(Face Recognition, frame) if cv2.waitKey(1) 0xFF ord(q): break cap.release() cv2.destroyAllWindows()5.3 模型量化与优化为移动端部署进行模型量化model FaceNet().eval() quantized_model torch.quantization.quantize_dynamic( model, {nn.Linear}, dtypetorch.qint8 ) torch.jit.save(torch.jit.script(quantized_model), facenet_quantized.pt)经过量化后模型大小可缩减至原来的1/4左右推理速度提升2-3倍而准确率损失通常不超过2%。在实际项目中这种平衡往往是可以接受的。

如何将PS4/PS5手柄打造成PC游戏万能控制器：DS4Windows深度配置指南

如何将PS4/PS5手柄打造成PC游戏万能控制器：DS4Windows深度配置指南【免费下载链接】DS4Windows Like those other ds4tools, but sexier 项目地址: https://gitcode.com/gh_mirrors/ds/DS4Windows 当你在PC上使用PlayStation控制器时，是否遇到过…...

2026/6/2 15:58:01 阅读更多 →

从一次授权测试复盘：我是如何利用参数污染漏洞绕过限制拿到管理员权限的

一次授权渗透测试实战：参数污染漏洞的思维突破与权限提升那天下午三点二十七分，客户系统的登录界面在Burp Suite的Proxy历史记录中闪烁着可疑的API调用。作为安全工程师的第六感告诉我，这个返回302状态码的/api/v1/identity/verify端点背后&a…...

2026/6/2 15:53:31 阅读更多 →

别再只跑官方例程了！OpenMV人脸识别项目深度优化指南：提升识别率与响应速度

OpenMV人脸识别项目深度优化实战：从数据采集到系统调优的全方位指南当你第一次在OpenMV上成功运行人脸识别例程时，那种兴奋感可能很快就会被现实问题冲淡——识别率不稳定、响应速度慢、容易被照片欺骗。这些痛点恰恰是官方例程没有深入探讨的领域。本文…...

2026/6/2 15:52:52 阅读更多 →

Windows防撤回终极指南：如何永久保存微信QQ撤回消息

Windows防撤回终极指南：如何永久保存微信QQ撤回消息【免费下载链接】RevokeMsgPatcher :trollface: A hex editor for WeChat/QQ/TIM - PC版微信/QQ/TIM防撤回补丁（我已经看到了，撤回也没用了） 项目地址: https://gitcode.com/…...

2026/6/2 8:59:57 阅读更多 →

终极视频下载解决方案：VideoDownloadHelper 完全指南

终极视频下载解决方案：VideoDownloadHelper 完全指南【免费下载链接】VideoDownloadHelper Chrome Extension to Help Download Video for Some Video Sites. 项目地址: https://gitcode.com/gh_mirrors/vi/VideoDownloadHelper 还在为无法保存网络上的精彩…...

2026/6/1 16:51:08 阅读更多 →

小微企业合作网络与成长预测解析方案【附代码】

✨ 长期致力于小微企业、合作网络、网络结构、企业成长、成长预测研究工作，擅长数据搜集与处理、建模仿真、程序编写、仿真设计。 ✅ 专业定制毕设、代码 ✅ 如需沟通交流，点击《获取方式》 （1）基于提名生成法的合作网络构建与结构…...

2026/6/1 16:51:08 阅读更多 →

终极键盘映射工具：如何免费解决游戏按键冲突问题

终极键盘映射工具：如何免费解决游戏按键冲突问题【免费下载链接】socd Key remapper for epic gamers 项目地址: https://gitcode.com/gh_mirrors/so/socd 你是否曾在激烈的游戏中因为同时按下左右方向键而让角色卡顿不前？是否在关键时刻因为按键…...

2026/6/1 5:51:17 阅读更多 →