用CycleGAN实现马变斑马:5分钟搞定无配对数据风格迁移(附PyTorch代码)

想象一下:你手头有一批马的图片,但需要它们变成斑马的样子——不是简单的滤镜处理,而是保留马的姿态、背景,只改变纹理和颜色特征。传统方法需要成对的马和斑马图片(同一匹马在不同场景下的两种形态),但现实中这种数据几乎不存在。这就是CycleGAN的用武之地:它能在没有配对数据的情况下,实现两个图像域之间的高质量转换。

## 1. 环境准备与数据加载

### 1.1 安装必要依赖

推荐使用Python 3.8和PyTorch 1.10环境。以下命令可快速安装所需库:

```bash
pip install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu113
pip install matplotlib opencv-python tqdm
```

### 1.2 准备数据集

我们从Kaggle获取两个公开数据集:马图像数据集(包含1067张图片)、斑马图像数据集(包含1334张图片)。

```python
from torchvision.datasets import ImageFolder
from torchvision import transforms

transform = transforms.Compose([
    transforms.Resize(256),
    transforms.CenterCrop(256),
    transforms.ToTensor(),
    transforms.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5))
])

horse_dataset = ImageFolder("horse2zebra/trainA", transform=transform)
zebra_dataset = ImageFolder("horse2zebra/trainB", transform=transform)
```

提示:数据集无需成对匹配,但建议两个域的图片数量相近,且内容类型相似(如都是动物全身照)。

## 2. 模型架构实现

### 2.1 生成器设计

CycleGAN采用改进的U-Net结构,包含下采样、残差块和上采样三部分:

```python
import torch.nn as nn

class ResidualBlock(nn.Module):
    def __init__(self, in_features):
        super().__init__()
        self.block = nn.Sequential(
            nn.ReflectionPad2d(1),
            nn.Conv2d(in_features, in_features, 3),
            nn.InstanceNorm2d(in_features),
            nn.ReLU(inplace=True),
            nn.ReflectionPad2d(1),
            nn.Conv2d(in_features, in_features, 3),
            nn.InstanceNorm2d(in_features)
        )

    def forward(self, x):
        return x + self.block(x)


class Generator(nn.Module):
    def __init__(self, input_nc=3, output_nc=3, n_residual_blocks=9):
        super().__init__()
        # 初始卷积块
        model = [
            nn.ReflectionPad2d(3),
            nn.Conv2d(input_nc, 64, 7),
            nn.InstanceNorm2d(64),
            nn.ReLU(inplace=True)
        ]
        # 下采样
        in_features = 64
        out_features = in_features * 2
        for _ in range(2):
            model += [
                nn.Conv2d(in_features, out_features, 3, stride=2, padding=1),
                nn.InstanceNorm2d(out_features),
                nn.ReLU(inplace=True)
            ]
            in_features = out_features
            out_features = in_features * 2
        # 残差块
        for _ in range(n_residual_blocks):
            model += [ResidualBlock(in_features)]
        # 上采样
        out_features = in_features // 2
        for _ in range(2):
            model += [
                nn.ConvTranspose2d(in_features, out_features, 3, stride=2, padding=1, output_padding=1),
                nn.InstanceNorm2d(out_features),
                nn.ReLU(inplace=True)
            ]
            in_features = out_features
            out_features = in_features // 2
        # 输出层
        model += [
            nn.ReflectionPad2d(3),
            nn.Conv2d(64, output_nc, 7),
            nn.Tanh()
        ]
        self.model = nn.Sequential(*model)

    def forward(self, x):
        return self.model(x)
```

### 2.2 判别器设计

使用PatchGAN结构,对图像的局部区域进行真伪判断:

```python
class Discriminator(nn.Module):
    def __init__(self, input_nc=3):
        super().__init__()

        def discriminator_block(in_filters, out_filters, normalize=True):
            layers = [nn.Conv2d(in_filters, out_filters, 4, stride=2, padding=1)]
            if normalize:
                layers.append(nn.InstanceNorm2d(out_filters))
            layers.append(nn.LeakyReLU(0.2, inplace=True))
            return layers

        self.model = nn.Sequential(
            *discriminator_block(input_nc, 64, normalize=False),
            *discriminator_block(64, 128),
            *discriminator_block(128, 256),
            *discriminator_block(256, 512),
            nn.ZeroPad2d((1, 0, 1, 0)),
            nn.Conv2d(512, 1, 4, padding=1)
        )

    def forward(self, img):
        return self.model(img)
```

## 3. 训练策略与损失函数

### 3.1 对抗损失

标准的GAN损失函数,用于让生成图像逼近目标域分布:

```python
criterion_GAN = torch.nn.MSELoss()

# 计算生成器对抗损失
def compute_generator_loss(fake_pred):
    return criterion_GAN(fake_pred, torch.ones_like(fake_pred))

# 计算判别器损失
def compute_discriminator_loss(real_pred, fake_pred):
    real_loss = criterion_GAN(real_pred, torch.ones_like(real_pred))
    fake_loss = criterion_GAN(fake_pred, torch.zeros_like(fake_pred))
    return (real_loss + fake_loss) * 0.5
```

### 3.2 循环一致性损失

确保转换后的图像能还原回原始图像:

```python
criterion_cycle = torch.nn.L1Loss()

def compute_cycle_loss(real_img, cycled_img):
    return criterion_cycle(cycled_img, real_img) * 10.0  # λ=10
```

### 3.3 身份损失

保持输入图像的颜色分布:

```python
criterion_identity = torch.nn.L1Loss()

def compute_identity_loss(input_img, identity_img):
    return criterion_identity(identity_img, input_img) * 0.5  # λ=0.5
```

## 4.
完整训练流程

### 4.1 初始化模型与优化器

```python
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

G_A2B = Generator().to(device)      # 马→斑马
G_B2A = Generator().to(device)      # 斑马→马
D_A = Discriminator().to(device)    # 判别马
D_B = Discriminator().to(device)    # 判别斑马

optimizer_G = torch.optim.Adam(
    list(G_A2B.parameters()) + list(G_B2A.parameters()),
    lr=0.0002, betas=(0.5, 0.999)
)
optimizer_D = torch.optim.Adam(
    list(D_A.parameters()) + list(D_B.parameters()),
    lr=0.0002, betas=(0.5, 0.999)
)
```

### 4.2 训练循环关键代码

```python
for epoch in range(200):
    for i, (real_A, real_B) in enumerate(zip(horse_loader, zebra_loader)):
        # 前向传播
        fake_B = G_A2B(real_A)
        cycled_A = G_B2A(fake_B)
        fake_A = G_B2A(real_B)
        cycled_B = G_A2B(fake_A)

        # 身份映射
        identity_A = G_B2A(real_A)
        identity_B = G_A2B(real_B)

        # 判别器输出
        pred_real_A = D_A(real_A)
        pred_fake_A = D_A(fake_A.detach())
        pred_real_B = D_B(real_B)
        pred_fake_B = D_B(fake_B.detach())

        # 生成器损失
        loss_G_A2B = compute_generator_loss(D_B(fake_B))
        loss_G_B2A = compute_generator_loss(D_A(fake_A))
        loss_cycle_A = compute_cycle_loss(real_A, cycled_A)
        loss_cycle_B = compute_cycle_loss(real_B, cycled_B)
        loss_id_A = compute_identity_loss(real_A, identity_A)
        loss_id_B = compute_identity_loss(real_B, identity_B)
        loss_G = (loss_G_A2B + loss_G_B2A + loss_cycle_A + loss_cycle_B + loss_id_A + loss_id_B)

        # 判别器损失
        loss_D_A = compute_discriminator_loss(pred_real_A, pred_fake_A)
        loss_D_B = compute_discriminator_loss(pred_real_B, pred_fake_B)
        loss_D = (loss_D_A + loss_D_B) * 0.5

        # 反向传播
        optimizer_G.zero_grad()
        loss_G.backward()
        optimizer_G.step()

        optimizer_D.zero_grad()
        loss_D.backward()
        optimizer_D.step()
```

### 4.3 训练技巧与参数设置

| 参数 | 推荐值 | 作用说明 |
| --- | --- | --- |
| 学习率 | 0.0002 | Adam优化器初始学习率 |
| β1 | 0.5 | Adam动量参数 |
| Batch Size | 1-4 | 受限于显存,小批量也能工作 |
| 残差块数量 | 9 | 平衡模型容量与训练难度 |
| λ_cycle | 10 | 循环一致性损失权重 |
| λ_identity | 0.5 | 身份损失权重 |

注意:训练初期(约前10个epoch)可暂时禁用身份损失,待模型初步收敛后再加入。

## 5.
效果展示与应用扩展

### 5.1 转换效果对比

经过200个epoch的训练后,我们得到以下典型转换效果:

- 马→斑马转换:保留原始姿态和背景;成功添加斑马条纹;自然调整颜色至斑马特征。
- 斑马→马转换:去除条纹纹理;调整为单色毛发;保持四肢结构和场景不变。

### 5.2 其他应用场景

只需更换训练数据,同一套代码可用于:

- 风景照片的季节转换(夏↔冬)
- 素描与照片的相互转换
- 不同艺术风格间的迁移
- 医学图像域适应(CT↔MRI)

```python
# 快速测试模型
def test_transform(image_path):
    image = Image.open(image_path)
    transform = transforms.Compose([
        transforms.Resize(256),
        transforms.ToTensor(),
        transforms.Normalize((0.5, 0.5, 0.5), (0.5, 0.5, 0.5))
    ])
    return transform(image).unsqueeze(0)

horse_img = test_transform("test_horse.jpg").to(device)
zebra_result = G_A2B(horse_img)
save_image(zebra_result * 0.5 + 0.5, "converted_zebra.jpg")
```

在实际项目中,我发现调整身份损失的权重对颜色保持特别关键。当处理人像照片时,λ_identity=0.5能有效防止肤色异常变化;而对于风景照,则可以适当降低到0.2-0.3。另一个实用技巧是:在训练后期(最后20%的epoch)将学习率线性衰减到零,这能显著提升生成图像的细节质量。