HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis本文是论文HiFi-GAN: Generative Adve
2025-01-03
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
DDPM: Denoising Diffusion Probabilistic Model
Pytorch实现: VQ-VAE
Introduction: Vector Quantization
Multimodal Large Language Model 总结
通用信息抽取(下) - UniEX, Mirror, RexUIE
通用信息抽取(上) - UIE, USM, InstructUIE
2024-元旦
Vision & Language Pretrained Model 总结
大模型并行优化
QIDN: Query-based Instance Discrimination Network for Relational Triple Extraction