Official JAX implementation of XMC-GAN for text-to-image generation using cross-modal contrastive learning.
XMC-GAN is a research implementation for generating images from text descriptions using cross-modal contrastive learning. It leverages a novel approach to align text and image representations, producing high-quality, semantically consistent images.
XMC-GAN emphasizes robust cross-modal alignment through contrastive learning, aiming to bridge the gap between textual descriptions and visual content with high fidelity and efficiency.
Open-Awesome is built by the community, for the community. Submit a project, suggest an awesome list, or help improve the catalog on GitHub.