Google Research ‚Vlogger‘ – Multimodal Diffusion for Embodied Avatar Synthesis

Google Research ‚Vlogger‘ – a method for text and audio-driven talking human video generation from a single input image of a person which builds on the success of recent generative diffusion models

Leave a Reply

You must be logged in to post a comment.