Top Important Computer Vision Papers for the Week from 18/12 to 24/12
Stay Updated with Recent Computer Vision Research
Every week, several top-tier academic conferences and journals showcase innovative research in computer vision, presenting exciting breakthroughs in subfields such as image recognition, vision model optimization, generative adversarial networks (GANs), image segmentation, video analysis, and more.
This article provides a comprehensive overview of the most significant papers published in the third week of December 2023, highlighting the latest research and advancements in computer vision. Whether you’re a researcher, practitioner, or enthusiast, you will find valuable insights into state-of-the-art techniques and tools in computer vision.
Table of Contents:
Stable Diffusion
Vision Language Models
Image Generation & Editing
Video Generation & Editing
Image Segmentation
Image Recognition
My E-book: Data Science Portfolio for Success Is Out!
I recently published my first e-book, Data Science Portfolio for Success, a practical guide to building your data science portfolio. The book covers the following topics:
The Importance of Having a Portfolio as a Data Scientist
How to Build a Data Science Portfolio That Will Land You a Job?
1. Stable Diffusion
DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models
Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models
SCEdit: Efficient and Controllable Image Diffusion Generation via Skip Connection Editing
Adaptive Guidance: Training-free Acceleration of Conditional Diffusion Models
VolumeDiffusion: Flexible Text-to-3D Generation with Efficient Volumetric Encoder
Towards Accurate Guided Diffusion Sampling through Symplectic Adjoint Method
StreamDiffusion: A Pipeline-level Solution for Real-time Interactive Generation
Carve3D: Improving Multi-view Reconstruction Consistency for Diffusion Models with RL Finetuning
Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model
RadEdit: stress-testing biomedical vision models via diffusion image editing
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
2. Vision Language Models
Silkie: Preference Distillation for Large Visual Language Models
Jack of All Tasks, Master of Many: Designing General-purpose Coarse-to-Fine Vision-Language Model
A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise
3. Image Generation & Editing
GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning
MAG-Edit: Localized Image Editing in Complex Scenarios via Mask-Based Attention-Adjusted Guidance
StarVector: Generating Scalable Vector Graphics Code from Images
HAAR: Text-Conditioned Generative Model of 3D Strand-based Human Hairstyles
TIP: Text-Driven Image Processing with Semantic and Restoration Instructions
Customize-It-3D: High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior
Unlocking Pre-trained Image Backbones for Semantic Image Synthesis
ShowRoom3D: Text to High-Quality 3D Room Generation Using 3D Priors
DreamTuner: Single Image is Enough for Subject-Driven Generation
PIA: Your Personalized Image Animator via Plug-and-Play Modules in Text-to-Image Models
4. Video Generation & Editing
GauFRe: Gaussian Deformation Fields for Real-time Dynamic Novel View Synthesis
Text-Conditioned Resampler For Long Form Video Understanding
HeadCraft: Modeling High-Detail Shape Variations for Animated 3DMMs
VideoPoet: A Large Language Model for Zero-Shot Video Generation
InstructVideo: Instructing Video Diffusion Models with Human Feedback
MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
Fairy: Fast Parallelized Instruction-Guided Video-to-Video Synthesis
5. Image Segmentation
6. Object Tracking
Are you looking to start a career in data science and AI but do not know how? I offer data science mentoring sessions and long-term career mentoring:
Mentoring sessions: https://lnkd.in/dXeg3KPW
Long-term mentoring: https://lnkd.in/dtdUYBrM