Updated on 2024.12.19

Image Generation

Publish Date Title Authors PDF Code
2024-12-18 E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling Zhihang Yuan et.al. 2412.14170v1 null
2024-12-18 Autoregressive Video Generation without Vector Quantization Haoge Deng et.al. 2412.14169v1 link
2024-12-18 FashionComposer: Compositional Fashion Image Generation Sihui Ji et.al. 2412.14168v1 null
2024-12-18 VideoDPO: Omni-Preference Alignment for Video Diffusion Generation Runtao Liu et.al. 2412.14167v1 null
2024-12-18 Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation Localization Xuekang Zhu et.al. 2412.13753v1 link
2024-12-18 Text2Relight: Creative Portrait Relighting with Text Guidance Junuk Cha et.al. 2412.13734v1 null
2024-12-18 Diffusion models and stochastic quantisation in lattice field theory Gert Aarts et.al. 2412.13704v1 null
2024-12-18 MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing Chuang Yang et.al. 2412.13684v1 null
2024-12-18 Self-control: A Better Conditional Mechanism for Masked Autoregressive Model Qiaoying Qu et.al. 2412.13635v1 null
2024-12-17 Posterior Mean Matching: Generative Modeling through Online Bayesian Inference Sebastian Salazar et.al. 2412.13286v1 null
2024-12-17 F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration Lu Liu et.al. 2412.13155v1 null
2024-12-17 Prompt Augmentation for Self-supervised Text-guided Image Manipulation Rumeysa Bodur et.al. 2412.13081v1 null
2024-12-17 3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation Haoshen Wang et.al. 2412.13059v1 null
2024-12-17 Stable Diffusion is a Natural Cross-Modal Decoder for Layered AI-generated Image Compression Ruijie Chen et.al. 2412.12982v1 null
2024-12-17 Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance Wenhao Sun et.al. 2412.12974v2 link
2024-12-17 Unsupervised Region-Based Image Editing of Denoising Diffusion Models Zixiang Li et.al. 2412.12912v1 null
2024-12-17 ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction Zhongjie Duan et.al. 2412.12888v2 link
2024-12-17 Rethinking Diffusion-Based Image Generators for Fundus Fluorescein Angiography Synthesis on Limited Data Chengzhou Yu et.al. 2412.12778v1 null
2024-12-17 Guided and Variance-Corrected Fusion with One-shot Style Alignment for Large-Content Image Generation Shoukun Sun et.al. 2412.12771v1 null
2024-12-17 Addressing Small and Imbalanced Medical Image Datasets Using Generative Models: A Comparative Study of DDPM and PGGANs with Random and Greedy K Sampling Iman Khazrak et.al. 2412.12532v1 link
2024-12-17 Pattern Analogies: Learning to Perform Programmatic Image Edits by Analogy Aditya Ganeshan et.al. 2412.12463v1 null
2024-12-17 Numerical Pruning for Efficient Autoregressive Models Xuan Shen et.al. 2412.12441v1 null
2024-12-16 Efficient Scaling of Diffusion Transformers for Text-to-Image Generation Hao Li et.al. 2412.12391v1 null
2024-12-16 OmniPrism: Learning Disentangled Visual Concept for Image Generation Yangyang Li et.al. 2412.12242v1 null
2024-12-16 You Only Submit One Image to Find the Most Suitable Generative Model Zhi Zhou et.al. 2412.12232v1 null
2024-12-16 Causal Diffusion Transformers for Generative Modeling Chaorui Deng et.al. 2412.12095v2 link
2024-12-16 Instruction-based Image Manipulation by Watching How Things Move Mingdeng Cao et.al. 2412.12087v1 null
2024-12-16 A LoRA is Worth a Thousand Pictures Chenxi Liu et.al. 2412.12048v1 null
2024-12-16 IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation Yiren Song et.al. 2412.11638v1 null
2024-12-16 3D$^2$-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling Zichen Tang et.al. 2412.11599v1 link

Light Field Super Resolution

Publish Date Title Authors PDF Code
2024-10-14 SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators Rasoul Shafipour et.al. 2410.10714v2 null
2024-09-26 LGFN: Lightweight Light Field Image Super-Resolution using Local Convolution Modulation and Global Attention Feature Extraction Zhongxin Yu et.al. 2409.17759v1 null
2024-07-22 Efficient Multi-disparity Transformer for Light Field Image Super-resolution Zeke Zexi Hu et.al. 2407.15329v1 null
2024-06-23 Mamba-based Light Field Super-Resolution with Efficient Subspace Scanning Ruisheng Gao et.al. 2406.16083v1 null
2024-06-18 LFMamba: Light Field Image Super-Resolution with State Space Model Wang xia et.al. 2406.12463v1 null
2024-05-11 Incorporating Degradation Estimation in Light Field Spatial Super-Resolution Zeyu Xiao et.al. 2405.07012v1 null
2024-04-18 Pseudo-random generators using linear feedback shift registers with output extraction Holger Nobach et.al. 2404.12011v1 null
2024-02-29 Unsupervised Learning of High-resolution Light Field Imaging via Beam Splitter-based Hybrid Lenses Jianxin Lei et.al. 2402.19020v1 null
2024-02-16 Lightweight ciphers based on chaotic Map -- LFSR architectures M. Garcia-Bosque et.al. 2402.10871v1 null
2024-01-01 Beyond Subspace Isolation: Many-to-Many Transformer for Light Field Image Super-resolution Zeke Zexi Hu et.al. 2401.00740v1 null
2023-07-31 LFSR based RNG on low cost FPGA for QKD applications Pooja Chandravanshi et.al. 2307.16431v1 null
2023-07-05 A Scheme to resist Fast Correlation Attack for Word Oriented LFSR based Stream Cipher Subrata Nandi et.al. 2307.02182v1 null
2023-06-07 Security Analysis of WG-7 Lightweight Stream Cipher against Cube Attack Bijoy Das et.al. 2306.04352v1 null
2023-05-30 Toward Real-World Light Field Super-Resolution Zeyu Xiao et.al. 2305.18994v1 link
2023-05-12 A Lightweight Authentication Protocol against Modeling Attacks based on a Novel LFSR-APUF Yao Wang et.al. 2305.07254v1 null
2023-04-30 On Rueppel's Linear Complexity Conjecture Graham H. Norton et.al. 2305.00405v1 null
2023-04-20 NTIRE 2023 Challenge on Light Field Image Super-Resolution: Dataset, Methods and Results Yingqian Wang et.al. 2304.10415v1 link
2023-04-11 Towards Power Characterization of FPGA Architectures To Enable Open-Source Power Estimation Using Micro-Benchmarks Stefan Riesenberger et.al. 2304.05326v1 null
2023-03-16 Linear Codes from Simplicial Complexes over $\mathbb{F}_{2^n}$ Hongwei Liu et.al. 2303.09292v1 null
2023-03-05 A Provably Secure Strong PUF based on LWE: Construction and Implementation Xiaodan Xi et.al. 2303.02802v1 null
2022-10-09 Learning Texture Transformer Network for Light Field Super-Resolution Javeria Shabbir et.al. 2210.09293v1 null
2022-08-12 Software implementation of the SNOW 3G Generator on iOS and Android platforms Jezabel Molina-Gil et.al. 2208.06147v1 null
2022-08-06 RFID authentication protocol based on a novel EPC Gen2 PRNG Pino Caballero-Gil et.al. 2208.05345v1 null
2022-08-06 Weak Equivalents for Nonlinear Filtering Functions Amparo Fúster-Sabater et.al. 2208.04734v1 null
2022-07-31 Ordered Orthogonal Array Construction Using LFSR Sequences André Guerino Castoldi et.al. 2208.00333v1 null
2022-07-25 Sub-Aperture Feature Adaptation in Single Image Super-resolution Model for Light Field Imaging Aupendu Kar et.al. 2207.11894v2 null
2022-06-09 A GPU-Accelerated Light-field Super-resolution Framework Based on Mixed Noise Model and Weighted Regularization Trung-Hieu Tran et.al. 2206.05047v1 null
2022-01-02 Detail-Preserving Transformer for Light Field Image Super-Resolution Shunzhou Wang et.al. 2201.00346v1 link
2021-11-07 Texture-enhanced Light Field Super-resolution with Spatio-Angular Decomposition Kernels Zexi Hu et.al. 2111.04069v2 null
2021-10-07 Shift-BNN: Highly-Efficient Probabilistic Bayesian Neural Network Training via Memory-Friendly Pattern Retrieving Qiyu Wan et.al. 2110.03553v1 null

Light Field Depth Estimation

Publish Date Title Authors PDF Code
2024-03-04 Iterative Occlusion-Aware Light Field Depth Estimation using 4D Geometrical Cues Rui Lourenço et.al. 2403.02043v1 null
2023-05-28 OccCasNet: Occlusion-aware Cascade Cost Volume for Light Field Depth Estimation Wentao Chao et.al. 2305.17710v1 link
2023-01-20 Unsupervised Light Field Depth Estimation via Multi-view Feature Matching with Occlusion Prediction Shansi Zhang et.al. 2301.08433v2 null
2022-08-20 Learning Sub-Pixel Disparity Distribution for Light Field Depth Estimation Wentao Chao et.al. 2208.09688v3 link
2022-03-29 Light Field Depth Estimation via Stitched Epipolar Plane Images Ping Zhou et.al. 2203.15201v3 link
2022-03-29 Self-Supervised Light Field Depth Estimation Using Epipolar Plane Images Kunyuan Li et.al. 2203.15171v1 null
2022-03-04 OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation Peng Li et.al. 2203.02231v3 null
2022-03-03 Occlusion-Aware Cost Constructor for Light Field Depth Estimation Yingqian Wang et.al. 2203.01576v1 link
2021-06-06 Occlusion-aware Unsupervised Learning of Depth from 4-D Light Fields Jing Jin et.al. 2106.03043v2 link
2021-04-13 Learning Multi-modal Information for Robust Light Field Depth Estimation Yongri Piao et.al. 2104.05971v1 link
2021-04-13 Dynamic Fusion Network For Light Field Depth Estimation Yongri Piao et.al. 2104.05969v1 null
2020-09-09 View-consistent 4D Light Field Depth Estimation Numair Khan et.al. 2009.04065v1 link
2020-07-09 EPI-based Oriented Relation Networks for Light Field Depth Estimation Kunyuan Li et.al. 2007.04538v2 link
2019-09-19 Learning to Think Outside the Box: Wide-Baseline Light Field Depth Estimation with EPI-Shift Titus Leistner et.al. 1909.09059v1 null
2019-07-31 Rapid Light Field Depth Estimation with Semi-Global Matching Yuriy Anisimov et.al. 1907.13449v1 null
2018-04-06 EPINET: A Fully-Convolutional Neural Network Using Epipolar Geometry for Depth from Light Field Images Changha Shin et.al. 1804.02379v1 null
2017-08-07 Accurate Light Field Depth Estimation with Superpixel Regularization over Partially Occluded Regions Jie Chen et.al. 1708.01964v1 null
2016-08-15 Occlusion-Model Guided Anti-Occlusion Depth Estimation in Light Field Hao Zhu et.al. 1608.04187v2 null

Light Field View Synthesis

Publish Date Title Authors PDF Code
2024-03-15 Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience Xiaohang Yu et.al. 2403.09973v1 null
2024-03-02 Neural radiance fields-based holography [Invited] Minsung Kang et.al. 2403.01137v2 null
2023-11-14 Learning based Deep Disentangling Light Field Reconstruction and Disparity Estimation Application Langqing Shi et.al. 2311.08129v1 null
2023-09-04 ImmersiveNeRF: Hybrid Radiance Fields for Unbounded Immersive Light Field Reconstruction Xiaohang Yu et.al. 2309.01374v1 null
2023-07-06 RealLiFe: Real-Time Light Field Reconstruction via Hierarchical Sparse Gradient Descent Yijie Deng et.al. 2307.03017v3 null
2022-12-23 Quantum correlation light-field microscope with extreme depth of field Yingwen Zhang et.al. 2212.12582v2 null
2022-09-22 Fast Disparity Estimation from a Single Compressed Light Field Measurement Emmanuel Martinez et.al. 2209.11342v1 null
2022-04-26 Acquiring a Dynamic Light Field through a Single-Shot Coded Image Ryoya Mizuno et.al. 2204.12089v1 null
2022-04-01 Epipolar Focus Spectrum: A Novel Light Field Representation and Application in Dense-view Reconstruction Yaning Li et.al. 2204.00193v1 null
2021-08-27 A Novel Hierarchical Light Field Coding Scheme Based on Hybrid Stacked Multiplicative Layers and Fourier Disparity Layers for Glasses-Free 3D Displays Joshitha Ravishankar et.al. 2108.12399v1 null
2021-08-08 Efficient Light Field Reconstruction via Spatio-Angular Dense Network Zexi Hu et.al. 2108.03635v1 link
2021-06-04 Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering Vincent Sitzmann et.al. 2106.02634v2 null
2021-03-24 Light Field Reconstruction Using Convolutional Network on EPI and Extended Applications Gaochang Wu et.al. 2103.13043v1 link
2021-02-14 Light Field Reconstruction via Deep Adaptive Fusion of Hybrid Lenses Jing Jin et.al. 2102.07085v3 link
2020-12-03 Light-field view synthesis using convolutional block attention module M. Shahzeb Khan Gul et.al. 2012.01900v2 null
2020-09-07 Light Field View Synthesis via Aperture Disparity and Warping Confidence Map Nan Meng et.al. 2009.02978v2 null
2020-08-12 Self-supervised Light Field View Synthesis Using Cycle Consistency Yang Chen et.al. 2008.05084v1 null
2020-07-23 Deep Spatial-angular Regularization for Compressive Light Field Reconstruction over Coded Apertures Mantang Guo et.al. 2007.11882v1 link
2020-07-05 Spatial-Angular Attention Network for Light Field Reconstruction Gaochang Wu et.al. 2007.02252v2 link
2020-05-13 A Generative Model for Generic Light Field Reconstruction Paramanand Chandramouli et.al. 2005.06508v2 null
2020-03-20 Self-Supervised Light Field Reconstruction Using Shearlet Transform and Cycle Consistency Yuan Gao et.al. 2003.09294v1 null
2020-03-19 DRST: Deep Residual Shearlet Transform for Densely Sampled Light Field Reconstruction Yuan Gao et.al. 2003.08865v1 null
2020-02-26 Learning Light Field Angular Super-Resolution via a Geometry-Aware Network Jing Jin et.al. 2002.11263v1 link
2020-01-14 Seeing the World in a Bag of Chips Jeong Joon Park et.al. 2001.04642v2 null
2019-10-03 High-dimensional Dense Residual Convolutional Neural Network for Light Field Reconstruction Nan Meng et.al. 1910.01426v4 link
2019-08-31 Deep Coarse-to-fine Dense Light Field Reconstruction with Flexible Sampling and Geometry-aware Fusion Jing Jin et.al. 1909.01341v3 link
2019-02-17 LapEPI-Net: A Laplacian Pyramid EPI structure for Learning-based Dense Light Field Reconstruction Gaochang Wu et.al. 1902.06221v1 null
2018-12-26 A Unified Learning Based Framework for Light Field Reconstruction from Coded Projections Anil Kumar Vadathya et.al. 1812.10532v2 null
2018-10-20 A System for Acquiring, Processing, and Rendering Panoramic Light Field Stills for Virtual Reality Ryan S. Overbeck et.al. 1810.08860v1 null
2018-06-14 Dense Light Field Reconstruction From Sparse Sampling Using Residual Network Mantang Guo et.al. 1806.05506v2 null

Light Field Other Applications

Publish Date Title Authors PDF Code
2024-11-21 Segment Anything in Light Fields for Real-Time Applications via Constrained Prompting Nikolai Goncharov et.al. 2411.13840v1 link
2023-03-13 View Adaptive Light Field Deblurring Networks with Depth Perception Zeqi Shen et.al. 2303.06860v1 null
2022-04-28 Learning from Pixel-Level Noisy Label : A New Perspective for Light Field Saliency Detection Mingtao Feng et.al. 2204.13456v1 link
2021-10-02 Light Field Saliency Detection with Dual Local Graph Learning and Reciprocative Guidance Nian Liu et.al. 2110.00698v1 link
2020-12-30 DUT-LFSaliency: Versatile Dataset and Light Field-to-RGB Saliency Detection Yongri Piao et.al. 2012.15124v1 null
2020-10-25 Fast and Accurate Light Field Saliency Detection through Deep Encoding Sahan Hemachandra et.al. 2010.13073v2 null
2019-06-19 Light Field Saliency Detection with Deep Convolutional Networks Jun Zhang et.al. 1906.08331v2 link
2019-03-31 Fast and Full-Resolution Light Field Deblurring using a Deep Neural Network Jonathan Samuel Lumentut et.al. 1904.00352v1 null
2017-12-20 Light Field Segmentation From Super-pixel Graph Representation Xianqiang Lv et.al. 1712.07394v1 null
2017-11-29 Joint Blind Motion Deblurring and Depth Estimation of Light Field Dongwoo Lee et.al. 1711.10918v2 null

Diffusion

Publish Date Title Authors PDF Code
2024-12-18 AniDoc: Animation Creation Made Easier Yihao Meng et.al. 2412.14173v1 null
2024-12-18 E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling Zhihang Yuan et.al. 2412.14170v1 null
2024-12-18 Autoregressive Video Generation without Vector Quantization Haoge Deng et.al. 2412.14169v1 link
2024-12-18 VideoDPO: Omni-Preference Alignment for Video Diffusion Generation Runtao Liu et.al. 2412.14167v1 null
2024-12-18 AKiRa: Augmentation Kit on Rays for optical video generation Xi Wang et.al. 2412.14158v1 null
2024-12-18 MCMat: Multiview-Consistent and Physically Accurate PBR Material Generation Shenhao Zhu et.al. 2412.14148v1 null
2024-12-18 Measuring collective diffusion properties by counting particles in boxes Adam Carter et.al. 2412.14122v1 null
2024-12-18 SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation Tong Chen et.al. 2412.14018v1 null
2024-12-18 A perturbative approach to the macroscopic fluctuation theory Thierry Bodineau et.al. 2412.13991v1 null
2024-12-18 Double sine-Gordon class of universal coarsening dynamics in a spin-1 Bose gas Ido Siovitz et.al. 2412.13986v1 null
2024-12-18 Gravitational wave astronomy and the expansion history of the Universe Massimo Giovannini et.al. 2412.13968v1 null
2024-12-18 Comparative Analysis of Machine Learning-Based Imputation Techniques for Air Quality Datasets with High Missing Data Rates Sen Yan et.al. 2412.13966v1 null
2024-12-18 Anomalous Diffusion of Superparamagnetic Walkers with Tailored Statistics Alessia Gentili et.al. 2412.13960v1 null
2024-12-18 On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process Gereziher Adhane et.al. 2412.13943v1 null
2024-12-18 Spatio-Temporal Forecasting of PM2.5 via Spatial-Diffusion guided Encoder-Decoder Architecture Malay Pandey et.al. 2412.13935v1 null
2024-12-18 Investigating the Effects of Diffusion-based Conditional Generative Speech Models Used for Speech Enhancement on Dysarthric Speech Joanna Reszka et.al. 2412.13933v1 null
2024-12-18 Two-states Brownian particle in a Harmonic Potential Giovanni Battista Carollo et.al. 2412.13921v1 null
2024-12-18 X-ray Binaries: a potential dominant contributor to the cosmic ray spectral knee structure Hua Yue et.al. 2412.13889v1 null
2024-12-18 IDEQ: an improved diffusion model for the TSP Mickael Basson et.al. 2412.13858v1 null
2024-12-18 Coupled Eikonal problems to model cardiac reentries in Purkinje network and myocardium Samuele Brunati et.al. 2412.13837v1 null
2024-12-18 Object Style Diffusion for Generalized Object Detection in Urban Scene Hao Li et.al. 2412.13815v1 null
2024-12-18 Spatial Brain Tumor Concentration Estimation for Individualized Radiotherapy Planning Jonas Weidner et.al. 2412.13811v1 null
2024-12-18 SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor Chenyu Yang et.al. 2412.13786v1 null
2024-12-18 Minimum nonlinearity for pattern-forming Turing instability in a mathematical autocatalytic model Javier López-Pedrares et.al. 2412.13783v1 null
2024-12-18 Transport theory and spin-transfer physics in d-wave altermagnets Ricardo Zarzuela et.al. 2412.13763v1 null
2024-12-18 Text2Relight: Creative Portrait Relighting with Text Guidance Junuk Cha et.al. 2412.13734v1 null
2024-12-18 Diffusion models and stochastic quantisation in lattice field theory Gert Aarts et.al. 2412.13704v1 null
2024-12-18 MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing Chuang Yang et.al. 2412.13684v1 null
2024-12-18 VIIS: Visible and Infrared Information Synthesis for Severe Low-light Image Enhancement Chen Zhao et.al. 2412.13655v1 link
2024-12-18 Model-independent measurement of isospin diffusion in Ni-Ni systems at intermediate energy C. Ciampi et.al. 2412.13648v1 null

Vision Transformer

Publish Date Title Authors PDF Code
2024-12-18 LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer Yipeng Zhang et.al. 2412.13871v1 null
2024-12-17 Identification of Epileptic Spasms (ESES) Phases Using EEG Signals: A Vision Transformer Approach Wei Gong et.al. 2412.13028v1 null
2024-12-17 Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-training Mingjia Shi et.al. 2412.12496v1 link
2024-12-16 No More Adam: Learning Rate Scaling at Initialization is All You Need Minghao Xu et.al. 2412.11768v2 link
2024-12-16 Flex-PE: Flexible and SIMD Multi-Precision Processing Element for AI Workloads Mukul Lokhande et.al. 2412.11702v1 null
2024-12-16 HResFormer: Hybrid Residual Transformer for Volumetric Medical Image Segmentation Sucheng Ren et.al. 2412.11458v1 null
2024-12-15 One-Shot Multilingual Font Generation Via ViT Zhiheng Wang et.al. 2412.11342v1 null
2024-12-15 MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation Zhiwei Yang et.al. 2412.11076v1 link
2024-12-14 RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone Mustafa Munir et.al. 2412.10995v1 link
2024-12-14 Boosting ViT-based MRI Reconstruction from the Perspectives of Frequency Modulation, Spatial Purification, and Scale Diversification Yucong Meng et.al. 2412.10776v1 null
2024-12-14 One Pixel is All I Need Deng Siqin et.al. 2412.10681v1 null
2024-12-13 Learning to Merge Tokens via Decoupled Embedding for Efficient Vision Transformers Dong Hoon Lee et.al. 2412.10569v1 link
2024-12-13 VibrantVS: A high-resolution multi-task transformer for forest canopy height estimation Tony Chang et.al. 2412.10351v1 null
2024-12-13 ManipGPT: Is Affordance Segmentation by Large Vision Models Enough for Articulated Object Manipulation? Taewhan Kim et.al. 2412.10050v2 null
2024-12-13 T-GMSI: A transformer-based generative model for spatial interpolation under sparse measurements Xiangxi Tian et.al. 2412.09886v1 null
2024-12-12 Agtech Framework for Cranberry-Ripening Analysis Using Vision Foundation Models Faith Johnson et.al. 2412.09739v1 null
2024-12-12 From Noise to Nuance: Advances in Deep Generative Image Models Benji Peng et.al. 2412.09656v1 null
2024-12-12 Vision Transformers for Efficient Indoor Pathloss Radio Map Prediction Edvard Ghukasyan et.al. 2412.09507v1 null
2024-12-12 A Novel Ensemble-Based Deep Learning Model with Explainable AI for Accurate Kidney Disease Diagnosis Md. Arifuzzaman et.al. 2412.09472v1 null
2024-12-12 Advancing Attribution-Based Neural Network Explainability through Relative Absolute Magnitude Layer-Wise Relevance Propagation and Multi-Component Evaluation Davor Vukadin et.al. 2412.09311v1 link
2024-12-12 Selective Visual Prompting in Vision Mamba Yifeng Yao et.al. 2412.08947v1 null
2024-12-12 Sensing for Space Safety and Sustainability: A Deep Learning Approach with Vision Transformers Wenxuan Zhang et.al. 2412.08913v2 null
2024-12-11 SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation Tapas Kumar Dutta et.al. 2412.08482v1 null
2024-12-11 A Review of Intelligent Device Fault Diagnosis Technologies Based on Machine Vision Guiran Liu et.al. 2412.08148v1 null
2024-12-10 Comparative Analysis of Deep Learning Approaches for Harmful Brain Activity Detection Using EEG Shivraj Singh Bhatti et.al. 2412.07878v1 null
2024-12-10 An Enhancement of CNN Algorithm for Rice Leaf Disease Image Classification in Mobile Applications Kayne Uriel K. Rodrigo et.al. 2412.07182v1 null
2024-12-09 Static Key Attention in Vision Zizhao Hu et.al. 2412.07049v1 null
2024-12-09 Vision transformer based Deep Learning of Topological indicators in Majorana Nanowires Jacob R. Taylor et.al. 2412.06768v1 null
2024-12-09 Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image Segmentation Shun Zhang et.al. 2412.06664v2 null
2024-12-09 Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers Johanna Vielhaben et.al. 2412.06639v1 link

NeRF

Publish Date Title Authors PDF Code
2024-12-18 GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians Xiaobao Wei et.al. 2412.13983v1 link
2024-12-17 EOGS: Gaussian Splatting for Earth Observation Luca Savant Aira et.al. 2412.13047v1 null
2024-12-17 Optimize the Unseen -- Fast NeRF Cleanup with Free Space Prior Leo Segre et.al. 2412.12772v2 null
2024-12-17 Towards a Training Free Approach for 3D Scene Editing Vivek Madhavaram et.al. 2412.12766v1 null
2024-12-16 GS-ProCams: Gaussian Splatting-based Projector-Camera Systems Qingyue Deng et.al. 2412.11762v1 null
2024-12-16 Sequence Matters: Harnessing Video Models in 3D Super-Resolution Hyun-kyu Ko et.al. 2412.11525v2 null
2024-12-16 VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression Qiang Hu et.al. 2412.11362v1 null
2024-12-13 NeRF-Texture: Synthesizing Neural Radiance Field Textures Yi-Hua Huang et.al. 2412.10004v1 null
2024-12-13 Sharpening Your Density Fields: Spiking Neuron Aided Fast Geometry Learning Yi Gu et.al. 2412.09881v1 null
2024-12-12 PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields Sean Wu et.al. 2412.09680v1 link
2024-12-11 GN-FR:Generalizable Neural Radiance Fields for Flare Removal Gopi Raju Matta et.al. 2412.08200v2 null
2024-12-11 NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods Qiang Qu et.al. 2412.08029v1 link
2024-12-10 EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering Toshiya Yura et.al. 2412.07293v1 null
2024-12-09 Diffusing Differentiable Representations Yash Savani et.al. 2412.06981v1 null
2024-12-09 Dynamic EventNeRF: Reconstructing General Dynamic Scenes from Multi-view Event Cameras Viktor Rudnev et.al. 2412.06770v1 null
2024-12-09 Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular Video Renlong Wu et.al. 2412.06424v1 link
2024-12-09 Splatter-360: Generalizable 360$^{\circ}$ Gaussian Splatting for Wide-baseline Panoramic Images Zheng Chen et.al. 2412.06250v1 link
2024-12-07 WATER-GS: Toward Copyright Protection for 3D Gaussian Splatting via Universal Watermarking Yuqi Tan et.al. 2412.05695v1 null
2024-12-06 Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories Susung Hong et.al. 2412.05279v1 null
2024-12-06 MixedGaussianAvatar: Realistically and Geometrically Accurate Head Avatar via Mixed 2D-3D Gaussian Splatting Peng Chen et.al. 2412.04955v2 link
2024-12-04 NeRF and Gaussian Splatting SLAM in the Wild Fabian Schmidt et.al. 2412.03263v1 link
2024-12-01 SAGA: Surface-Aligned Gaussian Avatar Ronghan Chen et.al. 2412.00845v1 null
2024-12-01 CtrlNeRF: The Generative Neural Radiation Fields for the Controllable Synthesis of High-fidelity 3D-Aware Images Jian Liu et.al. 2412.00754v1 null
2024-11-30 Speedy-Splat: Fast 3D Gaussian Splatting with Sparse Pixels and Sparse Primitives Alex Hanson et.al. 2412.00578v1 link
2024-11-30 Instant3dit: Multiview Inpainting for Fast Editing of 3D Objects Amir Barda et.al. 2412.00518v1 null
2024-11-29 $C^{3}$-NeRF: Modeling Multiple Scenes via Conditional-cum-Continual Neural Radiance Fields Prajwal Singh et.al. 2411.19903v1 null
2024-11-29 Gaussian Splashing: Direct Volumetric Rendering Underwater Nir Mualem et.al. 2411.19588v1 null
2024-11-29 ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration Chaojun Ni et.al. 2411.19548v1 null
2024-11-29 LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis Tianqi Li et.al. 2411.19525v1 null
2024-11-28 SAMa: Material-aware 3D Selection and Segmentation Michael Fischer et.al. 2411.19322v1 null

Super Resolution

Publish Date Title Authors PDF Code
2024-12-18 Super-Resolution Generative Adversarial Network for Data Compression of Direct Numerical Simulations Ludovico Nista et.al. 2412.14150v1 null
2024-12-17 Learning of Patch-Based Smooth-Plus-Sparse Models for Image Reconstruction Stanislas Ducotterd et.al. 2412.13070v1 link
2024-12-17 Super-Resolving Normalising Flows for Lattice Field Theories Marc Bauer et.al. 2412.12842v1 null
2024-12-16 EGP3D: Edge-guided Geometric Preserving 3D Point Cloud Super-resolution for RGB-D camera Zheng Fang et.al. 2412.11680v1 null
2024-12-16 CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution Bingwen Hu et.al. 2412.11609v1 null
2024-12-16 Sequence Matters: Harnessing Video Models in 3D Super-Resolution Hyun-kyu Ko et.al. 2412.11525v2 null
2024-12-16 Block-Based Multi-Scale Image Rescaling Jian Li et.al. 2412.11468v1 null
2024-12-16 Quantization of Climate Change Impacts on Renewable Energy Generation Capacity: A Super-Resolution Recurrent Diffusion Model Xiaochong Dong et.al. 2412.11399v1 null
2024-12-14 A Staged Deep Learning Approach to Spatial Refinement in 3D Temporal Atmospheric Transport M. Giselle Fernández-Godino et.al. 2412.10945v2 null
2024-12-13 SuperMark: Robust and Training-free Image Watermarking via Diffusion-based Super-Resolution Runyi Hu et.al. 2412.10049v1 null
2024-12-13 A Single-Frame and Multi-Frame Cascaded Image Super-Resolution Method Jing Sun et.al. 2412.09846v1 null
2024-12-13 Super-Resolution for Remote Sensing Imagery via the Coupling of a Variational Model and Deep Learning Jing Sun et.al. 2412.09841v1 null
2024-12-11 RealOSR: Latent Unfolding Boosting Diffusion-based Real-world Omnidirectional Image Super-Resolution Xuhan Sheng et.al. 2412.09646v1 null
2024-12-12 OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs Yuanzhi Zhu et.al. 2412.09465v1 link
2024-12-12 A Plug-and-Play Algorithm for 3D Video Super-Resolution of Single-Photon LiDAR data Alice Ruget et.al. 2412.09427v1 null
2024-12-12 Distribution free uncertainty quantification in neuroscience-inspired deep operators Shailesh Garg et.al. 2412.09369v1 null
2024-12-12 Arbitrary-steps Image Super-resolution via Diffusion Inversion Zongsheng Yue et.al. 2412.09013v1 link
2024-12-11 Fair Primal Dual Splitting Method for Image Inverse Problems Yunfei Qu et.al. 2412.08613v1 null
2024-12-11 Efficient estimation of error bounds for quantum multiparametric imaging with constraints Alexander Mikhalychev et.al. 2412.08199v2 null
2024-12-11 Statistical Downscaling via High-Dimensional Distribution Matching with Generative Models Zhong Yi Wan et.al. 2412.08079v1 null
2024-12-10 MPSI: Mamba enhancement model for pixel-wise sequential interaction Image Super-Resolution Yuchun He et.al. 2412.07222v1 null
2024-12-10 A Progressive Image Restoration Network for High-order Degradation Imaging in Remote Sensing Yujie Feng et.al. 2412.07195v1 null
2024-12-10 Hero-SR: One-Step Diffusion for Super-Resolution with Human Perception Priors Jiangang Wang et.al. 2412.07152v1 null
2024-12-10 RAP-SR: RestorAtion Prior Enhancement in Diffusion Models for Realistic Image Super-Resolution Jiangang Wang et.al. 2412.07149v1 link
2024-12-09 Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning Mehdi Noroozi et.al. 2412.06978v1 null
2024-12-09 Neural Garment Dynamic Super-Resolution Meng Zhang et.al. 2412.06285v1 link
2024-12-09 MSCrackMamba: Leveraging Vision Mamba for Crack Detection in Fused Multispectral Imagery Qinfeng Zhu et.al. 2412.06211v1 null
2024-12-09 You KAN Do It in a Single Shot: Plug-and-Play Methods with Single-Instance Priors Yanqi Cheng et.al. 2412.06204v1 null
2024-12-07 Jointly RS Image Deblurring and Super-Resolution with Adjustable-Kernel and Multi-Domain Attention Yan Zhang et.al. 2412.05696v1 link
2024-12-07 Test-time Cost-and-Quality Controllable Arbitrary-Scale Super-Resolution with Variable Fourier Components Kazutoshi Akita et.al. 2412.05517v2 null

Depth Estimation

Publish Date Title Authors PDF Code
2024-12-18 Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation Rémi Marsal et.al. 2412.14103v1 null
2024-12-18 Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation Haotong Lin et.al. 2412.14015v1 null
2024-12-18 Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion Massimiliano Viola et.al. 2412.13389v1 null
2024-12-17 Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera Zhengdi Yu et.al. 2412.12861v1 null
2024-12-17 PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts Kun Guo et.al. 2412.12460v1 link
2024-12-16 V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations Jin-Cheng Jhang et.al. 2412.11412v1 null
2024-12-16 Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video Junkai Fan et.al. 2412.11395v1 null
2024-12-15 ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction Yi Feng et.al. 2412.11210v1 link
2024-12-14 MAL: Cluster-Masked and Multi-Task Pretraining for Enhanced xLSTM Vision Performance Wenjun Huang et.al. 2412.10730v1 null
2024-12-12 Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos Linyi Jin et.al. 2412.09621v1 null
2024-12-12 T-SVG: Text-Driven Stereoscopic Video Generation Qiao Jin et.al. 2412.09323v1 null
2024-12-12 Cross-View Completion Models are Zero-shot Correspondence Estimators Honggyu An et.al. 2412.09072v1 null
2024-12-11 BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation Shengze Wang et.al. 2412.08640v1 null
2024-12-11 Utilizing Multi-step Loss for Single Image Reflection Removal Abdelrahman Elnenaey et.al. 2412.08582v2 link
2024-12-11 Dense Depth from Event Focal Stack Kenta Horikawa et.al. 2412.08120v1 null
2024-12-10 Diffusion-Based Attention Warping for Consistent 3D Scene Editing Eyal Gomel et.al. 2412.07984v1 null
2024-12-10 Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation Kurt H. W. Stolle et.al. 2412.07966v1 null
2024-12-09 SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception Yaniv Benny et.al. 2412.06968v1 null
2024-12-09 Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving Xin Fei et.al. 2412.06777v1 link
2024-12-09 MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views Antoine Guédon et.al. 2412.06767v1 null
2024-12-09 On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events Jesse Hagenaars et.al. 2412.06359v1 null
2024-12-09 Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction Dongxu Wei et.al. 2412.06273v1 null
2024-12-09 Event fields: Capturing light fields at high speed, resolution, and dynamic range Ziyuan Qu et.al. 2412.06191v1 null
2024-12-08 GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion Karlo Koledic et.al. 2412.06080v1 null
2024-12-08 Prism: Semi-Supervised Multi-View Stereo with Monocular Structure Priors Alex Rich et.al. 2412.05771v1 null
2024-12-07 TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action Zixian Ma et.al. 2412.05479v2 null
2024-12-06 SimC3D: A Simple Contrastive 3D Pretraining Framework Using RGB Images Jiahua Dong et.al. 2412.05274v1 null
2024-12-06 Penetrative rotating magnetoconvection subject to lateral variations in temperature gradients Tirtharaj Barman et.al. 2412.05235v1 null
2024-12-06 PanoDreamer: 3D Panorama Synthesis from a Single Image Avinash Paliwal et.al. 2412.04827v1 link
2024-12-05 LAA-Net: A Physical-prior-knowledge Based Network for Robust Nighttime Depth Estimation Kebin Peng et.al. 2412.04666v1 null

View Synthesis

Publish Date Title Authors PDF Code
2024-12-18 Real-Time Position-Aware View Synthesis from Single-View Input Manu Gond et.al. 2412.14005v1 null
2024-12-18 Turbo-GS: Accelerating 3D Gaussian Fitting for High-Quality Radiance Fields Tao Lu et.al. 2412.13547v1 null
2024-12-17 StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models Yunzhi Yan et.al. 2412.13188v1 null
2024-12-17 CATSplat: Context-Aware Transformer with Spatial Guidance for Generalizable 3D Gaussian Splatting from A Single-View Image Wonseok Roh et.al. 2412.12906v1 null
2024-12-17 HyperGS: Hyperspectral 3D Gaussian Splatting Christopher Thirgood et.al. 2412.12849v1 null
2024-12-17 Optimize the Unseen -- Fast NeRF Cleanup with Free Space Prior Leo Segre et.al. 2412.12772v2 null
2024-12-16 PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting Cheng Zhang et.al. 2412.12096v1 link
2024-12-16 SweepEvGS: Event-Based 3D Gaussian Splatting for Macro and Micro Radiance Field Rendering from a Single Sweep Jingqian Wu et.al. 2412.11579v1 null
2024-12-16 SpatialMe: Stereo Video Conversion Using Depth-Warping and Blend-Inpainting Jiale Zhang et.al. 2412.11512v1 null
2024-12-16 MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes Ruijie Lu et.al. 2412.11457v1 null
2024-12-13 Probabilistic Inverse Cameras: Image to 3D via Multiview Geometry Rishabh Kabra et.al. 2412.10273v1 null
2024-12-13 SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians Siyun Liang et.al. 2412.10231v1 null
2024-12-13 GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion Jiapeng Tang et.al. 2412.10209v1 null
2024-12-13 TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views Liang Zhao et.al. 2412.10051v1 null
2024-12-13 SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video Jongmin Park et.al. 2412.09982v2 null
2024-12-12 PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields Sean Wu et.al. 2412.09680v1 link
2024-12-12 Representing Long Volumetric Video with Temporal Gaussian Hierarchy Zhen Xu et.al. 2412.09608v1 link
2024-12-12 Feat2GS: Probing Visual Foundation Models with Gaussian Splatting Yue Chen et.al. 2412.09606v1 null
2024-12-12 DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving Hao Lu et.al. 2412.09043v1 link
2024-12-11 Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views Songchun Zhang et.al. 2412.08412v2 null
2024-12-11 NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods Qiang Qu et.al. 2412.08029v1 link
2024-12-10 From an Image to a Scene: Learning to Imagine the World from a Million 360 Videos Matthew Wallingford et.al. 2412.07770v1 link
2024-12-10 SimVS: Simulating World Inconsistencies for Robust View Synthesis Alex Trevithick et.al. 2412.07696v1 null
2024-12-10 Faster and Better 3D Splatting via Group Training Chengbo Wang et.al. 2412.07608v1 null
2024-12-10 ResGS: Residual Densification of 3D Gaussian for Efficient Detail Recovery Yanzhe Lyu et.al. 2412.07494v1 null
2024-12-10 EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering Toshiya Yura et.al. 2412.07293v1 null
2024-12-09 MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds Zhenggang Tang et.al. 2412.06974v1 null
2024-12-09 MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views Antoine Guédon et.al. 2412.06767v1 null
2024-12-09 Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular Video Renlong Wu et.al. 2412.06424v1 link
2024-12-07 Template-free Articulated Gaussian Splatting for Real-time Reposable Dynamic View Synthesis Diwen Wan et.al. 2412.05570v1 null