Updated on 2024.12.19

Image Generation

Publish Date	Title	Authors	PDF	Code
2024-12-18	E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling	Zhihang Yuan et.al.	2412.14170v1	null
2024-12-18	Autoregressive Video Generation without Vector Quantization	Haoge Deng et.al.	2412.14169v1	link
2024-12-18	FashionComposer: Compositional Fashion Image Generation	Sihui Ji et.al.	2412.14168v1	null
2024-12-18	VideoDPO: Omni-Preference Alignment for Video Diffusion Generation	Runtao Liu et.al.	2412.14167v1	null
2024-12-18	Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation Localization	Xuekang Zhu et.al.	2412.13753v1	link
2024-12-18	Text2Relight: Creative Portrait Relighting with Text Guidance	Junuk Cha et.al.	2412.13734v1	null
2024-12-18	Diffusion models and stochastic quantisation in lattice field theory	Gert Aarts et.al.	2412.13704v1	null
2024-12-18	MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing	Chuang Yang et.al.	2412.13684v1	null
2024-12-18	Self-control: A Better Conditional Mechanism for Masked Autoregressive Model	Qiaoying Qu et.al.	2412.13635v1	null
2024-12-17	Posterior Mean Matching: Generative Modeling through Online Bayesian Inference	Sebastian Salazar et.al.	2412.13286v1	null
2024-12-17	F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration	Lu Liu et.al.	2412.13155v1	null
2024-12-17	Prompt Augmentation for Self-supervised Text-guided Image Manipulation	Rumeysa Bodur et.al.	2412.13081v1	null
2024-12-17	3D MedDiffusion: A 3D Medical Diffusion Model for Controllable and High-quality Medical Image Generation	Haoshen Wang et.al.	2412.13059v1	null
2024-12-17	Stable Diffusion is a Natural Cross-Modal Decoder for Layered AI-generated Image Compression	Ruijie Chen et.al.	2412.12982v1	null
2024-12-17	Attentive Eraser: Unleashing Diffusion Model's Object Removal Potential via Self-Attention Redirection Guidance	Wenhao Sun et.al.	2412.12974v2	link
2024-12-17	Unsupervised Region-Based Image Editing of Denoising Diffusion Models	Zixiang Li et.al.	2412.12912v1	null
2024-12-17	ArtAug: Enhancing Text-to-Image Generation through Synthesis-Understanding Interaction	Zhongjie Duan et.al.	2412.12888v2	link
2024-12-17	Rethinking Diffusion-Based Image Generators for Fundus Fluorescein Angiography Synthesis on Limited Data	Chengzhou Yu et.al.	2412.12778v1	null
2024-12-17	Guided and Variance-Corrected Fusion with One-shot Style Alignment for Large-Content Image Generation	Shoukun Sun et.al.	2412.12771v1	null
2024-12-17	Addressing Small and Imbalanced Medical Image Datasets Using Generative Models: A Comparative Study of DDPM and PGGANs with Random and Greedy K Sampling	Iman Khazrak et.al.	2412.12532v1	link
2024-12-17	Pattern Analogies: Learning to Perform Programmatic Image Edits by Analogy	Aditya Ganeshan et.al.	2412.12463v1	null
2024-12-17	Numerical Pruning for Efficient Autoregressive Models	Xuan Shen et.al.	2412.12441v1	null
2024-12-16	Efficient Scaling of Diffusion Transformers for Text-to-Image Generation	Hao Li et.al.	2412.12391v1	null
2024-12-16	OmniPrism: Learning Disentangled Visual Concept for Image Generation	Yangyang Li et.al.	2412.12242v1	null
2024-12-16	You Only Submit One Image to Find the Most Suitable Generative Model	Zhi Zhou et.al.	2412.12232v1	null
2024-12-16	Causal Diffusion Transformers for Generative Modeling	Chaorui Deng et.al.	2412.12095v2	link
2024-12-16	Instruction-based Image Manipulation by Watching How Things Move	Mingdeng Cao et.al.	2412.12087v1	null
2024-12-16	A LoRA is Worth a Thousand Pictures	Chenxi Liu et.al.	2412.12048v1	null
2024-12-16	IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation	Yiren Song et.al.	2412.11638v1	null
2024-12-16	3D$^2$-Actor: Learning Pose-Conditioned 3D-Aware Denoiser for Realistic Gaussian Avatar Modeling	Zichen Tang et.al.	2412.11599v1	link

Light Field Super Resolution

Publish Date	Title	Authors	PDF	Code
2024-10-14	SeedLM: Compressing LLM Weights into Seeds of Pseudo-Random Generators	Rasoul Shafipour et.al.	2410.10714v2	null
2024-09-26	LGFN: Lightweight Light Field Image Super-Resolution using Local Convolution Modulation and Global Attention Feature Extraction	Zhongxin Yu et.al.	2409.17759v1	null
2024-07-22	Efficient Multi-disparity Transformer for Light Field Image Super-resolution	Zeke Zexi Hu et.al.	2407.15329v1	null
2024-06-23	Mamba-based Light Field Super-Resolution with Efficient Subspace Scanning	Ruisheng Gao et.al.	2406.16083v1	null
2024-06-18	LFMamba: Light Field Image Super-Resolution with State Space Model	Wang xia et.al.	2406.12463v1	null
2024-05-11	Incorporating Degradation Estimation in Light Field Spatial Super-Resolution	Zeyu Xiao et.al.	2405.07012v1	null
2024-04-18	Pseudo-random generators using linear feedback shift registers with output extraction	Holger Nobach et.al.	2404.12011v1	null
2024-02-29	Unsupervised Learning of High-resolution Light Field Imaging via Beam Splitter-based Hybrid Lenses	Jianxin Lei et.al.	2402.19020v1	null
2024-02-16	Lightweight ciphers based on chaotic Map -- LFSR architectures	M. Garcia-Bosque et.al.	2402.10871v1	null
2024-01-01	Beyond Subspace Isolation: Many-to-Many Transformer for Light Field Image Super-resolution	Zeke Zexi Hu et.al.	2401.00740v1	null
2023-07-31	LFSR based RNG on low cost FPGA for QKD applications	Pooja Chandravanshi et.al.	2307.16431v1	null
2023-07-05	A Scheme to resist Fast Correlation Attack for Word Oriented LFSR based Stream Cipher	Subrata Nandi et.al.	2307.02182v1	null
2023-06-07	Security Analysis of WG-7 Lightweight Stream Cipher against Cube Attack	Bijoy Das et.al.	2306.04352v1	null
2023-05-30	Toward Real-World Light Field Super-Resolution	Zeyu Xiao et.al.	2305.18994v1	link
2023-05-12	A Lightweight Authentication Protocol against Modeling Attacks based on a Novel LFSR-APUF	Yao Wang et.al.	2305.07254v1	null
2023-04-30	On Rueppel's Linear Complexity Conjecture	Graham H. Norton et.al.	2305.00405v1	null
2023-04-20	NTIRE 2023 Challenge on Light Field Image Super-Resolution: Dataset, Methods and Results	Yingqian Wang et.al.	2304.10415v1	link
2023-04-11	Towards Power Characterization of FPGA Architectures To Enable Open-Source Power Estimation Using Micro-Benchmarks	Stefan Riesenberger et.al.	2304.05326v1	null
2023-03-16	Linear Codes from Simplicial Complexes over $\mathbb{F}_{2^n}$	Hongwei Liu et.al.	2303.09292v1	null
2023-03-05	A Provably Secure Strong PUF based on LWE: Construction and Implementation	Xiaodan Xi et.al.	2303.02802v1	null
2022-10-09	Learning Texture Transformer Network for Light Field Super-Resolution	Javeria Shabbir et.al.	2210.09293v1	null
2022-08-12	Software implementation of the SNOW 3G Generator on iOS and Android platforms	Jezabel Molina-Gil et.al.	2208.06147v1	null
2022-08-06	RFID authentication protocol based on a novel EPC Gen2 PRNG	Pino Caballero-Gil et.al.	2208.05345v1	null
2022-08-06	Weak Equivalents for Nonlinear Filtering Functions	Amparo Fúster-Sabater et.al.	2208.04734v1	null
2022-07-31	Ordered Orthogonal Array Construction Using LFSR Sequences	André Guerino Castoldi et.al.	2208.00333v1	null
2022-07-25	Sub-Aperture Feature Adaptation in Single Image Super-resolution Model for Light Field Imaging	Aupendu Kar et.al.	2207.11894v2	null
2022-06-09	A GPU-Accelerated Light-field Super-resolution Framework Based on Mixed Noise Model and Weighted Regularization	Trung-Hieu Tran et.al.	2206.05047v1	null
2022-01-02	Detail-Preserving Transformer for Light Field Image Super-Resolution	Shunzhou Wang et.al.	2201.00346v1	link
2021-11-07	Texture-enhanced Light Field Super-resolution with Spatio-Angular Decomposition Kernels	Zexi Hu et.al.	2111.04069v2	null
2021-10-07	Shift-BNN: Highly-Efficient Probabilistic Bayesian Neural Network Training via Memory-Friendly Pattern Retrieving	Qiyu Wan et.al.	2110.03553v1	null

Light Field Depth Estimation

Publish Date	Title	Authors	PDF	Code
2024-03-04	Iterative Occlusion-Aware Light Field Depth Estimation using 4D Geometrical Cues	Rui Lourenço et.al.	2403.02043v1	null
2023-05-28	OccCasNet: Occlusion-aware Cascade Cost Volume for Light Field Depth Estimation	Wentao Chao et.al.	2305.17710v1	link
2023-01-20	Unsupervised Light Field Depth Estimation via Multi-view Feature Matching with Occlusion Prediction	Shansi Zhang et.al.	2301.08433v2	null
2022-08-20	Learning Sub-Pixel Disparity Distribution for Light Field Depth Estimation	Wentao Chao et.al.	2208.09688v3	link
2022-03-29	Light Field Depth Estimation via Stitched Epipolar Plane Images	Ping Zhou et.al.	2203.15201v3	link
2022-03-29	Self-Supervised Light Field Depth Estimation Using Epipolar Plane Images	Kunyuan Li et.al.	2203.15171v1	null
2022-03-04	OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation	Peng Li et.al.	2203.02231v3	null
2022-03-03	Occlusion-Aware Cost Constructor for Light Field Depth Estimation	Yingqian Wang et.al.	2203.01576v1	link
2021-06-06	Occlusion-aware Unsupervised Learning of Depth from 4-D Light Fields	Jing Jin et.al.	2106.03043v2	link
2021-04-13	Learning Multi-modal Information for Robust Light Field Depth Estimation	Yongri Piao et.al.	2104.05971v1	link
2021-04-13	Dynamic Fusion Network For Light Field Depth Estimation	Yongri Piao et.al.	2104.05969v1	null
2020-09-09	View-consistent 4D Light Field Depth Estimation	Numair Khan et.al.	2009.04065v1	link
2020-07-09	EPI-based Oriented Relation Networks for Light Field Depth Estimation	Kunyuan Li et.al.	2007.04538v2	link
2019-09-19	Learning to Think Outside the Box: Wide-Baseline Light Field Depth Estimation with EPI-Shift	Titus Leistner et.al.	1909.09059v1	null
2019-07-31	Rapid Light Field Depth Estimation with Semi-Global Matching	Yuriy Anisimov et.al.	1907.13449v1	null
2018-04-06	EPINET: A Fully-Convolutional Neural Network Using Epipolar Geometry for Depth from Light Field Images	Changha Shin et.al.	1804.02379v1	null
2017-08-07	Accurate Light Field Depth Estimation with Superpixel Regularization over Partially Occluded Regions	Jie Chen et.al.	1708.01964v1	null
2016-08-15	Occlusion-Model Guided Anti-Occlusion Depth Estimation in Light Field	Hao Zhu et.al.	1608.04187v2	null

Light Field View Synthesis

Publish Date	Title	Authors	PDF	Code
2024-03-15	Den-SOFT: Dense Space-Oriented Light Field DataseT for 6-DOF Immersive Experience	Xiaohang Yu et.al.	2403.09973v1	null
2024-03-02	Neural radiance fields-based holography [Invited]	Minsung Kang et.al.	2403.01137v2	null
2023-11-14	Learning based Deep Disentangling Light Field Reconstruction and Disparity Estimation Application	Langqing Shi et.al.	2311.08129v1	null
2023-09-04	ImmersiveNeRF: Hybrid Radiance Fields for Unbounded Immersive Light Field Reconstruction	Xiaohang Yu et.al.	2309.01374v1	null
2023-07-06	RealLiFe: Real-Time Light Field Reconstruction via Hierarchical Sparse Gradient Descent	Yijie Deng et.al.	2307.03017v3	null
2022-12-23	Quantum correlation light-field microscope with extreme depth of field	Yingwen Zhang et.al.	2212.12582v2	null
2022-09-22	Fast Disparity Estimation from a Single Compressed Light Field Measurement	Emmanuel Martinez et.al.	2209.11342v1	null
2022-04-26	Acquiring a Dynamic Light Field through a Single-Shot Coded Image	Ryoya Mizuno et.al.	2204.12089v1	null
2022-04-01	Epipolar Focus Spectrum: A Novel Light Field Representation and Application in Dense-view Reconstruction	Yaning Li et.al.	2204.00193v1	null
2021-08-27	A Novel Hierarchical Light Field Coding Scheme Based on Hybrid Stacked Multiplicative Layers and Fourier Disparity Layers for Glasses-Free 3D Displays	Joshitha Ravishankar et.al.	2108.12399v1	null
2021-08-08	Efficient Light Field Reconstruction via Spatio-Angular Dense Network	Zexi Hu et.al.	2108.03635v1	link
2021-06-04	Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering	Vincent Sitzmann et.al.	2106.02634v2	null
2021-03-24	Light Field Reconstruction Using Convolutional Network on EPI and Extended Applications	Gaochang Wu et.al.	2103.13043v1	link
2021-02-14	Light Field Reconstruction via Deep Adaptive Fusion of Hybrid Lenses	Jing Jin et.al.	2102.07085v3	link
2020-12-03	Light-field view synthesis using convolutional block attention module	M. Shahzeb Khan Gul et.al.	2012.01900v2	null
2020-09-07	Light Field View Synthesis via Aperture Disparity and Warping Confidence Map	Nan Meng et.al.	2009.02978v2	null
2020-08-12	Self-supervised Light Field View Synthesis Using Cycle Consistency	Yang Chen et.al.	2008.05084v1	null
2020-07-23	Deep Spatial-angular Regularization for Compressive Light Field Reconstruction over Coded Apertures	Mantang Guo et.al.	2007.11882v1	link
2020-07-05	Spatial-Angular Attention Network for Light Field Reconstruction	Gaochang Wu et.al.	2007.02252v2	link
2020-05-13	A Generative Model for Generic Light Field Reconstruction	Paramanand Chandramouli et.al.	2005.06508v2	null
2020-03-20	Self-Supervised Light Field Reconstruction Using Shearlet Transform and Cycle Consistency	Yuan Gao et.al.	2003.09294v1	null
2020-03-19	DRST: Deep Residual Shearlet Transform for Densely Sampled Light Field Reconstruction	Yuan Gao et.al.	2003.08865v1	null
2020-02-26	Learning Light Field Angular Super-Resolution via a Geometry-Aware Network	Jing Jin et.al.	2002.11263v1	link
2020-01-14	Seeing the World in a Bag of Chips	Jeong Joon Park et.al.	2001.04642v2	null
2019-10-03	High-dimensional Dense Residual Convolutional Neural Network for Light Field Reconstruction	Nan Meng et.al.	1910.01426v4	link
2019-08-31	Deep Coarse-to-fine Dense Light Field Reconstruction with Flexible Sampling and Geometry-aware Fusion	Jing Jin et.al.	1909.01341v3	link
2019-02-17	LapEPI-Net: A Laplacian Pyramid EPI structure for Learning-based Dense Light Field Reconstruction	Gaochang Wu et.al.	1902.06221v1	null
2018-12-26	A Unified Learning Based Framework for Light Field Reconstruction from Coded Projections	Anil Kumar Vadathya et.al.	1812.10532v2	null
2018-10-20	A System for Acquiring, Processing, and Rendering Panoramic Light Field Stills for Virtual Reality	Ryan S. Overbeck et.al.	1810.08860v1	null
2018-06-14	Dense Light Field Reconstruction From Sparse Sampling Using Residual Network	Mantang Guo et.al.	1806.05506v2	null

Light Field Other Applications

Publish Date	Title	Authors	PDF	Code
2024-11-21	Segment Anything in Light Fields for Real-Time Applications via Constrained Prompting	Nikolai Goncharov et.al.	2411.13840v1	link
2023-03-13	View Adaptive Light Field Deblurring Networks with Depth Perception	Zeqi Shen et.al.	2303.06860v1	null
2022-04-28	Learning from Pixel-Level Noisy Label : A New Perspective for Light Field Saliency Detection	Mingtao Feng et.al.	2204.13456v1	link
2021-10-02	Light Field Saliency Detection with Dual Local Graph Learning andReciprocative Guidance	Nian Liu et.al.	2110.00698v1	link
2020-12-30	DUT-LFSaliency: Versatile Dataset and Light Field-to-RGB Saliency Detection	Yongri Piao et.al.	2012.15124v1	null
2020-10-25	Fast and Accurate Light Field Saliency Detection through Deep Encoding	Sahan Hemachandra et.al.	2010.13073v2	null
2019-06-19	Light Field Saliency Detection with Deep Convolutional Networks	Jun Zhang et.al.	1906.08331v2	link
2019-03-31	Fast and Full-Resolution Light Field Deblurring using a Deep Neural Network	Jonathan Samuel Lumentut et.al.	1904.00352v1	null
2017-12-20	Light Field Segmentation From Super-pixel Graph Representation	Xianqiang Lv et.al.	1712.07394v1	null
2017-11-29	Joint Blind Motion Deblurring and Depth Estimation of Light Field	Dongwoo Lee et.al.	1711.10918v2	null

Diffusion

Publish Date	Title	Authors	PDF	Code
2024-12-18	AniDoc: Animation Creation Made Easier	Yihao Meng et.al.	2412.14173v1	null
2024-12-18	E-CAR: Efficient Continuous Autoregressive Image Generation via Multistage Modeling	Zhihang Yuan et.al.	2412.14170v1	null
2024-12-18	Autoregressive Video Generation without Vector Quantization	Haoge Deng et.al.	2412.14169v1	link
2024-12-18	VideoDPO: Omni-Preference Alignment for Video Diffusion Generation	Runtao Liu et.al.	2412.14167v1	null
2024-12-18	AKiRa: Augmentation Kit on Rays for optical video generation	Xi Wang et.al.	2412.14158v1	null
2024-12-18	MCMat: Multiview-Consistent and Physically Accurate PBR Material Generation	Shenhao Zhu et.al.	2412.14148v1	null
2024-12-18	Measuring collective diffusion properties by counting particles in boxes	Adam Carter et.al.	2412.14122v1	null
2024-12-18	SurgSora: Decoupled RGBD-Flow Diffusion Model for Controllable Surgical Video Generation	Tong Chen et.al.	2412.14018v1	null
2024-12-18	A perturbative approach to the macroscopic fluctuation theory	Thierry Bodineau et.al.	2412.13991v1	null
2024-12-18	Double sine-Gordon class of universal coarsening dynamics in a spin-1 Bose gas	Ido Siovitz et.al.	2412.13986v1	null
2024-12-18	Gravitational wave astronomy and the expansion history of the Universe	Massimo Giovannini et.al.	2412.13968v1	null
2024-12-18	Comparative Analysis of Machine Learning-Based Imputation Techniques for Air Quality Datasets with High Missing Data Rates	Sen Yan et.al.	2412.13966v1	null
2024-12-18	Anomalous Diffusion of Superparamagnetic Walkers with Tailored Statistics	Alessia Gentili et.al.	2412.13960v1	null
2024-12-18	On Explaining Knowledge Distillation: Measuring and Visualising the Knowledge Transfer Process	Gereziher Adhane et.al.	2412.13943v1	null
2024-12-18	Spatio-Temporal Forecasting of PM2.5 via Spatial-Diffusion guided Encoder-Decoder Architecture	Malay Pandey et.al.	2412.13935v1	null
2024-12-18	Investigating the Effects of Diffusion-based Conditional Generative Speech Models Used for Speech Enhancement on Dysarthric Speech	Joanna Reszka et.al.	2412.13933v1	null
2024-12-18	Two-states Brownian particle in a Harmonic Potential	Giovanni Battista Carollo et.al.	2412.13921v1	null
2024-12-18	X-ray Binaries: a potential dominant contributor to the cosmic ray spectral knee structure	Hua Yue et.al.	2412.13889v1	null
2024-12-18	IDEQ: an improved diffusion model for the TSP	Mickael Basson et.al.	2412.13858v1	null
2024-12-18	Coupled Eikonal problems to model cardiac reentries in Purkinje network and myocardium	Samuele Brunati et.al.	2412.13837v1	null
2024-12-18	Object Style Diffusion for Generalized Object Detection in Urban Scene	Hao Li et.al.	2412.13815v1	null
2024-12-18	Spatial Brain Tumor Concentration Estimation for Individualized Radiotherapy Planning	Jonas Weidner et.al.	2412.13811v1	null
2024-12-18	SongEditor: Adapting Zero-Shot Song Generation Language Model as a Multi-Task Editor	Chenyu Yang et.al.	2412.13786v1	null
2024-12-18	Minimum nonlinearity for pattern-forming Turing instability in a mathematical autocatalytic model	Javier López-Pedrares et.al.	2412.13783v1	null
2024-12-18	Transport theory and spin-transfer physics in d-wave altermagnets	Ricardo Zarzuela et.al.	2412.13763v1	null
2024-12-18	Text2Relight: Creative Portrait Relighting with Text Guidance	Junuk Cha et.al.	2412.13734v1	null
2024-12-18	Diffusion models and stochastic quantisation in lattice field theory	Gert Aarts et.al.	2412.13704v1	null
2024-12-18	MMO-IG: Multi-Class and Multi-Scale Object Image Generation for Remote Sensing	Chuang Yang et.al.	2412.13684v1	null
2024-12-18	VIIS: Visible and Infrared Information Synthesis for Severe Low-light Image Enhancement	Chen Zhao et.al.	2412.13655v1	link
2024-12-18	Model-independent measurement of isospin diffusion in Ni-Ni systems at intermediate energy	C. Ciampi et.al.	2412.13648v1	null

Vision Transformer

Publish Date	Title	Authors	PDF	Code
2024-12-18	LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via Hierarchical Window Transformer	Yipeng Zhang et.al.	2412.13871v1	null
2024-12-17	Identification of Epileptic Spasms (ESES) Phases Using EEG Signals: A Vision Transformer Approach	Wei Gong et.al.	2412.13028v1	null
2024-12-17	Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-training	Mingjia Shi et.al.	2412.12496v1	link
2024-12-16	No More Adam: Learning Rate Scaling at Initialization is All You Need	Minghao Xu et.al.	2412.11768v2	link
2024-12-16	Flex-PE: Flexible and SIMD Multi-Precision Processing Element for AI Workloads	Mukul Lokhande et.al.	2412.11702v1	null
2024-12-16	HResFormer: Hybrid Residual Transformer for Volumetric Medical Image Segmentation	Sucheng Ren et.al.	2412.11458v1	null
2024-12-15	One-Shot Multilingual Font Generation Via ViT	Zhiheng Wang et.al.	2412.11342v1	null
2024-12-15	MoRe: Class Patch Attention Needs Regularization for Weakly Supervised Semantic Segmentation	Zhiwei Yang et.al.	2412.11076v1	link
2024-12-14	RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone	Mustafa Munir et.al.	2412.10995v1	link
2024-12-14	Boosting ViT-based MRI Reconstruction from the Perspectives of Frequency Modulation, Spatial Purification, and Scale Diversification	Yucong Meng et.al.	2412.10776v1	null
2024-12-14	One Pixel is All I Need	Deng Siqin et.al.	2412.10681v1	null
2024-12-13	Learning to Merge Tokens via Decoupled Embedding for Efficient Vision Transformers	Dong Hoon Lee et.al.	2412.10569v1	link
2024-12-13	VibrantVS: A high-resolution multi-task transformer for forest canopy height estimation	Tony Chang et.al.	2412.10351v1	null
2024-12-13	ManipGPT: Is Affordance Segmentation by Large Vision Models Enough for Articulated Object Manipulation?	Taewhan Kim et.al.	2412.10050v2	null
2024-12-13	T-GMSI: A transformer-based generative model for spatial interpolation under sparse measurements	Xiangxi Tian et.al.	2412.09886v1	null
2024-12-12	Agtech Framework for Cranberry-Ripening Analysis Using Vision Foundation Models	Faith Johnson et.al.	2412.09739v1	null
2024-12-12	From Noise to Nuance: Advances in Deep Generative Image Models	Benji Peng et.al.	2412.09656v1	null
2024-12-12	Vision Transformers for Efficient Indoor Pathloss Radio Map Prediction	Edvard Ghukasyan et.al.	2412.09507v1	null
2024-12-12	A Novel Ensemble-Based Deep Learning Model with Explainable AI for Accurate Kidney Disease Diagnosis	Md. Arifuzzaman et.al.	2412.09472v1	null
2024-12-12	Advancing Attribution-Based Neural Network Explainability through Relative Absolute Magnitude Layer-Wise Relevance Propagation and Multi-Component Evaluation	Davor Vukadin et.al.	2412.09311v1	link
2024-12-12	Selective Visual Prompting in Vision Mamba	Yifeng Yao et.al.	2412.08947v1	null
2024-12-12	Sensing for Space Safety and Sustainability: A Deep Learning Approach with Vision Transformers	Wenxuan Zhang et.al.	2412.08913v2	null
2024-12-11	SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation	Tapas Kumar Dutta et.al.	2412.08482v1	null
2024-12-11	A Review of Intelligent Device Fault Diagnosis Technologies Based on Machine Vision	Guiran Liu et.al.	2412.08148v1	null
2024-12-10	Comparative Analysis of Deep Learning Approaches for Harmful Brain Activity Detection Using EEG	Shivraj Singh Bhatti et.al.	2412.07878v1	null
2024-12-10	An Enhancement of CNN Algorithm for Rice Leaf Disease Image Classification in Mobile Applications	Kayne Uriel K. Rodrigo et.al.	2412.07182v1	null
2024-12-09	Static Key Attention in Vision	Zizhao Hu et.al.	2412.07049v1	null
2024-12-09	Vision transformer based Deep Learning of Topological indicators in Majorana Nanowires	Jacob R. Taylor et.al.	2412.06768v1	null
2024-12-09	Knowledge Transfer and Domain Adaptation for Fine-Grained Remote Sensing Image Segmentation	Shun Zhang et.al.	2412.06664v2	null
2024-12-09	Beyond Scalars: Concept-Based Alignment Analysis in Vision Transformers	Johanna Vielhaben et.al.	2412.06639v1	link

NeRF

Publish Date	Title	Authors	PDF	Code
2024-12-18	GraphAvatar: Compact Head Avatars with GNN-Generated 3D Gaussians	Xiaobao Wei et.al.	2412.13983v1	link
2024-12-17	EOGS: Gaussian Splatting for Earth Observation	Luca Savant Aira et.al.	2412.13047v1	null
2024-12-17	Optimize the Unseen -- Fast NeRF Cleanup with Free Space Prior	Leo Segre et.al.	2412.12772v2	null
2024-12-17	Towards a Training Free Approach for 3D Scene Editing	Vivek Madhavaram et.al.	2412.12766v1	null
2024-12-16	GS-ProCams: Gaussian Splatting-based Projector-Camera Systems	Qingyue Deng et.al.	2412.11762v1	null
2024-12-16	Sequence Matters: Harnessing Video Models in 3D Super-Resolution	Hyun-kyu Ko et.al.	2412.11525v2	null
2024-12-16	VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression	Qiang Hu et.al.	2412.11362v1	null
2024-12-13	NeRF-Texture: Synthesizing Neural Radiance Field Textures	Yi-Hua Huang et.al.	2412.10004v1	null
2024-12-13	Sharpening Your Density Fields: Spiking Neuron Aided Fast Geometry Learning	Yi Gu et.al.	2412.09881v1	null
2024-12-12	PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields	Sean Wu et.al.	2412.09680v1	link
2024-12-11	GN-FR:Generalizable Neural Radiance Fields for Flare Removal	Gopi Raju Matta et.al.	2412.08200v2	null
2024-12-11	NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods	Qiang Qu et.al.	2412.08029v1	link
2024-12-10	EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering	Toshiya Yura et.al.	2412.07293v1	null
2024-12-09	Diffusing Differentiable Representations	Yash Savani et.al.	2412.06981v1	null
2024-12-09	Dynamic EventNeRF: Reconstructing General Dynamic Scenes from Multi-view Event Cameras	Viktor Rudnev et.al.	2412.06770v1	null
2024-12-09	Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular Video	Renlong Wu et.al.	2412.06424v1	link
2024-12-09	Splatter-360: Generalizable 360$^{\circ}$ Gaussian Splatting for Wide-baseline Panoramic Images	Zheng Chen et.al.	2412.06250v1	link
2024-12-07	WATER-GS: Toward Copyright Protection for 3D Gaussian Splatting via Universal Watermarking	Yuqi Tan et.al.	2412.05695v1	null
2024-12-06	Perturb-and-Revise: Flexible 3D Editing with Generative Trajectories	Susung Hong et.al.	2412.05279v1	null
2024-12-06	MixedGaussianAvatar: Realistically and Geometrically Accurate Head Avatar via Mixed 2D-3D Gaussian Splatting	Peng Chen et.al.	2412.04955v2	link
2024-12-04	NeRF and Gaussian Splatting SLAM in the Wild	Fabian Schmidt et.al.	2412.03263v1	link
2024-12-01	SAGA: Surface-Aligned Gaussian Avatar	Ronghan Chen et.al.	2412.00845v1	null
2024-12-01	CtrlNeRF: The Generative Neural Radiation Fields for the Controllable Synthesis of High-fidelity 3D-Aware Images	Jian Liu et.al.	2412.00754v1	null
2024-11-30	Speedy-Splat: Fast 3D Gaussian Splatting with Sparse Pixels and Sparse Primitives	Alex Hanson et.al.	2412.00578v1	link
2024-11-30	Instant3dit: Multiview Inpainting for Fast Editing of 3D Objects	Amir Barda et.al.	2412.00518v1	null
2024-11-29	$C^{3}$-NeRF: Modeling Multiple Scenes via Conditional-cum-Continual Neural Radiance Fields	Prajwal Singh et.al.	2411.19903v1	null
2024-11-29	Gaussian Splashing: Direct Volumetric Rendering Underwater	Nir Mualem et.al.	2411.19588v1	null
2024-11-29	ReconDreamer: Crafting World Models for Driving Scene Reconstruction via Online Restoration	Chaojun Ni et.al.	2411.19548v1	null
2024-11-29	LokiTalk: Learning Fine-Grained and Generalizable Correspondences to Enhance NeRF-based Talking Head Synthesis	Tianqi Li et.al.	2411.19525v1	null
2024-11-28	SAMa: Material-aware 3D Selection and Segmentation	Michael Fischer et.al.	2411.19322v1	null

Super Resolution

Publish Date	Title	Authors	PDF	Code
2024-12-18	Super-Resolution Generative Adversarial Network for Data Compression of Direct Numerical Simulations	Ludovico Nista et.al.	2412.14150v1	null
2024-12-17	Learning of Patch-Based Smooth-Plus-Sparse Models for Image Reconstruction	Stanislas Ducotterd et.al.	2412.13070v1	link
2024-12-17	Super-Resolving Normalising Flows for Lattice Field Theories	Marc Bauer et.al.	2412.12842v1	null
2024-12-16	EGP3D: Edge-guided Geometric Preserving 3D Point Cloud Super-resolution for RGB-D camera	Zheng Fang et.al.	2412.11680v1	null
2024-12-16	CLIP-SR: Collaborative Linguistic and Image Processing for Super-Resolution	Bingwen Hu et.al.	2412.11609v1	null
2024-12-16	Sequence Matters: Harnessing Video Models in 3D Super-Resolution	Hyun-kyu Ko et.al.	2412.11525v2	null
2024-12-16	Block-Based Multi-Scale Image Rescaling	Jian Li et.al.	2412.11468v1	null
2024-12-16	Quantization of Climate Change Impacts on Renewable Energy Generation Capacity: A Super-Resolution Recurrent Diffusion Model	Xiaochong Dong et.al.	2412.11399v1	null
2024-12-14	A Staged Deep Learning Approach to Spatial Refinement in 3D Temporal Atmospheric Transport	M. Giselle Fernández-Godino et.al.	2412.10945v2	null
2024-12-13	SuperMark: Robust and Training-free Image Watermarking via Diffusion-based Super-Resolution	Runyi Hu et.al.	2412.10049v1	null
2024-12-13	A Single-Frame and Multi-Frame Cascaded Image Super-Resolution Method	Jing Sun et.al.	2412.09846v1	null
2024-12-13	Super-Resolution for Remote Sensing Imagery via the Coupling of a Variational Model and Deep Learning	Jing Sun et.al.	2412.09841v1	null
2024-12-11	RealOSR: Latent Unfolding Boosting Diffusion-based Real-world Omnidirectional Image Super-Resolution	Xuhan Sheng et.al.	2412.09646v1	null
2024-12-12	OFTSR: One-Step Flow for Image Super-Resolution with Tunable Fidelity-Realism Trade-offs	Yuanzhi Zhu et.al.	2412.09465v1	link
2024-12-12	A Plug-and-Play Algorithm for 3D Video Super-Resolution of Single-Photon LiDAR data	Alice Ruget et.al.	2412.09427v1	null
2024-12-12	Distribution free uncertainty quantification in neuroscience-inspired deep operators	Shailesh Garg et.al.	2412.09369v1	null
2024-12-12	Arbitrary-steps Image Super-resolution via Diffusion Inversion	Zongsheng Yue et.al.	2412.09013v1	link
2024-12-11	Fair Primal Dual Splitting Method for Image Inverse Problems	Yunfei Qu et.al.	2412.08613v1	null
2024-12-11	Efficient estimation of error bounds for quantum multiparametric imaging with constraints	Alexander Mikhalychev et.al.	2412.08199v2	null
2024-12-11	Statistical Downscaling via High-Dimensional Distribution Matching with Generative Models	Zhong Yi Wan et.al.	2412.08079v1	null
2024-12-10	MPSI: Mamba enhancement model for pixel-wise sequential interaction Image Super-Resolution	Yuchun He et.al.	2412.07222v1	null
2024-12-10	A Progressive Image Restoration Network for High-order Degradation Imaging in Remote Sensing	Yujie Feng et.al.	2412.07195v1	null
2024-12-10	Hero-SR: One-Step Diffusion for Super-Resolution with Human Perception Priors	Jiangang Wang et.al.	2412.07152v1	null
2024-12-10	RAP-SR: RestorAtion Prior Enhancement in Diffusion Models for Realistic Image Super-Resolution	Jiangang Wang et.al.	2412.07149v1	link
2024-12-09	Edge-SD-SR: Low Latency and Parameter Efficient On-device Super-Resolution with Stable Diffusion via Bidirectional Conditioning	Mehdi Noroozi et.al.	2412.06978v1	null
2024-12-09	Neural Garment Dynamic Super-Resolution	Meng Zhang et.al.	2412.06285v1	link
2024-12-09	MSCrackMamba: Leveraging Vision Mamba for Crack Detection in Fused Multispectral Imagery	Qinfeng Zhu et.al.	2412.06211v1	null
2024-12-09	You KAN Do It in a Single Shot: Plug-and-Play Methods with Single-Instance Priors	Yanqi Cheng et.al.	2412.06204v1	null
2024-12-07	Jointly RS Image Deblurring and Super-Resolution with Adjustable-Kernel and Multi-Domain Attention	Yan Zhang et.al.	2412.05696v1	link
2024-12-07	Test-time Cost-and-Quality Controllable Arbitrary-Scale Super-Resolution with Variable Fourier Components	Kazutoshi Akita et.al.	2412.05517v2	null

Depth Estimation

Publish Date	Title	Authors	PDF	Code
2024-12-18	Foundation Models Meet Low-Cost Sensors: Test-Time Adaptation for Rescaling Disparity for Zero-Shot Metric Depth Estimation	Rémi Marsal et.al.	2412.14103v1	null
2024-12-18	Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation	Haotong Lin et.al.	2412.14015v1	null
2024-12-18	Marigold-DC: Zero-Shot Monocular Depth Completion with Guided Diffusion	Massimiliano Viola et.al.	2412.13389v1	null
2024-12-17	Dyn-HaMR: Recovering 4D Interacting Hand Motion from a Dynamic Camera	Zhengdi Yu et.al.	2412.12861v1	null
2024-12-17	PromptDet: A Lightweight 3D Object Detection Framework with LiDAR Prompts	Kun Guo et.al.	2412.12460v1	link
2024-12-16	V-MIND: Building Versatile Monocular Indoor 3D Detector with Diverse 2D Annotations	Jin-Cheng Jhang et.al.	2412.11412v1	null
2024-12-16	Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video	Junkai Fan et.al.	2412.11395v1	null
2024-12-15	ViPOcc: Leveraging Visual Priors from Vision Foundation Models for Single-View 3D Occupancy Prediction	Yi Feng et.al.	2412.11210v1	link
2024-12-14	MAL: Cluster-Masked and Multi-Task Pretraining for Enhanced xLSTM Vision Performance	Wenjun Huang et.al.	2412.10730v1	null
2024-12-12	Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos	Linyi Jin et.al.	2412.09621v1	null
2024-12-12	T-SVG: Text-Driven Stereoscopic Video Generation	Qiao Jin et.al.	2412.09323v1	null
2024-12-12	Cross-View Completion Models are Zero-shot Correspondence Estimators	Honggyu An et.al.	2412.09072v1	null
2024-12-11	BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation	Shengze Wang et.al.	2412.08640v1	null
2024-12-11	Utilizing Multi-step Loss for Single Image Reflection Removal	Abdelrahman Elnenaey et.al.	2412.08582v2	link
2024-12-11	Dense Depth from Event Focal Stack	Kenta Horikawa et.al.	2412.08120v1	null
2024-12-10	Diffusion-Based Attention Warping for Consistent 3D Scene Editing	Eyal Gomel et.al.	2412.07984v1	null
2024-12-10	Balancing Shared and Task-Specific Representations: A Hybrid Approach to Depth-Aware Video Panoptic Segmentation	Kurt H. W. Stolle et.al.	2412.07966v1	null
2024-12-09	SphereUFormer: A U-Shaped Transformer for Spherical 360 Perception	Yaniv Benny et.al.	2412.06968v1	null
2024-12-09	Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving	Xin Fei et.al.	2412.06777v1	link
2024-12-09	MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views	Antoine Guédon et.al.	2412.06767v1	null
2024-12-09	On-Device Self-Supervised Learning of Low-Latency Monocular Depth from Only Events	Jesse Hagenaars et.al.	2412.06359v1	null
2024-12-09	Omni-Scene: Omni-Gaussian Representation for Ego-Centric Sparse-View Scene Reconstruction	Dongxu Wei et.al.	2412.06273v1	null
2024-12-09	Event fields: Capturing light fields at high speed, resolution, and dynamic range	Ziyuan Qu et.al.	2412.06191v1	null
2024-12-08	GVDepth: Zero-Shot Monocular Depth Estimation for Ground Vehicles based on Probabilistic Cue Fusion	Karlo Koledic et.al.	2412.06080v1	null
2024-12-08	Prism: Semi-Supervised Multi-View Stereo with Monocular Structure Priors	Alex Rich et.al.	2412.05771v1	null
2024-12-07	TACO: Learning Multi-modal Action Models with Synthetic Chains-of-Thought-and-Action	Zixian Ma et.al.	2412.05479v2	null
2024-12-06	SimC3D: A Simple Contrastive 3D Pretraining Framework Using RGB Images	Jiahua Dong et.al.	2412.05274v1	null
2024-12-06	Penetrative rotating magnetoconvection subject to lateral variations in temperature gradients	Tirtharaj Barman et.al.	2412.05235v1	null
2024-12-06	PanoDreamer: 3D Panorama Synthesis from a Single Image	Avinash Paliwal et.al.	2412.04827v1	link
2024-12-05	LAA-Net: A Physical-prior-knowledge Based Network for Robust Nighttime Depth Estimation	Kebin Peng et.al.	2412.04666v1	null

View Synthesis

Publish Date	Title	Authors	PDF	Code
2024-12-18	Real-Time Position-Aware View Synthesis from Single-View Input	Manu Gond et.al.	2412.14005v1	null
2024-12-18	Turbo-GS: Accelerating 3D Gaussian Fitting for High-Quality Radiance Fields	Tao Lu et.al.	2412.13547v1	null
2024-12-17	StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models	Yunzhi Yan et.al.	2412.13188v1	null
2024-12-17	CATSplat: Context-Aware Transformer with Spatial Guidance for Generalizable 3D Gaussian Splatting from A Single-View Image	Wonseok Roh et.al.	2412.12906v1	null
2024-12-17	HyperGS: Hyperspectral 3D Gaussian Splatting	Christopher Thirgood et.al.	2412.12849v1	null
2024-12-17	Optimize the Unseen -- Fast NeRF Cleanup with Free Space Prior	Leo Segre et.al.	2412.12772v2	null
2024-12-16	PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting	Cheng Zhang et.al.	2412.12096v1	link
2024-12-16	SweepEvGS: Event-Based 3D Gaussian Splatting for Macro and Micro Radiance Field Rendering from a Single Sweep	Jingqian Wu et.al.	2412.11579v1	null
2024-12-16	SpatialMe: Stereo Video Conversion Using Depth-Warping and Blend-Inpainting	Jiale Zhang et.al.	2412.11512v1	null
2024-12-16	MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes	Ruijie Lu et.al.	2412.11457v1	null
2024-12-13	Probabilistic Inverse Cameras: Image to 3D via Multiview Geometry	Rishabh Kabra et.al.	2412.10273v1	null
2024-12-13	SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians	Siyun Liang et.al.	2412.10231v1	null
2024-12-13	GAF: Gaussian Avatar Reconstruction from Monocular Videos via Multi-view Diffusion	Jiapeng Tang et.al.	2412.10209v1	null
2024-12-13	TSGaussian: Semantic and Depth-Guided Target-Specific Gaussian Splatting from Sparse Views	Liang Zhao et.al.	2412.10051v1	null
2024-12-13	SplineGS: Robust Motion-Adaptive Spline for Real-Time Dynamic 3D Gaussians from Monocular Video	Jongmin Park et.al.	2412.09982v2	null
2024-12-12	PBR-NeRF: Inverse Rendering with Physics-Based Neural Fields	Sean Wu et.al.	2412.09680v1	link
2024-12-12	Representing Long Volumetric Video with Temporal Gaussian Hierarchy	Zhen Xu et.al.	2412.09608v1	link
2024-12-12	Feat2GS: Probing Visual Foundation Models with Gaussian Splatting	Yue Chen et.al.	2412.09606v1	null
2024-12-12	DrivingRecon: Large 4D Gaussian Reconstruction Model For Autonomous Driving	Hao Lu et.al.	2412.09043v1	link
2024-12-11	Pragmatist: Multiview Conditional Diffusion Models for High-Fidelity 3D Reconstruction from Unposed Sparse Views	Songchun Zhang et.al.	2412.08412v2	null
2024-12-11	NeRF-NQA: No-Reference Quality Assessment for Scenes Generated by NeRF and Neural View Synthesis Methods	Qiang Qu et.al.	2412.08029v1	link
2024-12-10	From an Image to a Scene: Learning to Imagine the World from a Million 360 Videos	Matthew Wallingford et.al.	2412.07770v1	link
2024-12-10	SimVS: Simulating World Inconsistencies for Robust View Synthesis	Alex Trevithick et.al.	2412.07696v1	null
2024-12-10	Faster and Better 3D Splatting via Group Training	Chengbo Wang et.al.	2412.07608v1	null
2024-12-10	ResGS: Residual Densification of 3D Gaussian for Efficient Detail Recovery	Yanzhe Lyu et.al.	2412.07494v1	null
2024-12-10	EventSplat: 3D Gaussian Splatting from Moving Event Cameras for Real-time Rendering	Toshiya Yura et.al.	2412.07293v1	null
2024-12-09	MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds	Zhenggang Tang et.al.	2412.06974v1	null
2024-12-09	MAtCha Gaussians: Atlas of Charts for High-Quality Geometry and Photorealism From Sparse Views	Antoine Guédon et.al.	2412.06767v1	null
2024-12-09	Deblur4DGS: 4D Gaussian Splatting from Blurry Monocular Video	Renlong Wu et.al.	2412.06424v1	link
2024-12-07	Template-free Articulated Gaussian Splatting for Real-time Reposable Dynamic View Synthesis	Diwen Wan et.al.	2412.05570v1	null

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Updated on 2024.12.19

Image Generation

Light Field Super Resolution

Light Field Depth Estimation

Light Field View Synthesis

Light Field Other Applications

Diffusion

Vision Transformer

NeRF

Super Resolution

Depth Estimation

View Synthesis

Files

README.md

Latest commit

History

README.md

File metadata and controls

Updated on 2024.12.19

Image Generation

Light Field Super Resolution

Light Field Depth Estimation

Light Field View Synthesis

Light Field Other Applications

Diffusion

Vision Transformer

NeRF

Super Resolution

Depth Estimation

View Synthesis