Awesome_VCM

Paper list of visual data compression for machines, including image/video coding for machines, feature compression, collaborative coding, point cloud compression for machines and image/video coding for machines with large multimodal models.

Maintained by: Lingyu Zhu and Peilin Chen

Overview

Notes

If you find papers relevant to this topic, please share them as a discussion post.
Some papers may simultaneously belong to multiple subfields, and we categorize them accordingly to reflect these overlaps.
Looking forward to your kind contributions and discussions! Many thanks!

Updated on 2024.11.19

Table of Contents

Image/Video Coding for Machines
Feature Compression
Collaborative Coding
Point Cloud for Machines
Image/Video Coding Meets Large Multimodal Models

Image/Video Coding for Machines

Publish Date	Title	Authors	PDF	Code
2021.08	Digital Retina: A Way to Make the City Brain More Efficient by Visual Coding	Wen Gao et.al.	TCSVT	null
2024.08	Preprocessing Enhanced Image Compression for Machine Vision	Guo Lu et.al.	TCSVT	null
2024.08	A coding framework and benchmark towards low-bitrate video understanding	Yuan Tian et.al.	TPAMI	null
2024.08	Privacy-Preserving Autoencoder for Collaborative Object Detection	Bardia Azizian et.al.	TIP	null
2024.07	Task-Switchable Pre-Processor for Image Compression for Multiple Machine Vision Tasks	Mingyi Yang et.al.	TCSVT	null
2024.07	Region-of-Interest-Based Video Coding for Machines	Olgierd Stankiewicz et.al.	ICMEW	null
2024.07	Vnvc: A versatile neural video coding framework for efficient human-machine vision	Xihua Sheng et.al.	TPAMI	null
2024.07	Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative Analytics	Wenhan Yang et.al.	TPAMI	null
2024.07	Rate-Distortion-Cognition Controllable Versatile Neural Image Compression	Jinming Liu et.al.	2407.11700	null
2024.06	SMC++: Masked Learning of Unsupervised Video Semantic Compression	Yuan Tian et.al.	2406.04765	null
2024.06	Machine Perception-Driven Facial Image Compression: A Layered Generative Approach	Yuefeng Zhang et.al.	TCSVT	null
2024.06	Human–Machine Collaborative Image Compression Method Based on Implicit Neural Representation	Huanyang Li et.al.	J EM SEL TOP C	null
2024.05	Privacy-preserving with Flexible Autoencoder for Video Coding for Machines	Aorui Gou et.al.	ISCAS	null
2024.04	Deep Video Codec Control for Vision Model	Christoph Reich et.al.	2308.16215	null
2024.04	A Perspective on Deep Vision Performance with Standard Image and Video Codecs	Christoph Reich et.al.	CVPRW	null
2024.04	Task-Aware Encoder Control for Deep Video Compression	Xingtong Ge et.al.	CVPR	null
2023.12	Image Coding for Machines based on Non-Uniform Importance Allocation	Yunpeng Qi et.al.	VCIP	null
2023.12	Scalable Face Image Coding via StyleGAN Prior: Towards Compression for Human-Machine Collaborative Vision	Qi Mao et.al.	TIP	null
2023.10	Sketch Assisted Face Image Coding for Human and Machine Vision: A Joint Training Approach	Xin Fang et.al.	TCSVT	null
2023.10	Deepsvc: Deep scalable video coding for both machine and human vision	Hongbin Lin et.al.	ACM MM	null
2023.10	ICMH-Net: Neural Image Compression Towards both Machine Vision and Human Vision	Lei Liu et.al.	ACM MM	null
2023.10	Video Object Detection From Compressed Formats for Modern Lightweight Consumer Electronics	Sangeeta Yadav et.al.	TCE	null
2023.08	Unified Architecture Adaptation for Compressed Domain Semantic Inference	Zhihao Duan et.al.	TCSVT	null
2023.06	Semantic Preprocessor for Image Compression for Machines	Mingyi Yang et.al.	ICASSP	null
2023.05	Prompt-icm: A unified framework towards image coding for machines with task-driven prompts	Ruoyu Feng et.al.	2305.02578	null
2023.05	Fast VVC Intra Encoding for Video Coding for Machines	Aorui Gou et.al.	ISCAS	null
2022.03	Scalable Image Coding for Humans and Machines	Hyomin Choi et.al.	TIP	null
2021.07	Thousand to One: Semantic Prior Modeling for Conceptual Coding	Jianhui Chang et.al.	ICME	null
2021.07	Visual Analysis Motivated Rate-Distortion Model for Image Coding	Zhimeng Huang et.al.	ICME	null
2021.07	Learned Image Coding for Machines: A Content-Adaptive Approach	Nam Le et.al.	ICME	null
2021.05	End-to-end optimized image compression for machines, a study	Lahiru D. Chamain et.al.	DCC	null
2021.05	Collaborative Intelligence: Challenges and Opportunities	Ivan V. Bajić et.al.	ICASSP	null
2021.05	Recent Standard Development Activities on Video Coding for Machines	Wen Gao et.al.	2105.12653	null
2021.05	Image Coding For Machines: an End-To-End Learned Approach	Nam Le et.al.	ICASSP	null
2021.02	Pareto-Optimal Bit Allocation for Collaborative Intelligence	Saeed Ranjbar Alvar et.al.	TIP	null
2020.11	Task-Aware Quantization Network for JPEG Image Compression	Jinyoung Choi et.al.	ECCV	null
2020.10	Semantic-Preserving Image Compression	Neel Patwa et.al.	ICIP	null
2020.08	Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics	Lingyu Duan et.al.	TIP	null
2020.07	Towards Coding For Human And Machine Vision: A Scalable Image Coding Approach	Yueyu Hu et.al.	ICME	null
2020.06	Image Compression With Encoder-Decoder Matched Semantic Segmentation	Trinh Man Hoang et.al.	CVPRW	null
2020.05	Back-And-Forth Prediction for Deep Tensor Compression	Hyomin Choi et.al.	ICASSP	null
2020.05	Bit Allocation for Multi-Task Collaborative Intelligence	Saeed Ranjbar Alvar et.al.	ICASSP	null
2020.01	Towards Efficient Front-End Visual Sensing for Digital Retina: A Model-Centric Paradigm	Yihang Lou et.al.	TMM	null
2019.10	AdaCompress: Adaptive Compression for Online Computer Vision Services	Hongshan Li et.al.	ACM MM	null
2019.08	Multi-Task Learning with Compressible Features for Collaborative Intelligence	Saeed Ranjbar Alvar et.al.	ICIP	null
2019.08	Layered conceptual image compression via deep semantic synthesis	Jianhui Chang et.al.	ICIP	null
2019.05	DSSLIC: Deep Semantic Segmentation-based Layered Image Compression	Mohammad Akbari et.al.	ICASSP	null
2019.05	Pixel-level Texture Segmentation Based AV1 Video Compression	Di Chen et.al.	ICASSP	null

Feature Compression

Publish Date	Title	Authors	PDF	Code
2024.05	Split Computing With Scalable Feature Compression for Visual Analytics on the Edge	Zhongzheng Yuan et.al.	TMM	null
2024.04	Hierarchical Image Feature Compression for Machines via Feature Sparsity Learning	Ding Ding et.al.	SPL	null
2023.07	Residual based hierarchical feature compression for multi-task machine vision	Chaoran Chen et.al.	ICME	null
2023.06	Learnt mutual feature compression for machine vision	Tie Liu et.al.	ICASSP	null
2021.07	Rate-Distortion Optimized Hierarchical Deep Feature Compression	Ademola Ikusan et.al.	ICME	null
2021.06	Semantics-to-Signal Scalable Image Compression with Learned Revertible Representations	Kang Liu et.al.	IJCV	null
2021.02	Pareto-Optimal Bit Allocation for Collaborative Intelligence	Saeed Ranjbar Alvar et.al.	TIP	null
2020.12	Sensitivity-Aware Bit Allocation for Intermediate Deep Feature Compression	Yuzhang Hu et.al.	VCIP)	null
2020.10	Data Representation in Hybrid Coding Framework for Feature Maps Compression	Zhuo Chen et.al.	ICIP	null
2020.05	Deriving Compact Feature Representations Via Annealed Contraction	Muhammad A. Shah et.al.	ICASSP	null
2020.05	Bit Allocation for Multi-Task Collaborative Intelligence	Saeed Ranjbar Alvar et.al.	ICASSP	null
2019.10	Lossy Intermediate Deep Learning Feature Compression and Evaluation	Zhuo Chen et.al.	ACM MM	null
2019.09	Toward Intelligent Sensing: Intermediate Deep Feature Compression	Zhuo Chen et.al.	TIP	null
2019.08	Multi-Task Learning with Compressible Features for Collaborative Intelligence	Saeed Ranjbar Alvar et.al.	ICIP	null
2019.05	Pixel-level Texture Segmentation Based AV1 Video Compression	Di Chen et.al.	ICASSP	null

Collaborative Coding

Publish Date	Title	Authors	PDF	Code
2024.07	Vnvc: A versatile neural video coding framework for efficient human-machine vision	Xihua Sheng et.al.	TPAMI	null
2024.06	Human–Machine Collaborative Image Compression Method Based on Implicit Neural Representation	Huanyang Li et.al.	J EM SEL TOP C	null
2024.02	Scalable Human-Machine Point Cloud Compression	Mateen Ulhaq et.al.	PCS	null
2023.12	Scalable Face Image Coding via StyleGAN Prior: Towards Compression for Human-Machine Collaborative Vision	Qi Mao et.al.	TIP	null
2023.10	Sketch Assisted Face Image Coding for Human and Machine Vision: A Joint Training Approach	Xin Fang et.al.	TCSVT	null
2023.10	Deepsvc: Deep scalable video coding for both machine and human vision	Hongbin Lin et.al.	ACM MM	null
2023.10	ICMH-Net: Neural Image Compression Towards both Machine Vision and Human Vision	Lei Liu et.al.	ACM MM	null
2021.06	Semantics-to-Signal Scalable Image Compression with Learned Revertible Representations	Kang Liu et.al.	IJCV	null
2021.02	Pareto-Optimal Bit Allocation for Collaborative Intelligence	Saeed Ranjbar Alvar et.al.	TIP	null
2020.07	Towards Coding For Human And Machine Vision: A Scalable Image Coding Approach	Yueyu Hu et.al.	ICME	null
2020.05	Bit Allocation for Multi-Task Collaborative Intelligence	Saeed Ranjbar Alvar et.al.	ICASSP	null
2019.08	Multi-Task Learning with Compressible Features for Collaborative Intelligence	Saeed Ranjbar Alvar et.al.	ICIP	null
2019.05	Pixel-level Texture Segmentation Based AV1 Video Compression	Di Chen et.al.	ICASSP	null

Point Cloud Compression for Machines

Publish Date	Title	Authors	PDF	Code
2024.07	Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor	Lei Liu et.al.	IJCAI	null
2024.02	Scalable Human-Machine Point Cloud Compression	Mateen Ulhaq et.al.	PCS	null
2023.10	Deep learning-based compressed domain point cloud classification	Abdelrahman Seleem et.al.	ICIP	null

Image/Video Coding Meets Large Multimodal Models

Publish Date	Title	Authors	PDF	Code
2024.11	Large Language Models for Lossless Image Compression: Next-Pixel Prediction in Language Space is All You Need	Kecheng Chen et.al.	2411.12448	null
2024.10	High Efficiency Image Compression for Large Visual-Language Models	Binzhe Li et.al.	TCSVT	null
2024.08	Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs	Jinming Liu et.al.	2408.08575	null
2024.08	When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding	Pingping Zhang et.al.	2408.08093	null
2024.07	ComNeck: Bridging Compressed Image Latents and Multimodal LLMs via Universal Transform-Neck	Chia-Hao Kao et.al.	2407.19651	null

Name		Name	Last commit message	Last commit date
Latest commit History 57 Commits
src		src
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Awesome_VCM

Overview

Notes

Updated on 2024.11.19

Image/Video Coding for Machines

Feature Compression

Collaborative Coding

Point Cloud Compression for Machines

Image/Video Coding Meets Large Multimodal Models

About

Releases

Packages

lingyzhu0101/Awesome_VCM

Folders and files

Latest commit

History

Repository files navigation

Awesome_VCM

Overview

Notes

Updated on 2024.11.19

Image/Video Coding for Machines

Feature Compression

Collaborative Coding

Point Cloud Compression for Machines

Image/Video Coding Meets Large Multimodal Models

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages