[Paper List'24] Paper List of Visual Data Coding for Machines, including Image/Video Coding for Machines, Feature Compression, Point Cloud Compression for Machines and Image/Video Coding for Machines with Large Multimodal Models

lingyzhu0101/Awesome_VCM

Awesome_VCM

Paper list of visual data compression for machines, including image/video coding for machines, feature compression, collaborative coding, point cloud compression for machines and image/video coding for machines with large multimodal models.

Maintained by: Lingyu Zhu and Peilin Chen

Notes

  • If you find papers relevant to this topic, please share them as a discussion post.
  • Some papers belong to more than one subfield; these are listed under every category they fit.
  • Contributions and discussions are welcome. Many thanks!

Updated on 2024.11.19

Table of Contents
  1. Image/Video Coding for Machines
  2. Feature Compression
  3. Collaborative Coding
  4. Point Cloud Compression for Machines
  5. Image/Video Coding Meets Large Multimodal Models

Image/Video Coding for Machines

Publish Date Title Authors PDF Code
2021.08 Digital Retina: A Way to Make the City Brain More Efficient by Visual Coding Wen Gao et.al. TCSVT null
2024.08 Preprocessing Enhanced Image Compression for Machine Vision Guo Lu et.al. TCSVT null
2024.08 A coding framework and benchmark towards low-bitrate video understanding Yuan Tian et.al. TPAMI null
2024.08 Privacy-Preserving Autoencoder for Collaborative Object Detection Bardia Azizian et.al. TIP null
2024.07 Task-Switchable Pre-Processor for Image Compression for Multiple Machine Vision Tasks Mingyi Yang et.al. TCSVT null
2024.07 Region-of-Interest-Based Video Coding for Machines Olgierd Stankiewicz et.al. ICMEW null
2024.07 VNVC: A versatile neural video coding framework for efficient human-machine vision Xihua Sheng et.al. TPAMI null
2024.07 Video Coding for Machines: Compact Visual Representation Compression for Intelligent Collaborative Analytics Wenhan Yang et.al. TPAMI null
2024.07 Rate-Distortion-Cognition Controllable Versatile Neural Image Compression Jinming Liu et.al. 2407.11700 null
2024.06 SMC++: Masked Learning of Unsupervised Video Semantic Compression Yuan Tian et.al. 2406.04765 null
2024.06 Machine Perception-Driven Facial Image Compression: A Layered Generative Approach Yuefeng Zhang et.al. TCSVT null
2024.06 Human–Machine Collaborative Image Compression Method Based on Implicit Neural Representation Huanyang Li et.al. JETCAS null
2024.05 Privacy-preserving with Flexible Autoencoder for Video Coding for Machines Aorui Gou et.al. ISCAS null
2024.04 Deep Video Codec Control for Vision Models Christoph Reich et.al. 2308.16215 null
2024.04 A Perspective on Deep Vision Performance with Standard Image and Video Codecs Christoph Reich et.al. CVPRW null
2024.04 Task-Aware Encoder Control for Deep Video Compression Xingtong Ge et.al. CVPR null
2023.12 Image Coding for Machines based on Non-Uniform Importance Allocation Yunpeng Qi et.al. VCIP null
2023.12 Scalable Face Image Coding via StyleGAN Prior: Towards Compression for Human-Machine Collaborative Vision Qi Mao et.al. TIP null
2023.10 Sketch Assisted Face Image Coding for Human and Machine Vision: A Joint Training Approach Xin Fang et.al. TCSVT null
2023.10 DeepSVC: Deep scalable video coding for both machine and human vision Hongbin Lin et.al. ACM MM null
2023.10 ICMH-Net: Neural Image Compression Towards both Machine Vision and Human Vision Lei Liu et.al. ACM MM null
2023.10 Video Object Detection From Compressed Formats for Modern Lightweight Consumer Electronics Sangeeta Yadav et.al. TCE null
2023.08 Unified Architecture Adaptation for Compressed Domain Semantic Inference Zhihao Duan et.al. TCSVT null
2023.06 Semantic Preprocessor for Image Compression for Machines Mingyi Yang et.al. ICASSP null
2023.05 Prompt-ICM: A unified framework towards image coding for machines with task-driven prompts Ruoyu Feng et.al. 2305.02578 null
2023.05 Fast VVC Intra Encoding for Video Coding for Machines Aorui Gou et.al. ISCAS null
2022.03 Scalable Image Coding for Humans and Machines Hyomin Choi et.al. TIP null
2021.07 Thousand to One: Semantic Prior Modeling for Conceptual Coding Jianhui Chang et.al. ICME null
2021.07 Visual Analysis Motivated Rate-Distortion Model for Image Coding Zhimeng Huang et.al. ICME null
2021.07 Learned Image Coding for Machines: A Content-Adaptive Approach Nam Le et.al. ICME null
2021.05 End-to-end optimized image compression for machines, a study Lahiru D. Chamain et.al. DCC null
2021.05 Collaborative Intelligence: Challenges and Opportunities Ivan V. Bajić et.al. ICASSP null
2021.05 Recent Standard Development Activities on Video Coding for Machines Wen Gao et.al. 2105.12653 null
2021.05 Image Coding For Machines: an End-To-End Learned Approach Nam Le et.al. ICASSP null
2021.02 Pareto-Optimal Bit Allocation for Collaborative Intelligence Saeed Ranjbar Alvar et.al. TIP null
2020.11 Task-Aware Quantization Network for JPEG Image Compression Jinyoung Choi et.al. ECCV null
2020.10 Semantic-Preserving Image Compression Neel Patwa et.al. ICIP null
2020.08 Video Coding for Machines: A Paradigm of Collaborative Compression and Intelligent Analytics Lingyu Duan et.al. TIP null
2020.07 Towards Coding For Human And Machine Vision: A Scalable Image Coding Approach Yueyu Hu et.al. ICME null
2020.06 Image Compression With Encoder-Decoder Matched Semantic Segmentation Trinh Man Hoang et.al. CVPRW null
2020.05 Back-And-Forth Prediction for Deep Tensor Compression Hyomin Choi et.al. ICASSP null
2020.05 Bit Allocation for Multi-Task Collaborative Intelligence Saeed Ranjbar Alvar et.al. ICASSP null
2020.01 Towards Efficient Front-End Visual Sensing for Digital Retina: A Model-Centric Paradigm Yihang Lou et.al. TMM null
2019.10 AdaCompress: Adaptive Compression for Online Computer Vision Services Hongshan Li et.al. ACM MM null
2019.08 Multi-Task Learning with Compressible Features for Collaborative Intelligence Saeed Ranjbar Alvar et.al. ICIP null
2019.08 Layered conceptual image compression via deep semantic synthesis Jianhui Chang et.al. ICIP null
2019.05 DSSLIC: Deep Semantic Segmentation-based Layered Image Compression Mohammad Akbari et.al. ICASSP null
2019.05 Pixel-level Texture Segmentation Based AV1 Video Compression Di Chen et.al. ICASSP null

Feature Compression

Publish Date Title Authors PDF Code
2024.05 Split Computing With Scalable Feature Compression for Visual Analytics on the Edge Zhongzheng Yuan et.al. TMM null
2024.04 Hierarchical Image Feature Compression for Machines via Feature Sparsity Learning Ding Ding et.al. SPL null
2023.07 Residual based hierarchical feature compression for multi-task machine vision Chaoran Chen et.al. ICME null
2023.06 Learnt mutual feature compression for machine vision Tie Liu et.al. ICASSP null
2021.07 Rate-Distortion Optimized Hierarchical Deep Feature Compression Ademola Ikusan et.al. ICME null
2021.06 Semantics-to-Signal Scalable Image Compression with Learned Revertible Representations Kang Liu et.al. IJCV null
2021.02 Pareto-Optimal Bit Allocation for Collaborative Intelligence Saeed Ranjbar Alvar et.al. TIP null
2020.12 Sensitivity-Aware Bit Allocation for Intermediate Deep Feature Compression Yuzhang Hu et.al. VCIP null
2020.10 Data Representation in Hybrid Coding Framework for Feature Maps Compression Zhuo Chen et.al. ICIP null
2020.05 Deriving Compact Feature Representations Via Annealed Contraction Muhammad A. Shah et.al. ICASSP null
2020.05 Bit Allocation for Multi-Task Collaborative Intelligence Saeed Ranjbar Alvar et.al. ICASSP null
2019.10 Lossy Intermediate Deep Learning Feature Compression and Evaluation Zhuo Chen et.al. ACM MM null
2019.09 Toward Intelligent Sensing: Intermediate Deep Feature Compression Zhuo Chen et.al. TIP null
2019.08 Multi-Task Learning with Compressible Features for Collaborative Intelligence Saeed Ranjbar Alvar et.al. ICIP null
2019.05 Pixel-level Texture Segmentation Based AV1 Video Compression Di Chen et.al. ICASSP null

Collaborative Coding

Publish Date Title Authors PDF Code
2024.07 VNVC: A versatile neural video coding framework for efficient human-machine vision Xihua Sheng et.al. TPAMI null
2024.06 Human–Machine Collaborative Image Compression Method Based on Implicit Neural Representation Huanyang Li et.al. JETCAS null
2024.02 Scalable Human-Machine Point Cloud Compression Mateen Ulhaq et.al. PCS null
2023.12 Scalable Face Image Coding via StyleGAN Prior: Towards Compression for Human-Machine Collaborative Vision Qi Mao et.al. TIP null
2023.10 Sketch Assisted Face Image Coding for Human and Machine Vision: A Joint Training Approach Xin Fang et.al. TCSVT null
2023.10 DeepSVC: Deep scalable video coding for both machine and human vision Hongbin Lin et.al. ACM MM null
2023.10 ICMH-Net: Neural Image Compression Towards both Machine Vision and Human Vision Lei Liu et.al. ACM MM null
2021.06 Semantics-to-Signal Scalable Image Compression with Learned Revertible Representations Kang Liu et.al. IJCV null
2021.02 Pareto-Optimal Bit Allocation for Collaborative Intelligence Saeed Ranjbar Alvar et.al. TIP null
2020.07 Towards Coding For Human And Machine Vision: A Scalable Image Coding Approach Yueyu Hu et.al. ICME null
2020.05 Bit Allocation for Multi-Task Collaborative Intelligence Saeed Ranjbar Alvar et.al. ICASSP null
2019.08 Multi-Task Learning with Compressible Features for Collaborative Intelligence Saeed Ranjbar Alvar et.al. ICIP null
2019.05 Pixel-level Texture Segmentation Based AV1 Video Compression Di Chen et.al. ICASSP null

Point Cloud Compression for Machines

Publish Date Title Authors PDF Code
2024.07 Towards Point Cloud Compression for Machine Perception: A Simple and Strong Baseline by Learning the Octree Depth Level Predictor Lei Liu et.al. IJCAI null
2024.02 Scalable Human-Machine Point Cloud Compression Mateen Ulhaq et.al. PCS null
2023.10 Deep learning-based compressed domain point cloud classification Abdelrahman Seleem et.al. ICIP null

Image/Video Coding Meets Large Multimodal Models

Publish Date Title Authors PDF Code
2024.11 Large Language Models for Lossless Image Compression: Next-Pixel Prediction in Language Space is All You Need Kecheng Chen et.al. 2411.12448 null
2024.10 High Efficiency Image Compression for Large Visual-Language Models Binzhe Li et.al. TCSVT null
2024.08 Tell Codec What Worth Compressing: Semantically Disentangled Image Coding for Machine with LMMs Jinming Liu et.al. 2408.08575 null
2024.08 When Video Coding Meets Multimodal Large Language Models: A Unified Paradigm for Video Coding Pingping Zhang et.al. 2408.08093 null
2024.07 ComNeck: Bridging Compressed Image Latents and Multimodal LLMs via Universal Transform-Neck Chia-Hao Kao et.al. 2407.19651 null
