Repositories tagged with "efficient-model"
temporal-shift-module
mit-han-lab
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
once-for-all
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
proxylessnas
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
amc
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
KVQuant
SqueezeAILab
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
haq
[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision
nn-Meter
microsoft
A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
hardware-aware-transformers
[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing
ZeroQ
amirgholami
[CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework
I-BERT
kssteven418
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
amc-models
VoV3D
youngwanLEE
Efficient 3D Backbone Network for Temporal Modeling
HBONet
d-li14
[ICCV 2019] Harmonious Bottleneck on Two Orthogonal Dimensions, surpassing MobileNetV2
LTP
[KDD'22] Learned Token Pruning for Transformers
owq
xvyaward
Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Models".
S2-BNN
szq0214
S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)
Any-Precision-DNNs
SHI-Labs
Any-Precision Deep Neural Networks (AAAI 2021)
OffSeg
HVision-NKU
[ICCV 2025] Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment