Long-RL: Scaling RL to Long Sequences (NeurIPS 2025)
Survey: https://arxiv.org/pdf/2507.20198
A High-Efficiency System for Large Language Model-Based Search Agents
Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)
TinyML and Efficient Deep Learning Computing | MIT 6.S965/6.5940
Dynamic Attention Mask (DAM) generates adaptive sparse attention masks per layer and head for Transformer models, enabling long-context inference with lower compute and memory overhead, without fine-tuning.
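As a rough illustration of the idea described above (not the repo's actual API; the function name, shapes, and the top-k criterion are assumptions), a per-query, per-head adaptive sparse mask might look like this:

```python
# Minimal sketch of per-head adaptive sparse attention masking (illustrative only;
# names and the top-k keep rule are assumptions, not the DAM implementation).
import torch

def topk_sparse_attention(q, k, v, keep_ratio=0.25):
    """q, k, v: [batch, heads, seq_len, head_dim]. Keeps only the highest-scoring
    keys per query per head and masks out the rest before the softmax."""
    scores = q @ k.transpose(-2, -1) / (q.size(-1) ** 0.5)   # [B, H, L, L]
    k_keep = max(1, int(keep_ratio * scores.size(-1)))
    thresh = scores.topk(k_keep, dim=-1).values[..., -1:]    # per-query cutoff score
    mask = scores >= thresh                                  # adaptive mask per head
    scores = scores.masked_fill(~mask, float("-inf"))
    return torch.softmax(scores, dim=-1) @ v

# Example: 2 heads, 16 tokens, 32-dim heads
q = k = v = torch.randn(1, 2, 16, 32)
out = topk_sparse_attention(q, k, v)   # [1, 2, 16, 32]
```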
Official PyTorch implementation of the paper "Dataset Distillation via the Wasserstein Metric" (ICCV 2025).
A deep learning framework that implements Early Exit strategies in Convolutional Neural Networks (CNNs) using Deep Q-Learning (DQN). The project improves computational efficiency by dynamically choosing the optimal exit point for image classification on CIFAR-10.
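A minimal sketch of the early-exit pattern, assuming CIFAR-10-sized inputs; the repo learns the exit decision with a DQN, whereas this sketch substitutes a fixed confidence threshold for that learned policy, and all class and layer names are illustrative:

```python
# Illustrative early-exit CNN. The exit decision here is a confidence threshold,
# standing in for the DQN policy described above (assumption, not the repo's code).
import torch
import torch.nn as nn

class EarlyExitCNN(nn.Module):
    def __init__(self, num_classes=10):
        super().__init__()
        self.block1 = nn.Sequential(nn.Conv2d(3, 32, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        self.block2 = nn.Sequential(nn.Conv2d(32, 64, 3, padding=1), nn.ReLU(), nn.MaxPool2d(2))
        self.exit1 = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(32, num_classes))
        self.exit2 = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(), nn.Linear(64, num_classes))

    def forward(self, x, threshold=0.9):
        x = self.block1(x)
        logits1 = self.exit1(x)
        if torch.softmax(logits1, dim=-1).max() > threshold:   # confident enough: exit early
            return logits1, "exit1"
        x = self.block2(x)                                      # otherwise run the full network
        return self.exit2(x), "exit2"

model = EarlyExitCNN()
logits, exit_point = model(torch.randn(1, 3, 32, 32))   # CIFAR-10-sized input
```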
Code for the paper "Automated Design for Hardware-aware Graph Neural Networks on Edge Devices"
Task-Aware Dynamic Model Optimization for Multi-Task Learning (IEEE Access 2023)
MOCA-Net: a novel neural architecture combining sparse MoE, external memory, and budget-aware computation. Integrates the Stanford SST-2 dataset, runs in O(L) complexity, and reaches 96.40% accuracy. Built for efficient sequence modeling.
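For the sparse-MoE component only (the external memory and budget mechanism are omitted), a rough top-k routing sketch; layer names, sizes, and the routing rule are assumptions, not MOCA-Net's actual design:

```python
# Sketch of top-k sparse MoE routing: each token is processed by only its
# highest-scoring experts, keeping per-token compute roughly constant.
import torch
import torch.nn as nn

class SparseMoE(nn.Module):
    def __init__(self, dim=128, num_experts=4, top_k=1):
        super().__init__()
        self.router = nn.Linear(dim, num_experts)
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, dim * 2), nn.GELU(), nn.Linear(dim * 2, dim))
            for _ in range(num_experts)
        )
        self.top_k = top_k

    def forward(self, x):                                  # x: [tokens, dim]
        gates = torch.softmax(self.router(x), dim=-1)
        weights, idx = gates.topk(self.top_k, dim=-1)      # route each token to its top-k experts
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            mask = (idx == e).any(dim=-1)                  # tokens assigned to expert e
            if mask.any():
                w = weights[mask][idx[mask] == e].unsqueeze(-1)
                out[mask] += w * expert(x[mask])
        return out

moe = SparseMoE()
y = moe(torch.randn(16, 128))   # 16 tokens routed sparsely across 4 experts
```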
A production-grade GPT transformer implemented from scratch in C++, with complete mathematical derivations and optimized tensor operations; runs on modest hardware.