[TPDS 2025] EdgeAIBus: AI-driven Joint Container Management and Model Selection Framework for Heterogeneous Edge Computing
Updated Aug 26, 2025 - Jupyter Notebook
Kickstart your MLOps initiative with a flexible, robust, and productive Python package.
A lightweight, framework-agnostic middleware that dynamically batches inference requests in real time to maximize GPU/TPU utilization.
A simple gRPC server for Machine Learning (ML) Model Inference in Rust.
Microservice to digitize a chess scoresheet
Submission of Project
Enterprise Data Warehouse & ML Platform - A high-performance platform processing 24B records with <60 s latency and 100K records/sec throughput, featuring 32 fact tables, 128 dimensions, and automated ML pipelines achieving 91.2% accuracy. Real-time ML inference serves 300K+ predictions/hour with ensemble models.
Scripts for benchmarking vLLM with Llama 8B on an NVIDIA 4090 GPU