
Conversation

@poornas (Contributor) commented Aug 26, 2025

This is for the bucket-level Quality of Service (QoS) feature.

@poornas force-pushed the bqos3 branch 2 times, most recently from 5e80643 to 5b5337e on August 26, 2025 01:33
@harshavardhana (Member) commented:
PTAL @klauspost need your feedback here.

@klauspost (Contributor) commented:
@poornas Why is this in minio-go? This isn't an S3 API, correct? I assume this is part of a bigger change, since bucket-level QoS isn't that useful without node-level QoS.

@harshavardhana (Member) commented:
> @poornas Why is this in minio-go? This isn't an S3 API, correct? I assume this is part of a bigger change, since bucket-level QoS isn't that useful without node-level QoS.

These are bucket-level settings, implemented at the same level as the S3 API.

Just like the bucket inventory APIs, which are also not AWS S3 specific, we have extended the S3 API extensively.

@klauspost (Contributor) commented:
Sounds much like an admin API to me. But whatever.

Are there any docs on the YAML? This doesn't tell too much; maybe the docs should be here as well?

@harshavardhana (Member) commented:
> Sounds much like an admin API to me. But whatever.
>
> Are there any docs on the YAML? This doesn't tell too much; maybe the docs should be here as well?

@poornas ^^

@poornas (Contributor, Author) commented Sep 17, 2025

@klauspost, added the YAML docs to the aistor PR. Didn't expect Claude to generate such comprehensive docs on the feature.

@poornas (Contributor, Author) commented Sep 18, 2025

@vadmeste @klauspost , PTAL

Comment on lines +140 to +148

```go
type QOSMetric struct {
	APIName            string        `json:"apiName"`
	Rule               QOSRule       `json:"rule"`
	Totals             CounterMetric `json:"totals"`
	Throttled          CounterMetric `json:"throttleCount"`
	ExceededRateLimit  CounterMetric `json:"exceededRateLimitCount"`
	ClientDisconnCount CounterMetric `json:"clientDisconnectCount"`
	ReqTimeoutCount    CounterMetric `json:"reqTimeoutCount"`
}
```
Contributor commented:
So in terms of overall design, I am still a bit unsure how this ties into the total system.

Rates can only realistically be applied per node. So are these numbers divided by the node count? And each bucket has its own settings.

So given a server's capacity 'n', for this to be effective it will be divided by the node count and divided by the bucket count.

So each bucket would end up with a very small req/s... I feel like I'm probably missing the bigger picture. Is there a per-node QoS setting as well?

@poornas (Contributor, Author) commented:

The design was initially to rate limit at the node level, the way you are describing. However, QoS was defined more like a template, and depending on the number of buckets, the resulting rate limits would be arbitrary and not easy for users to reason about when deciding sane limits.

The scope has since been changed to make QoS bucket-centric: it is more useful to throttle specific APIs based on the workload seen in bucket-specific metrics.

The QoS config can be set on a per-bucket, as-needed basis. A rule limiting the PutObject API to x concurrent requests for a bucket implies a total limit of x * n (where n is the number of nodes). Bucket-level QoS allows taxing callers/APIs that are known to be problematic for system performance, based on metrics, rather than requiring admins to set a node-level limit that may be harder to control.

A drawback of bucket QoS is that overall limits are harder to infer; Prometheus metrics and the QoS status will likely help with this.
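The per-node cap arithmetic described above can be sketched with a simple semaphore. The `bucketLimiter` type below is a hypothetical illustration, not the actual minio implementation: each node independently admits at most x concurrent requests for a given (bucket, API) pair, so the cluster-wide effective cap is x * n.

```go
package main

import "fmt"

// bucketLimiter caps in-flight requests for one (bucket, API) pair on a
// single node. Names are illustrative; this is not the minio implementation.
type bucketLimiter struct {
	slots chan struct{} // buffered channel used as a counting semaphore
}

func newBucketLimiter(x int) *bucketLimiter {
	return &bucketLimiter{slots: make(chan struct{}, x)}
}

// tryAcquire returns false when the per-node cap is already reached,
// i.e. the request would be throttled rather than queued.
func (l *bucketLimiter) tryAcquire() bool {
	select {
	case l.slots <- struct{}{}:
		return true
	default:
		return false
	}
}

// release frees one slot when a request completes.
func (l *bucketLimiter) release() { <-l.slots }

func main() {
	const x, nodes = 2, 4 // per-node cap and node count (illustrative)
	lim := newBucketLimiter(x)

	admitted := 0
	for i := 0; i < 5; i++ { // 5 concurrent PutObject attempts on this node
		if lim.tryAcquire() {
			admitted++
		}
	}
	fmt.Println("admitted on this node:", admitted)     // 2
	fmt.Println("cluster-wide effective cap:", x*nodes) // 8
}
```

This also makes the drawback concrete: no single node knows the cluster-wide in-flight count, which is why aggregated metrics are needed to infer the overall limit.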

@poornas requested a review from klauspost on September 22, 2025 16:29
@klauspost (Contributor) left a comment:
I am not going to leave this hanging when someone clearly thinks this is a good idea.

@poornas (Contributor, Author) commented Sep 24, 2025

Can this be merged?
