2018-10-08 13:16:59 +02:00
|
|
|
# kube-metrics-adapter
|
2018-10-09 13:23:50 +02:00
|
|
|
[![Build Status](https://travis-ci.org/zalando-incubator/kube-metrics-adapter.svg?branch=master)](https://travis-ci.org/zalando-incubator/kube-metrics-adapter)
|
2018-11-14 11:02:53 +01:00
|
|
|
[![Coverage Status](https://coveralls.io/repos/github/zalando-incubator/kube-metrics-adapter/badge.svg?branch=master)](https://coveralls.io/github/zalando-incubator/kube-metrics-adapter?branch=master)
|
2019-07-27 10:42:46 +02:00
|
|
|
|
2018-10-08 13:17:05 +02:00
|
|
|
Kube Metrics Adapter is a general purpose metrics adapter for Kubernetes that
|
|
|
|
can collect and serve custom and external metrics for Horizontal Pod
|
|
|
|
Autoscaling.
|
|
|
|
|
2019-07-27 10:42:46 +02:00
|
|
|
It supports scaling based on [Prometheus metrics](https://prometheus.io/), [SQS queues](https://aws.amazon.com/sqs/) and others out of the box.
|
|
|
|
|
2018-10-08 13:17:05 +02:00
|
|
|
It discovers Horizontal Pod Autoscaling resources and starts to collect the
|
|
|
|
requested metrics and stores them in memory. It's implemented using the
|
|
|
|
[custom-metrics-apiserver](https://github.com/kubernetes-incubator/custom-metrics-apiserver)
|
|
|
|
library.
|
|
|
|
|
|
|
|
Here's an example of a `HorizontalPodAutoscaler` resource configured to get
|
|
|
|
`requests-per-second` metrics from each pod of the deployment `myapp`.
|
|
|
|
|
|
|
|
```yaml
|
2020-01-24 10:20:59 +01:00
|
|
|
apiVersion: autoscaling/v2beta2
|
2018-10-08 13:17:05 +02:00
|
|
|
kind: HorizontalPodAutoscaler
|
|
|
|
metadata:
|
|
|
|
name: myapp-hpa
|
|
|
|
annotations:
|
2020-11-04 20:40:53 +01:00
|
|
|
# metric-config.<metricType>.<metricName>.<collectorType>/<configKey>
|
2018-10-08 13:17:05 +02:00
|
|
|
metric-config.pods.requests-per-second.json-path/json-key: "$.http_server.rps"
|
|
|
|
metric-config.pods.requests-per-second.json-path/path: /metrics
|
|
|
|
metric-config.pods.requests-per-second.json-path/port: "9090"
|
|
|
|
spec:
|
|
|
|
scaleTargetRef:
|
|
|
|
apiVersion: apps/v1
|
|
|
|
kind: Deployment
|
|
|
|
name: myapp
|
|
|
|
minReplicas: 1
|
|
|
|
maxReplicas: 10
|
|
|
|
metrics:
|
|
|
|
- type: Pods
|
|
|
|
pods:
|
2020-01-24 10:20:59 +01:00
|
|
|
metric:
|
|
|
|
name: requests-per-second
|
|
|
|
target:
|
|
|
|
averageValue: 1k
|
|
|
|
type: AverageValue
|
2018-10-08 13:17:05 +02:00
|
|
|
```
|
|
|
|
|
|
|
|
The `metric-config.*` annotations are used by the `kube-metrics-adapter` to
|
|
|
|
configure a collector for getting the metrics. In the above example it
|
|
|
|
configures a *json-path pod collector*.
|
|
|
|
|
2019-07-27 10:42:46 +02:00
|
|
|
## Kubernetes compatibility
|
|
|
|
|
|
|
|
Like the [support
|
|
|
|
policy](https://kubernetes.io/docs/setup/release/version-skew-policy/) offered
|
|
|
|
for Kubernetes, this project aims to support the latest three minor releases of
|
|
|
|
Kubernetes.
|
|
|
|
|
2020-01-24 10:20:59 +01:00
|
|
|
The default supported API is `autoscaling/v2beta2` (available since `v1.12`).
|
|
|
|
This API MUST be available in the cluster which is the default. However for
|
|
|
|
GKE, this requires GKE v1.15.7 according to this [GKE
|
|
|
|
Issue](https://issuetracker.google.com/issues/135624588).
|
2019-07-27 10:42:46 +02:00
|
|
|
|
2018-10-08 13:17:05 +02:00
|
|
|
## Building
|
|
|
|
|
|
|
|
This project uses [Go modules](https://github.com/golang/go/wiki/Modules) as
|
|
|
|
introduced in Go 1.11 therefore you need Go >=1.11 installed in order to build.
|
|
|
|
If using Go 1.11 you also need to [activate Module
|
|
|
|
support](https://github.com/golang/go/wiki/Modules#installing-and-activating-module-support).
|
|
|
|
|
|
|
|
Assuming Go has been setup with module support it can be built simply by running:
|
|
|
|
|
|
|
|
```sh
|
|
|
|
export GO111MODULE=on # needed if the project is checked out in your $GOPATH.
|
|
|
|
$ make
|
|
|
|
```
|
|
|
|
|
|
|
|
## Collectors
|
|
|
|
|
|
|
|
Collectors are different implementations for getting metrics requested by an
|
|
|
|
HPA resource. They are configured based on HPA resources and started on-demand by the
|
|
|
|
`kube-metrics-adapter` to only collect the metrics required for scaling the application.
|
|
|
|
|
|
|
|
The collectors are configured either simply based on the metrics defined in an
|
|
|
|
HPA resource, or via additional annotations on the HPA resource.
|
|
|
|
|
|
|
|
## Pod collector
|
|
|
|
|
|
|
|
The pod collector allows collecting metrics from each pod matched by the HPA.
|
|
|
|
Currently only `json-path` collection is supported.
|
|
|
|
|
|
|
|
### Supported metrics
|
|
|
|
|
2019-07-27 10:42:46 +02:00
|
|
|
| Metric | Description | Type | K8s Versions |
|
|
|
|
| ------------ | -------------- | ------- | -- |
|
2020-01-24 10:20:59 +01:00
|
|
|
| *custom* | No predefined metrics. Metrics are generated from user defined queries. | Pods | `>=1.12` |
|
2018-10-08 13:17:05 +02:00
|
|
|
|
|
|
|
### Example
|
|
|
|
|
|
|
|
This is an example of using the pod collector to collect metrics from a json
|
|
|
|
metrics endpoint of each pod matched by the HPA.
|
|
|
|
|
|
|
|
```yaml
|
2020-01-24 10:20:59 +01:00
|
|
|
apiVersion: autoscaling/v2beta2
|
2018-10-08 13:17:05 +02:00
|
|
|
kind: HorizontalPodAutoscaler
|
|
|
|
metadata:
|
|
|
|
name: myapp-hpa
|
|
|
|
annotations:
|
2020-11-04 20:40:53 +01:00
|
|
|
# metric-config.<metricType>.<metricName>.<collectorType>/<configKey>
|
2018-10-08 13:17:05 +02:00
|
|
|
metric-config.pods.requests-per-second.json-path/json-key: "$.http_server.rps"
|
|
|
|
metric-config.pods.requests-per-second.json-path/path: /metrics
|
|
|
|
metric-config.pods.requests-per-second.json-path/port: "9090"
|
2019-07-14 15:49:00 +02:00
|
|
|
metric-config.pods.requests-per-second.json-path/scheme: "https"
|
2019-10-24 17:15:10 +01:00
|
|
|
metric-config.pods.requests-per-second.json-path/aggregator: "max"
|
2018-10-08 13:17:05 +02:00
|
|
|
spec:
|
|
|
|
scaleTargetRef:
|
|
|
|
apiVersion: apps/v1
|
|
|
|
kind: Deployment
|
|
|
|
name: myapp
|
|
|
|
minReplicas: 1
|
|
|
|
maxReplicas: 10
|
|
|
|
metrics:
|
|
|
|
- type: Pods
|
|
|
|
pods:
|
2020-01-24 10:20:59 +01:00
|
|
|
metric:
|
|
|
|
name: requests-per-second
|
|
|
|
target:
|
|
|
|
averageValue: 1k
|
|
|
|
type: AverageValue
|
2018-10-08 13:17:05 +02:00
|
|
|
```
|
|
|
|
|
|
|
|
The pod collector is configured through the annotations which specify the
|
|
|
|
collector name `json-path` and a set of configuration options for the
|
|
|
|
collector. `json-key` defines the json-path query for extracting the right
|
|
|
|
metric. This assumes the pod is exposing metrics in JSON format. For the above
|
|
|
|
example the following JSON data would be expected:
|
|
|
|
|
|
|
|
```json
|
|
|
|
{
|
|
|
|
"http_server": {
|
|
|
|
"rps": 0.5
|
|
|
|
}
|
|
|
|
}
|
|
|
|
```
|
|
|
|
|
|
|
|
The json-path query support depends on the
|
2020-08-13 09:51:05 +02:00
|
|
|
[github.com/spyzhov/ajson](https://github.com/spyzhov/ajson) library.
|
2018-10-08 13:17:05 +02:00
|
|
|
See the README for possible queries. It's expected that the metric you query
|
|
|
|
returns something that can be turned into a `float64`.
|
|
|
|
|
2019-07-14 15:49:00 +02:00
|
|
|
The other configuration options `path`, `port` and `scheme` specify where the metrics
|
|
|
|
endpoint is exposed on the pod. The `path` and `port` options do not have default values
|
|
|
|
so they must be defined. The `scheme` is optional and defaults to `http`.
|
2018-10-08 13:17:05 +02:00
|
|
|
|
2019-10-24 17:15:10 +01:00
|
|
|
The `aggregator` configuration option specifies the aggregation function used to aggregate
|
|
|
|
values of JSONPath expressions that evaluate to arrays/slices of numbers.
|
|
|
|
It's optional but when the expression evaluates to an array/slice, it's absence will
|
|
|
|
produce an error. The supported aggregation functions are `avg`, `max`, `min` and `sum`.
|
|
|
|
|
2020-03-10 03:45:28 -06:00
|
|
|
The `raw-query` configuration option specifies the query params to send along to the endpoint:
|
|
|
|
```yaml
|
|
|
|
metric-config.pods.requests-per-second.json-path/path: /metrics
|
|
|
|
metric-config.pods.requests-per-second.json-path/port: "9090"
|
|
|
|
metric-config.pods.requests-per-second.json-path/raw-query: "foo=bar&baz=bop"
|
|
|
|
```
|
|
|
|
will create a URL like this:
|
|
|
|
```
|
|
|
|
http://<podIP>:9090/metrics?foo=bar&baz=bop
|
|
|
|
```
|
|
|
|
|
2020-05-15 20:48:46 -06:00
|
|
|
There are also configuration options for custom (connect and request) timeouts when querying pods for metrics:
|
|
|
|
```yaml
|
|
|
|
metric-config.pods.requests-per-second.json-path/request-timeout: 2s
|
|
|
|
metric-config.pods.requests-per-second.json-path/connect-timeout: 500ms
|
|
|
|
```
|
|
|
|
|
|
|
|
The default for both of the above values is 15 seconds.
|
|
|
|
|
2018-10-08 13:17:05 +02:00
|
|
|
## Prometheus collector
|
|
|
|
|
|
|
|
The Prometheus collector is a generic collector which can map Prometheus
|
|
|
|
queries to metrics that can be used for scaling. This approach is different
|
|
|
|
from how it's done in the
|
|
|
|
[k8s-prometheus-adapter](https://github.com/DirectXMan12/k8s-prometheus-adapter)
|
|
|
|
where all available Prometheus metrics are collected
|
|
|
|
and transformed into metrics which the HPA can scale on, and there is no
|
|
|
|
possibility to do custom queries.
|
|
|
|
With the approach implemented here, users can define custom queries and only metrics
|
|
|
|
returned from those queries will be available, reducing the total number of
|
|
|
|
metrics stored.
|
|
|
|
|
|
|
|
One downside of this approach is that bad performing queries can slow down/kill
|
|
|
|
Prometheus, so it can be dangerous to allow in a multi tenant cluster. It's
|
|
|
|
also not possible to restrict the available metrics using something like RBAC
|
|
|
|
since any user would be able to create the metrics based on a custom query.
|
|
|
|
|
|
|
|
I still believe custom queries are more useful, but it's good to be aware of
|
|
|
|
the trade-offs between the two approaches.
|
|
|
|
|
|
|
|
### Supported metrics
|
|
|
|
|
2019-07-27 10:42:46 +02:00
|
|
|
| Metric | Description | Type | Kind | K8s Versions |
|
|
|
|
| ------------ | -------------- | ------- | -- | -- |
|
2020-01-24 10:20:59 +01:00
|
|
|
| `prometheus-query` | Generic metric which requires a user defined query. | External | | `>=1.12` |
|
|
|
|
| *custom* | No predefined metrics. Metrics are generated from user defined queries. | Object | *any* | `>=1.12` |
|
2018-10-08 13:17:05 +02:00
|
|
|
|
2020-10-26 17:37:24 +01:00
|
|
|
### Configure Scrape Interval
|
|
|
|
|
|
|
|
There is a way to set the interval via an annotation like:
|
|
|
|
|
|
|
|
```yaml
|
|
|
|
metric-config.external.prometheus-query.prometheus/interval: 30s
|
|
|
|
```
|
|
|
|
|
|
|
|
Default is 1 minute.
|
|
|
|
|
2019-04-30 23:22:16 +02:00
|
|
|
### Example: External Metric
|
|
|
|
|
|
|
|
This is an example of an HPA configured to get metrics based on a Prometheus
|
|
|
|
query. The query is defined in the annotation
|
2020-11-04 20:40:53 +01:00
|
|
|
`metric-config.external.processed-events-per-second.prometheus/query`
|
2019-04-30 23:22:16 +02:00
|
|
|
where `processed-events-per-second` is the query name which will be associated
|
2020-11-04 20:40:53 +01:00
|
|
|
with the result of the query.
|
|
|
|
This allows having multiple prometheus queries associated with a single HPA.
|
2019-04-30 23:22:16 +02:00
|
|
|
|
|
|
|
```yaml
|
2020-01-24 10:20:59 +01:00
|
|
|
apiVersion: autoscaling/v2beta2
|
2019-04-30 23:22:16 +02:00
|
|
|
kind: HorizontalPodAutoscaler
|
|
|
|
metadata:
|
|
|
|
name: myapp-hpa
|
|
|
|
annotations:
|
2019-08-14 11:46:57 +02:00
|
|
|
# This annotation is optional.
|
|
|
|
# If specified, then this prometheus server is used,
|
|
|
|
# instead of the prometheus server specified as the CLI argument `--prometheus-server`.
|
2020-11-04 20:40:53 +01:00
|
|
|
metric-config.external.processed-events-per-second.prometheus/prometheus-server: http://prometheus.my-namespace.svc
|
|
|
|
# metric-config.<metricType>.<metricName>.<collectorType>/<configKey>
|
|
|
|
metric-config.external.processed-events-per-second.prometheus/query: |
|
2019-04-30 23:22:16 +02:00
|
|
|
scalar(sum(rate(event-service_events_count{application="event-service",processed="true"}[1m])))
|
|
|
|
spec:
|
|
|
|
scaleTargetRef:
|
|
|
|
apiVersion: apps/v1
|
|
|
|
kind: Deployment
|
|
|
|
name: custom-metrics-consumer
|
|
|
|
minReplicas: 1
|
|
|
|
maxReplicas: 10
|
|
|
|
metrics:
|
|
|
|
- type: External
|
|
|
|
external:
|
2020-01-24 10:20:59 +01:00
|
|
|
metric:
|
2020-11-04 20:40:53 +01:00
|
|
|
name: processed-events-per-second
|
2020-01-24 10:20:59 +01:00
|
|
|
selector:
|
|
|
|
matchLabels:
|
2020-11-04 20:40:53 +01:00
|
|
|
type: prometheus
|
2020-01-24 10:20:59 +01:00
|
|
|
target:
|
|
|
|
type: AverageValue
|
|
|
|
averageValue: "10"
|
2019-04-30 23:22:16 +02:00
|
|
|
```
|
|
|
|
|
|
|
|
### Example: Object Metric [DEPRECATED]
|
|
|
|
|
|
|
|
> _Note: Prometheus Object metrics are **deprecated** and will most likely be
|
|
|
|
> removed in the future. Use the Prometheus External metrics instead as described
|
|
|
|
> above._
|
2018-10-08 13:17:05 +02:00
|
|
|
|
|
|
|
This is an example of an HPA configured to get metrics based on a Prometheus
|
|
|
|
query. The query is defined in the annotation
|
|
|
|
`metric-config.object.processed-events-per-second.prometheus/query` where
|
|
|
|
`processed-events-per-second` is the metric name which will be associated with
|
|
|
|
the result of the query.
|
|
|
|
|
|
|
|
It also specifies an annotation
|
|
|
|
`metric-config.object.processed-events-per-second.prometheus/per-replica` which
|
|
|
|
instructs the collector to treat the results as an average over all pods
|
|
|
|
targeted by the HPA. This makes it possible to mimic the behavior of
|
|
|
|
`targetAverageValue` which is not implemented for metric type `Object` as of
|
|
|
|
Kubernetes v1.10. ([It will most likely come in v1.12](https://github.com/kubernetes/kubernetes/pull/64097#event-1696222479)).
|
|
|
|
|
|
|
|
```yaml
|
|
|
|
apiVersion: autoscaling/v2beta1
|
|
|
|
kind: HorizontalPodAutoscaler
|
|
|
|
metadata:
|
|
|
|
name: myapp-hpa
|
|
|
|
annotations:
|
2020-11-04 20:40:53 +01:00
|
|
|
# metric-config.<metricType>.<metricName>.<collectorType>/<configKey>
|
2018-10-08 13:17:05 +02:00
|
|
|
metric-config.object.processed-events-per-second.prometheus/query: |
|
|
|
|
scalar(sum(rate(event-service_events_count{application="event-service",processed="true"}[1m])))
|
|
|
|
metric-config.object.processed-events-per-second.prometheus/per-replica: "true"
|
|
|
|
spec:
|
|
|
|
scaleTargetRef:
|
|
|
|
apiVersion: apps/v1
|
|
|
|
kind: Deployment
|
|
|
|
name: custom-metrics-consumer
|
|
|
|
minReplicas: 1
|
|
|
|
maxReplicas: 10
|
|
|
|
metrics:
|
|
|
|
- type: Object
|
|
|
|
object:
|
|
|
|
metricName: processed-events-per-second
|
|
|
|
target:
|
|
|
|
apiVersion: v1
|
2019-03-25 16:15:54 +01:00
|
|
|
kind: Pod
|
|
|
|
name: dummy-pod
|
2018-10-08 13:17:05 +02:00
|
|
|
targetValue: 10 # this will be treated as targetAverageValue
|
|
|
|
```
|
|
|
|
|
2019-03-25 16:15:54 +01:00
|
|
|
_Note:_ The HPA object requires an `Object` to be specified. However when a Prometheus metric is used there is no need
|
|
|
|
for this object. But to satisfy the schema we specify a dummy pod called `dummy-pod`.
|
|
|
|
|
2019-04-30 23:22:16 +02:00
|
|
|
|
2018-10-08 13:17:05 +02:00
|
|
|
## Skipper collector
|
|
|
|
|
|
|
|
The skipper collector is a simple wrapper around the Prometheus collector to
|
|
|
|
make it easy to define an HPA for scaling based on ingress metrics when
|
|
|
|
[skipper](https://github.com/zalando/skipper) is used as the ingress
|
|
|
|
implementation in your cluster. It assumes you are collecting Prometheus
|
|
|
|
metrics from skipper and it provides the correct Prometheus queries out of the
|
|
|
|
box so users don't have to define those manually.
|
|
|
|
|
|
|
|
### Supported metrics
|
|
|
|
|
2019-07-27 10:42:46 +02:00
|
|
|
| Metric | Description | Type | Kind | K8s Versions |
|
|
|
|
| ----------- | -------------- | ------ | ---- | ---- |
|
2020-01-24 10:20:59 +01:00
|
|
|
| `requests-per-second` | Scale based on requests per second for a certain ingress. | Object | `Ingress` | `>=1.14` |
|
2018-10-08 13:17:05 +02:00
|
|
|
|
|
|
|
### Example
|
|
|
|
|
|
|
|
This is an example of an HPA that will scale based on `requests-per-second` for
|
|
|
|
an ingress called `myapp`.
|
|
|
|
|
|
|
|
```yaml
|
2020-01-24 10:20:59 +01:00
|
|
|
apiVersion: autoscaling/v2beta2
|
2018-10-08 13:17:05 +02:00
|
|
|
kind: HorizontalPodAutoscaler
|
|
|
|
metadata:
|
|
|
|
name: myapp-hpa
|
|
|
|
spec:
|
|
|
|
scaleTargetRef:
|
|
|
|
apiVersion: apps/v1
|
|
|
|
kind: Deployment
|
|
|
|
name: myapp
|
|
|
|
minReplicas: 1
|
|
|
|
maxReplicas: 10
|
|
|
|
metrics:
|
|
|
|
- type: Object
|
|
|
|
object:
|
2020-01-24 10:20:59 +01:00
|
|
|
describedObject:
|
2018-10-08 13:17:05 +02:00
|
|
|
apiVersion: extensions/v1beta1
|
|
|
|
kind: Ingress
|
|
|
|
name: myapp
|
2020-01-24 10:20:59 +01:00
|
|
|
metric:
|
|
|
|
name: requests-per-second
|
|
|
|
target:
|
|
|
|
averageValue: "10"
|
|
|
|
type: AverageValue
|
2018-10-08 13:17:05 +02:00
|
|
|
```
|
|
|
|
|
2019-01-17 13:13:52 +01:00
|
|
|
### Metric weighting based on backend
|
|
|
|
|
2020-01-24 10:20:59 +01:00
|
|
|
Skipper supports sending traffic to different backend based on annotations
|
|
|
|
present on the `Ingress` object. When the metric name is specified without a
|
|
|
|
backend as `requests-per-second` then the number of replicas will be calculated
|
|
|
|
based on the full traffic served by that ingress. If however only the traffic
|
|
|
|
being routed to a specific backend should be used then the backend name can be
|
|
|
|
specified as a metric name like `requests-per-second,backend1` which would
|
|
|
|
return the requests-per-second being sent to the `backend1`. The ingress
|
|
|
|
annotation where the backend weights can be obtained can be specified through
|
|
|
|
the flag `--skipper-backends-annotation`.
|
2019-07-27 10:42:46 +02:00
|
|
|
|
2020-01-21 10:13:27 +01:00
|
|
|
## InfluxDB collector
|
|
|
|
|
|
|
|
The InfluxDB collector maps [Flux](https://github.com/influxdata/flux) queries to metrics that can be used for scaling.
|
|
|
|
|
|
|
|
Note that the collector targets an [InfluxDB v2](https://v2.docs.influxdata.com/v2.0/get-started/) instance, that's why
|
|
|
|
we only support Flux instead of InfluxQL.
|
|
|
|
|
|
|
|
### Supported metrics
|
|
|
|
|
|
|
|
| Metric | Description | Type | Kind | K8s Versions |
|
|
|
|
| ------------ | -------------- | ------- | -- | -- |
|
|
|
|
| `flux-query` | Generic metric which requires a user defined query. | External | | `>=1.10` |
|
|
|
|
|
|
|
|
### Example: External Metric
|
|
|
|
|
|
|
|
This is an example of an HPA configured to get metrics based on a Flux query.
|
|
|
|
The query is defined in the annotation
|
2020-11-04 20:40:53 +01:00
|
|
|
`metric-config.external.<metricName>.influxdb/query` where `<metricName>` is
|
|
|
|
the query name which will be associated with the result of the query. This
|
|
|
|
allows having multiple flux queries associated with a single HPA.
|
2020-01-21 10:13:27 +01:00
|
|
|
|
|
|
|
```yaml
|
2020-01-24 10:20:59 +01:00
|
|
|
apiVersion: autoscaling/v2beta2
|
2020-01-21 10:13:27 +01:00
|
|
|
kind: HorizontalPodAutoscaler
|
|
|
|
metadata:
|
2020-01-24 10:20:59 +01:00
|
|
|
name: myapp-hpa
|
2020-01-21 10:13:27 +01:00
|
|
|
annotations:
|
|
|
|
# These annotations are optional.
|
|
|
|
# If specified, then they are used for setting up the InfluxDB client properly,
|
|
|
|
# instead of using the ones specified via CLI. Respectively:
|
|
|
|
# - --influxdb-address
|
|
|
|
# - --influxdb-token
|
2020-02-21 18:26:46 +01:00
|
|
|
# - --influxdb-org
|
2020-11-04 20:40:53 +01:00
|
|
|
metric-config.external.queue-depth.influxdb/address: "http://influxdbv2.my-namespace.svc"
|
|
|
|
metric-config.external.queue-depth.influxdb/token: "secret-token"
|
2020-02-21 18:26:46 +01:00
|
|
|
# This could be either the organization name or the ID.
|
2020-11-04 20:40:53 +01:00
|
|
|
metric-config.external.queue-depth.influxdb/org: "deadbeef"
|
|
|
|
# metric-config.<metricType>.<metricName>.<collectorType>/<configKey>
|
2020-01-21 10:13:27 +01:00
|
|
|
# <configKey> == query-name
|
2020-11-04 20:40:53 +01:00
|
|
|
metric-config.external.queue-depth.influxdb/query: |
|
2020-01-21 10:13:27 +01:00
|
|
|
from(bucket: "apps")
|
|
|
|
|> range(start: -30s)
|
|
|
|
|> filter(fn: (r) => r._measurement == "queue_depth")
|
|
|
|
|> group()
|
|
|
|
|> max()
|
|
|
|
// Rename "_value" to "metricvalue" for letting the metrics server properly unmarshal the result.
|
|
|
|
|> rename(columns: {_value: "metricvalue"})
|
|
|
|
|> keep(columns: ["metricvalue"])
|
|
|
|
spec:
|
|
|
|
scaleTargetRef:
|
|
|
|
apiVersion: apps/v1
|
|
|
|
kind: Deployment
|
|
|
|
name: queryd-v1
|
|
|
|
minReplicas: 1
|
|
|
|
maxReplicas: 4
|
|
|
|
metrics:
|
|
|
|
- type: External
|
|
|
|
external:
|
2020-01-24 10:20:59 +01:00
|
|
|
metric:
|
2020-11-04 20:40:53 +01:00
|
|
|
name: queue-depth
|
2020-01-24 10:20:59 +01:00
|
|
|
selector:
|
|
|
|
matchLabels:
|
2020-11-04 20:40:53 +01:00
|
|
|
type: influxdb
|
2020-01-24 10:20:59 +01:00
|
|
|
target:
|
|
|
|
type: Value
|
|
|
|
value: "1"
|
2020-01-21 10:13:27 +01:00
|
|
|
```
|
|
|
|
|
2018-10-08 13:17:05 +02:00
|
|
|
## AWS collector
|
|
|
|
|
|
|
|
The AWS collector allows scaling based on external metrics exposed by AWS
|
|
|
|
services e.g. SQS queue lengths.
|
|
|
|
|
2019-05-17 11:07:18 +02:00
|
|
|
### AWS IAM role
|
|
|
|
|
|
|
|
To integrate with AWS, the controller needs to run on nodes with
|
|
|
|
access to AWS API. Additionally the controller have to have a role
|
|
|
|
with the following policy to get all required data from AWS:
|
|
|
|
|
|
|
|
```yaml
|
|
|
|
PolicyDocument:
|
|
|
|
Statement:
|
|
|
|
- Action: 'sqs:GetQueueUrl'
|
|
|
|
Effect: Allow
|
|
|
|
Resource: '*'
|
|
|
|
- Action: 'sqs:GetQueueAttributes'
|
|
|
|
Effect: Allow
|
|
|
|
Resource: '*'
|
|
|
|
- Action: 'sqs:ListQueues'
|
|
|
|
Effect: Allow
|
|
|
|
Resource: '*'
|
|
|
|
- Action: 'sqs:ListQueueTags'
|
|
|
|
Effect: Allow
|
|
|
|
Resource: '*'
|
|
|
|
Version: 2012-10-17
|
|
|
|
```
|
|
|
|
|
2018-10-08 13:17:05 +02:00
|
|
|
### Supported metrics
|
|
|
|
|
2019-07-27 10:42:46 +02:00
|
|
|
| Metric | Description | Type | K8s Versions |
|
|
|
|
| ------------ | ------- | -- | -- |
|
2020-01-24 10:20:59 +01:00
|
|
|
| `sqs-queue-length` | Scale based on SQS queue length | External | `>=1.12` |
|
2018-10-08 13:17:05 +02:00
|
|
|
|
|
|
|
### Example
|
|
|
|
|
|
|
|
This is an example of an HPA that will scale based on the length of an SQS
|
|
|
|
queue.
|
|
|
|
|
|
|
|
```yaml
|
2020-01-24 10:20:59 +01:00
|
|
|
apiVersion: autoscaling/v2beta2
|
2018-10-08 13:17:05 +02:00
|
|
|
kind: HorizontalPodAutoscaler
|
|
|
|
metadata:
|
|
|
|
name: myapp-hpa
|
|
|
|
spec:
|
|
|
|
scaleTargetRef:
|
|
|
|
apiVersion: apps/v1
|
|
|
|
kind: Deployment
|
|
|
|
name: custom-metrics-consumer
|
|
|
|
minReplicas: 1
|
|
|
|
maxReplicas: 10
|
|
|
|
metrics:
|
|
|
|
- type: External
|
|
|
|
external:
|
2020-01-24 10:20:59 +01:00
|
|
|
metric:
|
2020-11-04 20:40:53 +01:00
|
|
|
name: my-sqs
|
2020-01-24 10:20:59 +01:00
|
|
|
selector:
|
|
|
|
matchLabels:
|
2020-11-04 20:40:53 +01:00
|
|
|
type: sqs-queue-length
|
2020-01-24 10:20:59 +01:00
|
|
|
queue-name: foobar
|
|
|
|
region: eu-central-1
|
|
|
|
target:
|
|
|
|
averageValue: "30"
|
|
|
|
type: AverageValue
|
2018-10-08 13:17:05 +02:00
|
|
|
```
|
|
|
|
|
|
|
|
The `matchLabels` are used by `kube-metrics-adapter` to configure a collector
|
|
|
|
that will get the queue length for an SQS queue named `foobar` in region
|
|
|
|
`eu-central-1`.
|
|
|
|
|
|
|
|
The AWS account of the queue currently depends on how `kube-metrics-adapter` is
|
|
|
|
configured to get AWS credentials. The normal assumption is that you run the
|
|
|
|
adapter in a cluster running in the AWS account where the queue is defined.
|
|
|
|
Please open an issue if you would like support for other use cases.
|
2018-10-29 14:26:25 +01:00
|
|
|
|
|
|
|
## ZMON collector
|
|
|
|
|
|
|
|
The ZMON collector allows scaling based on external metrics exposed by
|
|
|
|
[ZMON](https://github.com/zalando/zmon) checks.
|
|
|
|
|
|
|
|
### Supported metrics
|
|
|
|
|
2019-07-27 10:42:46 +02:00
|
|
|
| Metric | Description | Type | K8s Versions |
|
|
|
|
| ------------ | ------- | -- | -- |
|
2020-01-24 10:20:59 +01:00
|
|
|
| `zmon-check` | Scale based on any ZMON check results | External | `>=1.12` |
|
2018-10-29 14:26:25 +01:00
|
|
|
|
|
|
|
### Example
|
|
|
|
|
|
|
|
This is an example of an HPA that will scale based on the specified value
|
|
|
|
exposed by a ZMON check with id `1234`.
|
|
|
|
|
|
|
|
```yaml
|
2020-01-24 10:20:59 +01:00
|
|
|
apiVersion: autoscaling/v2beta2
|
2018-10-29 14:26:25 +01:00
|
|
|
kind: HorizontalPodAutoscaler
|
|
|
|
metadata:
|
|
|
|
name: myapp-hpa
|
|
|
|
annotations:
|
2020-11-04 20:40:53 +01:00
|
|
|
# metric-config.<metricType>.<metricName>.<collectorType>/<configKey>
|
|
|
|
metric-config.external.my-zmon-check.zmon/key: "custom.*"
|
|
|
|
metric-config.external.my-zmon-check.zmon/tag-application: "my-custom-app-*"
|
2018-10-29 14:26:25 +01:00
|
|
|
spec:
|
|
|
|
scaleTargetRef:
|
|
|
|
apiVersion: apps/v1
|
|
|
|
kind: Deployment
|
|
|
|
name: custom-metrics-consumer
|
|
|
|
minReplicas: 1
|
|
|
|
maxReplicas: 10
|
|
|
|
metrics:
|
|
|
|
- type: External
|
|
|
|
external:
|
2020-01-24 10:20:59 +01:00
|
|
|
metric:
|
2020-11-04 20:40:53 +01:00
|
|
|
name: my-zmon-check
|
2020-01-24 10:20:59 +01:00
|
|
|
selector:
|
|
|
|
matchLabels:
|
2020-11-04 20:40:53 +01:00
|
|
|
type: zmon
|
2020-01-24 10:20:59 +01:00
|
|
|
check-id: "1234" # the ZMON check to query for metrics
|
|
|
|
key: "custom.value"
|
|
|
|
tag-application: my-custom-app
|
|
|
|
aggregators: avg # comma separated list of aggregation functions, default: last
|
|
|
|
duration: 5m # default: 10m
|
2020-04-01 20:32:09 +02:00
|
|
|
target:
|
|
|
|
averageValue: "30"
|
|
|
|
type: AverageValue
|
2018-10-29 14:26:25 +01:00
|
|
|
```
|
|
|
|
|
|
|
|
The `check-id` specifies the ZMON check to query for the metrics. `key`
|
|
|
|
specifies the JSON key in the check output to extract the metric value from.
|
|
|
|
E.g. if you have a check which returns the following data:
|
|
|
|
|
|
|
|
```json
|
|
|
|
{
|
|
|
|
"custom": {
|
|
|
|
"value": 1.0
|
|
|
|
},
|
|
|
|
"other": {
|
|
|
|
"value": 3.0
|
|
|
|
}
|
|
|
|
}
|
|
|
|
```
|
|
|
|
|
|
|
|
Then the value `1.0` would be returned when the key is defined as `custom.value`.
|
|
|
|
|
|
|
|
The `tag-<name>` labels defines the tags used for the kariosDB query. In a
|
|
|
|
normal ZMON setup the following tags will be available:
|
|
|
|
|
|
|
|
* `application`
|
|
|
|
* `alias` (name of Kubernetes cluster)
|
|
|
|
* `entity` - full ZMON entity ID.
|
|
|
|
|
|
|
|
`aggregators` defines the aggregation functions applied to the metrics query.
|
|
|
|
For instance if you define the entity filter
|
|
|
|
`type=kube_pod,application=my-custom-app` you might get three entities back and
|
|
|
|
then you might want to get an average over the metrics for those three
|
|
|
|
entities. This would be possible by using the `avg` aggregator. The default
|
|
|
|
aggregator is `last` which returns only the latest metric point from the
|
|
|
|
query. The supported aggregation functions are `avg`, `dev`, `count`,
|
|
|
|
`first`, `last`, `max`, `min`, `sum`, `diff`. See the [KariosDB docs](https://kairosdb.github.io/docs/build/html/restapi/Aggregators.html) for
|
|
|
|
details.
|
|
|
|
|
|
|
|
The `duration` defines the duration used for the timeseries query. E.g. if you
|
|
|
|
specify a duration of `5m` then the query will return metric points for the
|
|
|
|
last 5 minutes and apply the specified aggregation with the same duration .e.g
|
|
|
|
`max(5m)`.
|
|
|
|
|
2020-11-04 20:40:53 +01:00
|
|
|
The annotations `metric-config.external.my-zmon-check.zmon/key` and
|
|
|
|
`metric-config.external.my-zmon-check.zmon/tag-<name>` can be optionally used if
|
2018-10-29 14:26:25 +01:00
|
|
|
you need to define a `key` or other `tag` with a "star" query syntax like
|
|
|
|
`values.*`. This *hack* is in place because it's not allowed to use `*` in the
|
|
|
|
metric label definitions. If both annotations and corresponding label is
|
|
|
|
defined, then the annotation takes precedence.
|
2020-04-01 20:32:09 +02:00
|
|
|
|
|
|
|
## HTTP Collector
|
|
|
|
|
|
|
|
The http collector allows collecting metrics from an external endpoint specified in the HPA.
|
|
|
|
Currently only `json-path` collection is supported.
|
|
|
|
|
|
|
|
### Supported metrics
|
|
|
|
|
|
|
|
| Metric | Description | Type | K8s Versions |
|
|
|
|
| ------------ | -------------- | ------- | -- |
|
|
|
|
| *custom* | No predefined metrics. Metrics are generated from user defined queries. | Pods | `>=1.12` |
|
|
|
|
|
|
|
|
### Example
|
|
|
|
|
|
|
|
This is an example of using the HTTP collector to collect metrics from a json
|
|
|
|
metrics endpoint specified in the annotations.
|
|
|
|
|
|
|
|
```yaml
|
|
|
|
apiVersion: autoscaling/v2beta2
|
|
|
|
kind: HorizontalPodAutoscaler
|
|
|
|
metadata:
|
|
|
|
name: myapp-hpa
|
|
|
|
annotations:
|
2020-11-04 20:40:53 +01:00
|
|
|
# metric-config.<metricType>.<metricName>.<collectorType>/<configKey>
|
|
|
|
metric-config.external.unique-metric-name.json-path/json-key: "$.some-metric.value"
|
|
|
|
metric-config.external.unique-metric-name.json-path/endpoint: "http://metric-source.app-namespace:8080/metrics"
|
|
|
|
metric-config.external.unique-metric-name.json-path/aggregator: "max"
|
2020-04-01 20:32:09 +02:00
|
|
|
spec:
|
|
|
|
scaleTargetRef:
|
|
|
|
apiVersion: apps/v1
|
|
|
|
kind: Deployment
|
|
|
|
name: myapp
|
|
|
|
minReplicas: 1
|
|
|
|
maxReplicas: 10
|
|
|
|
metrics:
|
|
|
|
- type: External
|
|
|
|
external:
|
|
|
|
metric:
|
2020-11-04 20:40:53 +01:00
|
|
|
name: unique-metric-name
|
2020-04-01 20:32:09 +02:00
|
|
|
selector:
|
|
|
|
matchLabels:
|
2020-11-04 20:40:53 +01:00
|
|
|
type: json-path
|
2020-04-01 20:32:09 +02:00
|
|
|
target:
|
|
|
|
averageValue: 1
|
|
|
|
type: AverageValue
|
|
|
|
```
|
|
|
|
|
|
|
|
The HTTP collector similar to the Pod Metrics collector. The metric name should always be `http`.
|
|
|
|
This value is also used in the annotations to configure the metrics adapter to query the required
|
|
|
|
target. The following configuration values are supported:
|
|
|
|
|
|
|
|
- `json-key` to specify the JSON path of the metric to be queried
|
|
|
|
- `endpoint` the fully formed path to query for the metric. In the above example a Kubernetes _Service_
|
|
|
|
in the namespace `app-namespace` is called.
|
|
|
|
- `aggregator` is only required if the metric is an array of values and specifies how the values
|
|
|
|
are aggregated. Currently this option can support the values: `sum`, `max`, `min`, `avg`.
|