Diagnose unassigned shards

Diagnose unassigned shards

There are multiple reasons why shards might get unassigned, ranging from misconfigured allocation settings to lack of disk space.

In order to diagnose the unassigned shards in your deployment use the following steps:

Elasticsearch Service Self-managed

In order to diagnose the unassigned shards, follow the next steps:

Use Kibana

  1. Log in to the Elastic Cloud console.
  2. On the Elasticsearch Service panel, click the name of your deployment.

    If the name of your deployment is disabled your Kibana instances might be unhealthy, in which case please contact Elastic Support. If your deployment doesn’t include Kibana, all you need to do is enable it first.

  3. Open your deployment’s side navigation menu (placed under the Elastic logo in the upper left corner) and go to Dev Tools > Console.

    Kibana Console

  4. View the unassigned shards using the cat shards API.

    1. resp = client.cat.shards(
    2. v=True,
    3. h="index,shard,prirep,state,node,unassigned.reason",
    4. s="state",
    5. )
    6. print(resp)
    1. response = client.cat.shards(
    2. v: true,
    3. h: 'index,shard,prirep,state,node,unassigned.reason',
    4. s: 'state'
    5. )
    6. puts response
    1. const response = await client.cat.shards({
    2. v: "true",
    3. h: "index,shard,prirep,state,node,unassigned.reason",
    4. s: "state",
    5. });
    6. console.log(response);
    1. GET _cat/shards?v=true&h=index,shard,prirep,state,node,unassigned.reason&s=state

    The response will look like this:

    1. [
    2. {
    3. "index": "my-index-000001",
    4. "shard": "0",
    5. "prirep": "p",
    6. "state": "UNASSIGNED",
    7. "node": null,
    8. "unassigned.reason": "INDEX_CREATED"
    9. }
    10. ]

    Unassigned shards have a state of UNASSIGNED. The prirep value is p for primary shards and r for replicas.

    The index in the example has a primary shard unassigned.

  5. To understand why an unassigned shard is not being assigned and what action you must take to allow Elasticsearch to assign it, use the cluster allocation explanation API.

    1. resp = client.cluster.allocation_explain(
    2. index="my-index-000001",
    3. shard=0,
    4. primary=True,
    5. )
    6. print(resp)
    1. response = client.cluster.allocation_explain(
    2. body: {
    3. index: 'my-index-000001',
    4. shard: 0,
    5. primary: true
    6. }
    7. )
    8. puts response
    1. const response = await client.cluster.allocationExplain({
    2. index: "my-index-000001",
    3. shard: 0,
    4. primary: true,
    5. });
    6. console.log(response);
    1. GET _cluster/allocation/explain
    2. {
    3. "index": "my-index-000001",
    4. "shard": 0,
    5. "primary": true
    6. }

    The index we want to diagnose.

    The unassigned shard ID.

    Indicates that we are diagnosing a primary shard.

    The response will look like this:

    1. {
    2. "index" : "my-index-000001",
    3. "shard" : 0,
    4. "primary" : true,
    5. "current_state" : "unassigned",
    6. "unassigned_info" : {
    7. "reason" : "INDEX_CREATED",
    8. "at" : "2022-01-04T18:08:16.600Z",
    9. "last_allocation_status" : "no"
    10. },
    11. "can_allocate" : "no",
    12. "allocate_explanation" : "Elasticsearch isn't allowed to allocate this shard to any of the nodes in the cluster. Choose a node to which you expect this shard to be allocated, find this node in the node-by-node explanation, and address the reasons which prevent Elasticsearch from allocating this shard there.",
    13. "node_allocation_decisions" : [
    14. {
    15. "node_id" : "8qt2rY-pT6KNZB3-hGfLnw",
    16. "node_name" : "node-0",
    17. "transport_address" : "127.0.0.1:9401",
    18. "roles": ["data_content", "data_hot"],
    19. "node_attributes" : {},
    20. "node_decision" : "no",
    21. "weight_ranking" : 1,
    22. "deciders" : [
    23. {
    24. "decider" : "filter",
    25. "decision" : "NO",
    26. "explanation" : "node does not match index setting [index.routing.allocation.include] filters [_name:\"nonexistent_node\"]"
    27. }
    28. ]
    29. }
    30. ]
    31. }

    The current state of the shard.

    The reason for the shard originally becoming unassigned.

    Whether to allocate the shard.

    Whether to allocate the shard to the particular node.

    The decider which led to the no decision for the node.

    An explanation as to why the decider returned a no decision, with a helpful hint pointing to the setting that led to the decision.

  6. The explanation in our case indicates the index allocation configurations are not correct. To review your allocation settings, use the get index settings and cluster get settings APIs.

    1. resp = client.indices.get_settings(
    2. index="my-index-000001",
    3. flat_settings=True,
    4. include_defaults=True,
    5. )
    6. print(resp)
    7. resp1 = client.cluster.get_settings(
    8. flat_settings=True,
    9. include_defaults=True,
    10. )
    11. print(resp1)
    1. response = client.indices.get_settings(
    2. index: 'my-index-000001',
    3. flat_settings: true,
    4. include_defaults: true
    5. )
    6. puts response
    7. response = client.cluster.get_settings(
    8. flat_settings: true,
    9. include_defaults: true
    10. )
    11. puts response
    1. const response = await client.indices.getSettings({
    2. index: "my-index-000001",
    3. flat_settings: "true",
    4. include_defaults: "true",
    5. });
    6. console.log(response);
    7. const response1 = await client.cluster.getSettings({
    8. flat_settings: "true",
    9. include_defaults: "true",
    10. });
    11. console.log(response1);
    1. GET my-index-000001/_settings?flat_settings=true&include_defaults=true
    2. GET _cluster/settings?flat_settings=true&include_defaults=true
  7. Change the settings using the update index settings and cluster update settings APIs to the correct values in order to allow the index to be allocated.

For more guidance on fixing the most common causes for unassinged shards please follow this guide or contact Elastic Support.

In order to diagnose the unassigned shards follow the next steps:

  1. View the unassigned shards using the cat shards API.

    1. resp = client.cat.shards(
    2. v=True,
    3. h="index,shard,prirep,state,node,unassigned.reason",
    4. s="state",
    5. )
    6. print(resp)
    1. response = client.cat.shards(
    2. v: true,
    3. h: 'index,shard,prirep,state,node,unassigned.reason',
    4. s: 'state'
    5. )
    6. puts response
    1. const response = await client.cat.shards({
    2. v: "true",
    3. h: "index,shard,prirep,state,node,unassigned.reason",
    4. s: "state",
    5. });
    6. console.log(response);
    1. GET _cat/shards?v=true&h=index,shard,prirep,state,node,unassigned.reason&s=state

    The response will look like this:

    1. [
    2. {
    3. "index": "my-index-000001",
    4. "shard": "0",
    5. "prirep": "p",
    6. "state": "UNASSIGNED",
    7. "node": null,
    8. "unassigned.reason": "INDEX_CREATED"
    9. }
    10. ]

    Unassigned shards have a state of UNASSIGNED. The prirep value is p for primary shards and r for replicas.

    The index in the example has a primary shard unassigned.

  2. To understand why an unassigned shard is not being assigned and what action you must take to allow Elasticsearch to assign it, use the cluster allocation explanation API.

    1. resp = client.cluster.allocation_explain(
    2. index="my-index-000001",
    3. shard=0,
    4. primary=True,
    5. )
    6. print(resp)
    1. response = client.cluster.allocation_explain(
    2. body: {
    3. index: 'my-index-000001',
    4. shard: 0,
    5. primary: true
    6. }
    7. )
    8. puts response
    1. const response = await client.cluster.allocationExplain({
    2. index: "my-index-000001",
    3. shard: 0,
    4. primary: true,
    5. });
    6. console.log(response);
    1. GET _cluster/allocation/explain
    2. {
    3. "index": "my-index-000001",
    4. "shard": 0,
    5. "primary": true
    6. }

    The index we want to diagnose.

    The unassigned shard ID.

    Indicates that we are diagnosing a primary shard.

    The response will look like this:

    1. {
    2. "index" : "my-index-000001",
    3. "shard" : 0,
    4. "primary" : true,
    5. "current_state" : "unassigned",
    6. "unassigned_info" : {
    7. "reason" : "INDEX_CREATED",
    8. "at" : "2022-01-04T18:08:16.600Z",
    9. "last_allocation_status" : "no"
    10. },
    11. "can_allocate" : "no",
    12. "allocate_explanation" : "Elasticsearch isn't allowed to allocate this shard to any of the nodes in the cluster. Choose a node to which you expect this shard to be allocated, find this node in the node-by-node explanation, and address the reasons which prevent Elasticsearch from allocating this shard there.",
    13. "node_allocation_decisions" : [
    14. {
    15. "node_id" : "8qt2rY-pT6KNZB3-hGfLnw",
    16. "node_name" : "node-0",
    17. "transport_address" : "127.0.0.1:9401",
    18. "roles": ["data_content", "data_hot"]
    19. "node_attributes" : {},
    20. "node_decision" : "no",
    21. "weight_ranking" : 1,
    22. "deciders" : [
    23. {
    24. "decider" : "filter",
    25. "decision" : "NO",
    26. "explanation" : "node does not match index setting [index.routing.allocation.include] filters [_name:\"nonexistent_node\"]"
    27. }
    28. ]
    29. }
    30. ]
    31. }

    The current state of the shard.

    The reason for the shard originally becoming unassigned.

    Whether to allocate the shard.

    Whether to allocate the shard to the particular node.

    The decider which led to the no decision for the node.

    An explanation as to why the decider returned a no decision, with a helpful hint pointing to the setting that led to the decision.

  3. The explanation in our case indicates the index allocation configurations are not correct. To review your allocation settings, use the get index settings and cluster get settings APIs.

    1. resp = client.indices.get_settings(
    2. index="my-index-000001",
    3. flat_settings=True,
    4. include_defaults=True,
    5. )
    6. print(resp)
    7. resp1 = client.cluster.get_settings(
    8. flat_settings=True,
    9. include_defaults=True,
    10. )
    11. print(resp1)
    1. response = client.indices.get_settings(
    2. index: 'my-index-000001',
    3. flat_settings: true,
    4. include_defaults: true
    5. )
    6. puts response
    7. response = client.cluster.get_settings(
    8. flat_settings: true,
    9. include_defaults: true
    10. )
    11. puts response
    1. const response = await client.indices.getSettings({
    2. index: "my-index-000001",
    3. flat_settings: "true",
    4. include_defaults: "true",
    5. });
    6. console.log(response);
    7. const response1 = await client.cluster.getSettings({
    8. flat_settings: "true",
    9. include_defaults: "true",
    10. });
    11. console.log(response1);
    1. GET my-index-000001/_settings?flat_settings=true&include_defaults=true
    2. GET _cluster/settings?flat_settings=true&include_defaults=true
  4. Change the settings using the update index settings and cluster update settings APIs to the correct values in order to allow the index to be allocated.

For more guidance on fixing the most common causes for unassinged shards please follow this guide.

See this video for a walkthrough of monitoring allocation health.