Failure store
Applies to: Elastic Stack and Serverless
A failure store is a secondary set of indices inside a data stream, dedicated to storing failed documents. A failed document is any document that, without the failure store enabled, would cause an ingest pipeline exception or that has a structure that conflicts with a data stream's mappings. In the absence of the failure store, a failed document would cause the indexing operation to fail, with an error message returned in the operation response.
When a data stream's failure store is enabled, these failures are instead captured in a separate index and persisted so they can be analyzed later. Clients receive a successful response with a flag indicating that the failure was redirected.
Failure stores do not capture failures caused by backpressure or document version conflicts. These failures are always returned as-is since they warrant specific action by the client.
On this page, you'll learn how to set up, use, and manage a failure store, as well as the structure of failure store documents.
For examples of how to use failure stores to identify and fix errors in ingest pipelines and your data, refer to Using failure stores to address ingestion issues.
Each data stream has its own failure store that can be enabled to accept failed documents. By default, this failure store is disabled and any ingestion problems are raised in the response to write operations.
You can specify in a data stream's index template whether the failure store should be enabled when the data stream is first created.
Unlike the settings and mappings fields on an index template, which are repeatedly applied to new data stream write indices on rollover, the data_stream_options section of a template is applied to a data stream only once, when the data stream is first created. To configure existing data streams, use the put data stream options API.
To enable the failure store on a new data stream, enable it in the data_stream_options of the template:
PUT _index_template/my-index-template
{
"index_patterns": ["my-datastream-*"],
"data_stream": { },
"template": {
"data_stream_options": {
"failure_store": {
"enabled": true
}
}
}
}
- The options for the data stream, applied when the data stream is created.
- The failure store feature will be enabled for new data streams that match this template.
After a matching data stream is created, its failure store will be enabled.
Enabling the failure store via index templates can only affect data streams that are newly created. Existing data streams that use a template are not affected by changes to the template's data_stream_options field.
To modify an existing data stream's options, use the put data stream options API:
PUT _data_stream/my-datastream-existing/_options
{
"failure_store": {
"enabled": true
}
}
- The failure store option will now be enabled.
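To confirm the change, you can retrieve the data stream's current options with the get data stream options API. A quick verification step; the response simply echoes the configured options:
GET _data_stream/my-datastream-existing/_options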
You can also use this API to disable failure store redirection. Disabling the failure store only stops the redirection of failed documents. Any existing failure data in the data stream remains until it is manually deleted or expires by reaching its configured retention period.
PUT _data_stream/my-datastream-existing/_options
{
"failure_store": {
"enabled": false
}
}
- Redirecting failed documents into the failure store will now be disabled.
If you have a large number of existing data streams, you may want to enable their failure stores in one place. Instead of updating each data stream's options individually, set the data_streams.failure_store.enabled cluster setting to a list of index patterns. Any data stream that matches one of these patterns will operate with its failure store enabled.
PUT _cluster/settings
{
"persistent" : {
"data_streams.failure_store.enabled" : [ "my-datastream-*", "logs-*" ]
}
}
- Data streams that match my-datastream-* or logs-* will redirect failures to the failure store unless explicitly disabled.
Matching data streams will ignore this configuration if the failure store is explicitly enabled or disabled in their data stream options.
PUT _cluster/settings
{
"persistent" : {
"data_streams.failure_store.enabled" : [ "my-datastream-*", "logs-*" ]
}
}
- Enabling the failure stores for my-datastream-* and logs-*.
PUT _data_stream/my-datastream-1/_options
{
"failure_store": {
"enabled": false
}
}
- The failure store for my-datastream-1 is disabled even though it matches my-datastream-*. The data stream options override the cluster setting.
The failure store is meant to ease the burden of detecting and handling failures when ingesting data to Elasticsearch. Clients are less likely to encounter unrecoverable failures when writing documents, and developers are more easily able to troubleshoot faulty pipelines and mappings.
Once a failure store is enabled for a data stream it will begin redirecting documents that fail due to common ingestion problems instead of returning errors in write operations. Clients are notified in a non-intrusive way when a document is redirected to the failure store.
Each data stream's failure store is made up of a list of indices that are dedicated to storing failed documents. These failure indices function much like a data stream's normal backing indices: There is a write index that accepts failed documents, the indices can be rolled over, and they're automatically cleaned up over time subject to a lifecycle policy. Failure indices are lazily created the first time they are needed to store a failed document.
When a document bound for a data stream encounters a problem during ingestion, the response is annotated with a failure_store field that describes how Elasticsearch responded to that problem. The failure_store field is present on both the bulk and index API responses when applicable. Clients can use this information to adjust their behavior based on the response from Elasticsearch.
Here we have a bulk operation that sends two documents. Both write to the id field, which is mapped as a long. The first document will be accepted, but the second will fail because the value invalid_text cannot be parsed as a long. The second document will be redirected to the failure store:
POST my-datastream-new/_bulk
{"create":{}}
{"@timestamp": "2025-05-01T00:00:00Z", "id": 1234}
{"create":{}}
{"@timestamp": "2025-05-01T00:00:00Z", "id": "invalid_text"}
- A correctly formatted document.
- Invalid document that cannot be parsed using the current mapping.
{
"errors": false,
"took": 400,
"items": [
{
"create": {
"_index": ".ds-my-datastream-new-2025.05.01-000001",
"_id": "YUvQipYB_ZAKuDfZRosB",
"_version": 1,
"result": "created",
"_shards": {
"total": 1,
"successful": 1,
"failed": 0
},
"_seq_no": 3,
"_primary_term": 1,
"status": 201
}
},
{
"create": {
"_index": ".fs-my-datastream-new-2025.05.01-000002",
"_id": "lEu8jZYB_ZAKuDfZNouU",
"_version": 1,
"result": "created",
"_shards": {
"total": 1,
"successful": 1,
"failed": 0
},
"_seq_no": 10,
"_primary_term": 1,
"failure_store": "used",
"status": 201
}
}
]
}
- The response code is 200 OK, and the response body does not report any errors encountered.
- The first document is accepted into the data stream's write index.
- The second document encountered a problem during ingest and was redirected to the data stream's failure store.
- The response is annotated with a field indicating that the failure store was used to persist the second document.
If the document was redirected to a data stream's failure store due to a problem, then the failure_store field on the response will be used, and the response will not return any error information:
{
"_index": ".fs-my-datastream-new-2025.05.01-000002",
"_id": "lEu8jZYB_ZAKuDfZNouU",
"_version": 1,
"result": "created",
"_shards": {
"total": 1,
"successful": 1,
"failed": 0
},
"_seq_no": 11,
"_primary_term": 1,
"failure_store": "used"
}
- The document for this index operation was sent to the failure store's write index.
- The response is annotated with a flag indicating the document was redirected.
If the document could have been redirected to a data stream's failure store but the failure store was disabled, then the failure_store field on the response will be not_enabled, and the response will display the error encountered as normal.
{
"error": {
"root_cause": [
{
"type": "document_parsing_exception",
"reason": "[1:53] failed to parse field [id] of type [long] in document with id 'Y0vQipYB_ZAKuDfZR4sR'. Preview of field's value: 'invalid_text'"
}
],
"type": "document_parsing_exception",
"reason": "[1:53] failed to parse field [id] of type [long] in document with id 'Y0vQipYB_ZAKuDfZR4sR'. Preview of field's value: 'invalid_text'",
"caused_by": {
"type": "illegal_argument_exception",
"reason": "For input string: \"invalid_text\""
},
"failure_store": "not_enabled"
},
"status": 400
}
- The failure is returned to the client as normal when the failure store is not enabled.
- The response is annotated with a flag indicating the failure store could have accepted the document, but it was not enabled.
- The response status is 400 Bad Request due to the mapping problem.
If the document was redirected to a data stream's failure store but that failed document could not be stored (e.g. due to shard unavailability or a similar problem), then the failure_store field on the response will be failed, and the response will display the error for the original failure, as well as a suppressed error detailing why the failure could not be stored:
{
"error": {
"root_cause": [
{
"type": "document_parsing_exception",
"reason": "[1:53] failed to parse field [id] of type [long] in document with id 'Y0vQipYB_ZAKuDfZR4sR'. Preview of field's value: 'invalid_text'",
"suppressed": [
{
"type": "cluster_block_exception",
"reason": "index [.fs-my-datastream-2025.05.01-000002] blocked by: [FORBIDDEN/5/index read-only (api)];"
}
]
}
],
"type": "document_parsing_exception",
"reason": "[1:53] failed to parse field [id] of type [long] in document with id 'Y0vQipYB_ZAKuDfZR4sR'. Preview of field's value: 'invalid_text'",
"caused_by": {
"type": "illegal_argument_exception",
"reason": "For input string: \"invalid_text\""
},
"suppressed": [
{
"type": "cluster_block_exception",
"reason": "index [.fs-my-datastream-2025.05.01-000002] blocked by: [FORBIDDEN/5/index read-only (api)];"
}
],
"failure_store": "failed"
},
"status": 400
}
- The root cause of the problem was a mapping mismatch.
- The document could not be redirected because the failure store was not able to accept writes at this time due to an unforeseeable issue.
- The complete exception tree is present on the response.
- The response is annotated with a flag indicating the failure store would have accepted the document, but it was not able to.
- The response status is 400 Bad Request due to the original mapping problem.
Once you have accumulated some failures, the failure store can be searched much like a regular data stream.
Documents redirected to the failure store in the event of a failed ingest pipeline are stored in their original, unprocessed form. If an ingest pipeline normally redacts sensitive information from a document, failed documents may therefore still contain that sensitive information.
Furthermore, failed documents are likely to be structured differently than normal data in a data stream, and thus special care should be taken when making use of document level security or field level security. Any security policies that expect to utilize these features for both regular documents and failure documents should account for any differences in document structure between the two document types.
To limit visibility on potentially sensitive data, users require the read_failure_store index privilege for a data stream in order to search that data stream's failure store data.
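For example, a role that grants only failure store read access for matching data streams might look like the following sketch (the role name failures_reader is illustrative):
POST _security/role/failures_reader
{
  "indices": [
    {
      "names": [ "my-datastream-*" ],
      "privileges": [ "read_failure_store" ]
    }
  ]
}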
You can search a data stream's failure store using the existing search APIs in Elasticsearch.
To indicate that the search should be performed on failure store data, use the index component selector syntax to indicate which part of the data stream to target in the search operation. Appending the ::failures suffix to the name of the data stream indicates that the operation should be performed against that data stream's failure store instead of its regular backing indices.
POST _query?format=txt
{
"query": """FROM my-datastream::failures | DROP error.stack_trace | LIMIT 1"""
}
- We drop the error.stack_trace field here just to keep the example free of newlines.
An example of a search result with the failed document present:
@timestamp | document.id |document.index |document.routing| error.message |error.pipeline |error.pipeline_trace|error.processor_tag|error.processor_type| error.type
------------------------+--------------------+---------------+----------------+-------------------------------------------------------------------------------------------------------------------------------------+---------------+--------------------+-------------------+--------------------+--------------------------
2025-05-01T12:00:00.000Z|Y0vQipYB_ZAKuDfZR4sR|my-datastream |null |[1:45] failed to parse field [id] of type [long] in document with id 'Y0vQipYB_ZAKuDfZR4sR'. Preview of field's value: 'invalid_text'|null |null |null |null |document_parsing_exception
Because the document.source field is unmapped, it is absent from the ES|QL results.
GET my-datastream::failures/_search
An example of a search result with the failed document present:
{
"took": 0,
"timed_out": false,
"_shards": {
"total": 1,
"successful": 1,
"skipped": 0,
"failed": 0
},
"hits": {
"total": {
"value": 1,
"relation": "eq"
},
"max_score": 1,
"hits": [
{
"_index": ".fs-my-datastream-2025.05.01-000002",
"_id": "lEu8jZYB_ZAKuDfZNouU",
"_score": 1,
"_source": {
"@timestamp": "2025-05-01T12:00:00.000Z",
"document": {
"id": "Y0vQipYB_ZAKuDfZR4sR",
"index": "my-datastream",
"source": {
"@timestamp": "2025-05-01T00:00:00Z",
"id": "invalid_text"
}
},
"error": {
"type": "document_parsing_exception",
"message": "[1:53] failed to parse field [id] of type [long] in document with id 'Y0vQipYB_ZAKuDfZR4sR'. Preview of field's value: 'invalid_text'",
"stack_trace": """o.e.i.m.DocumentParsingException: [1:53] failed to parse field [id] of type [long] in document with id 'Y0vQipYB_ZAKuDfZR4sR'. Preview of field's value: 'invalid_text'
at o.e.i.m.FieldMapper.rethrowAsDocumentParsingException(FieldMapper.java:241)
at o.e.i.m.FieldMapper.parse(FieldMapper.java:194)
... 24 more
Caused by: j.l.IllegalArgumentException: For input string: "invalid_text"
at o.e.x.s.AbstractXContentParser.toLong(AbstractXContentParser.java:189)
at o.e.x.s.AbstractXContentParser.longValue(AbstractXContentParser.java:210)
... 31 more
"""
}
}
}
]
}
}
- The document belongs to a failure store index on the data stream.
- The failure document timestamp is when the failure occurred in Elasticsearch.
- The document that was sent is captured inside the failure document. Failure documents capture the ID of the document at the time of failure, along with which data stream the document was being written to, and the contents of the document. The document.source fields are unmapped to ensure failures are always captured.
- The failure document captures information about the error encountered, like the type of error, the error message, and a compressed stack trace.
POST _sql?format=txt
{
"query": """SELECT * FROM "my-datastream::failures" LIMIT 1"""
}
An example of a search result with the failed document present:
@timestamp | document.id |document.index |document.routing| error.message |error.pipeline |error.pipeline_trace|error.processor_tag|error.processor_type| error.stack_trace | error.type
------------------------+--------------------+---------------+----------------+-------------------------------------------------------------------------------------------------------------------------------------+---------------+--------------------+-------------------+--------------------+-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------+--------------------------
2025-05-05T20:49:10.899Z|sXk1opYBL1dfU_1htCAE|my-datastream |null |[1:45] failed to parse field [id] of type [long] in document with id 'sXk1opYBL1dfU_1htCAE'. Preview of field's value: 'invalid_text'|null |null |null |null |o.e.i.m.DocumentParsingException: [1:45] failed to parse field [id] of type [long] in document with id 'sXk1opYBL1dfU_1htCAE'. Preview of field's value: 'invalid_text'
at o.e.i.m.FieldMapper.rethrowAsDocumentParsingException(FieldMapper.java:241)
at o.e.i.m.FieldMapper.parse(FieldMapper.java:194)
... 19 more
Caused by: j.l.IllegalArgumentException: For input string: "invalid_text"
at o.e.x.s.AbstractXContentParser.toLong(AbstractXContentParser.java:189)
at o.e.x.s.AbstractXContentParser.longValue(AbstractXContentParser.java:210)
... 26 more
|document_parsing_exception
Because the document.source field is unmapped, it is absent from the SQL results.
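The error fields are mapped, so you can also filter failures by their metadata. For example, a sketch of a query that returns only mapping failures:
GET my-datastream::failures/_search
{
  "query": {
    "term": {
      "error.type": "document_parsing_exception"
    }
  }
}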
Failure documents have a uniform structure that is handled internally by Elasticsearch.
@timestamp
- (date) The timestamp at which the document encountered a failure in Elasticsearch.
document
- (object) The document at the time of failure. If the document failed in an ingest pipeline, then the document will be the unprocessed version as it arrived in the original indexing request. If the document failed due to a mapping issue, then the document will be as it was after any ingest pipelines were applied to it.
document.id
- (keyword) The ID of the original document at the time of failure.
document.routing
- (keyword, optional) The routing of the original document at the time of failure, if it was specified.
document.index
- (keyword) The index that the document was being written to when it failed.
document.source
- (unmapped object) The body of the original document. This field is unmapped and only present in the failure document's source. This prevents mapping conflicts in the failure store when redirecting failed documents. If you need to include fields from the original document's source in your queries, use runtime fields on the search request (see the example after this list).
error
- (object) Information about the failure that prevented this document from being indexed.
error.message
- (match_only_text) The error message that describes the failure.
error.stack_trace
- (text) A compressed stack trace from Elasticsearch for the failure.
error.type
- (keyword) The type classification of the failure. Values are the same types returned within failed indexing API responses.
error.pipeline
- (keyword, optional) If the failure occurred in an ingest pipeline, this will contain the name of the pipeline.
error.pipeline_trace
- (keyword, optional array) If the failure occurred in an ingest pipeline, this will contain the list of pipelines that the document had visited up until the failure.
error.processor_tag
- (keyword, optional) If the failure occurred in an ingest processor that is annotated with a tag, the tag contents will be present here.
error.processor_type
- (keyword, optional) If the failure occurred in an ingest processor, this will contain the processor type (e.g. script, append, enrich).
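As noted above, document.source is unmapped, so one way to query its contents is with a runtime field defined on the search request. A minimal sketch that surfaces the failed documents' original id values; the failed_id field name is illustrative:
GET my-datastream::failures/_search
{
  "runtime_mappings": {
    "failed_id": {
      "type": "keyword",
      "script": {
        "source": "def doc = params._source.document; if (doc != null && doc.source != null && doc.source.id != null) { emit(doc.source.id.toString()); }"
      }
    }
  },
  "fields": [ "failed_id" ],
  "query": {
    "term": {
      "failed_id": "invalid_text"
    }
  }
}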
The contents of a failure's document field depend on when the failure occurred during ingestion. When sending data to a data stream, documents can fail in two different phases: during an ingest pipeline and during indexing.
- Documents that fail during an ingest pipeline will store the source of the document as it was originally sent to Elasticsearch. Changes from pipelines are discarded before redirecting the failure.
- Documents that fail during indexing will store the source of the document as it was during the index operation. Any changes from pipelines will be reflected in the source of the document that is redirected.
To help demonstrate the differences between these kinds of failures, we will use the following pipeline and template definition.
PUT _ingest/pipeline/my-datastream-example-pipeline
{
"processors": [
{
"set": {
"override": false,
"field": "@timestamp",
"copy_from": "_ingest.timestamp"
}
},
{
"set": {
"field": "published",
"copy_from": "data"
}
}
]
}
- We use this processor to add a @timestamp to the document if one is missing.
- A simple processor that copies the data field to the published field.
PUT _index_template/my-datastream-example-template
{
"index_patterns": ["my-datastream-ingest*"],
"data_stream": {},
"template": {
"settings": {
"index.default_pipeline": "my-datastream-example-pipeline"
},
"mappings": {
"properties": {
"published": {
"type": "long"
}
}
},
"data_stream_options": {
"failure_store": {
"enabled": true
}
}
}
}
- Sets the pipeline as the default pipeline for the data stream.
- A field of type long to hold our result.
- Failure store is enabled.
During ingestion, documents are first processed by any applicable ingest pipelines. This process modifies a copy of the document and only saves the changes to the original document after all pipelines have completed. If a document is sent to the failure store because of a failure during an ingest pipeline, any changes to the document made by the pipelines it has been through will be discarded before redirecting the failure. This means that the document will be in the same state as when it was originally sent by the client. This has the benefit of being able to see the document before any pipelines have run on it, and allows for the original document to be used in simulate operations to further troubleshoot any problems in the ingest pipeline.
Using the pipeline and template defined above, we will send a document that is missing a required field for the pipeline. The document will fail:
POST my-datastream-ingest/_doc
{
"random": 42
}
- Not the field we're looking for.
{
"_index": ".fs-my-datastream-ingest-2025.05.09-000002",
"_id": "eXS-tpYBwrYNjPmat9Cx",
"_version": 1,
"result": "created",
"_shards": {
"total": 1,
"successful": 1,
"failed": 0
},
"_seq_no": 0,
"_primary_term": 1,
"failure_store": "used"
}
- The document failed and went to the failure store.
Inspecting the corresponding failure document will show the document in its original form as it was sent to Elasticsearch.
GET my-datastream-ingest::failures/_search
{
"took": 0,
"timed_out": false,
"_shards": {
"total": 1,
"successful": 1,
"skipped": 0,
"failed": 0
},
"hits": {
"total": {
"value": 1,
"relation": "eq"
},
"max_score": 1,
"hits": [
{
"_index": ".fs-my-datastream-ingest-2025.05.09-000002",
"_id": "eXS-tpYBwrYNjPmat9Cx",
"_score": 1,
"_source": {
"@timestamp": "2025-05-09T20:31:13.759Z",
"document": {
"index": "my-datastream-ingest",
"source": {
"random": 42
}
},
"error": {
"type": "illegal_argument_exception",
"message": "field [data] not present as part of path [data]",
"stack_trace": """j.l.IllegalArgumentException: field [data] not present as part of path [data]
at o.e.i.IngestDocument.getFieldValue(IngestDocument.java:202)
at o.e.i.c.SetProcessor.execute(SetProcessor.java:86)
... 14 more
""",
"pipeline_trace": [
"my-datastream-example-pipeline"
],
"pipeline": "my-datastream-example-pipeline",
"processor_type": "set"
}
}
}
]
}
}
- The document field shows the state of the document from before any pipeline executions.
- The pipeline failed after the timestamp would have been added.
We can see that the document failed on the second processor in the pipeline. The first processor would have added a @timestamp field, but because the pipeline failed, none of its changes were kept, so the captured document has no @timestamp field.
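Because the failure document preserves the document as it was originally sent, you can replay it through the pipeline with the simulate pipeline API to reproduce and troubleshoot the failure. A sketch using the source captured above:
POST _ingest/pipeline/my-datastream-example-pipeline/_simulate
{
  "docs": [
    {
      "_index": "my-datastream-ingest",
      "_source": {
        "random": 42
      }
    }
  ]
}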
The second phase in which failures can occur is indexing. After documents have been processed by any applicable pipelines, they are parsed using the index mappings before being indexed into the shard. If a document is sent to the failure store due to a failure in this process, it will be stored as it was after any ingest pipelines were applied. This is because, by this point, the original document has already been overwritten by the ingest pipeline changes. This has the benefit of allowing you to see what the document looked like during the mapping and indexing phase of the write operation.
Building on the example above, we send a document that has a text value where we expect a numeric value:
POST my-datastream-ingest/_doc
{
"data": "this field is invalid"
}
- The mappings above expect this field to have been a numeric value.
{
"_index": ".fs-my-datastream-ingest-2025.05.09-000002",
"_id": "sXTVtpYBwrYNjPmaFNAY",
"_version": 1,
"result": "created",
"_shards": {
"total": 1,
"successful": 1,
"failed": 0
},
"_seq_no": 0,
"_primary_term": 1,
"failure_store": "used"
}
- The document failed and was sent to the failure store.
If we obtain the corresponding failure document, we can see that the document stored has had the default pipeline applied to it.
GET my-datastream-ingest::failures/_search
{
"took": 0,
"timed_out": false,
"_shards": {
"total": 1,
"successful": 1,
"skipped": 0,
"failed": 0
},
"hits": {
"total": {
"value": 1,
"relation": "eq"
},
"max_score": 1,
"hits": [
{
"_index": ".fs-my-datastream-ingest-2025.05.09-000002",
"_id": "sXTVtpYBwrYNjPmaFNAY",
"_score": 1,
"_source": {
"@timestamp": "2025-05-09T20:55:38.943Z",
"document": {
"id": "sHTVtpYBwrYNjPmaEdB5",
"index": "my-datastream-ingest",
"source": {
"@timestamp": "2025-05-09T20:55:38.362486755Z",
"data": "this field is invalid",
"published": "this field is invalid"
}
},
"error": {
"type": "document_parsing_exception",
"message": "[1:91] failed to parse field [published] of type [long] in document with id 'sHTVtpYBwrYNjPmaEdB5'. Preview of field's value: 'this field is invalid'",
"stack_trace": """o.e.i.m.DocumentParsingException: [1:91] failed to parse field [published] of type [long] in document with id 'sHTVtpYBwrYNjPmaEdB5'. Preview of field's value: 'this field is invalid'
at o.e.i.m.FieldMapper.rethrowAsDocumentParsingException(FieldMapper.java:241)
at o.e.i.m.FieldMapper.parse(FieldMapper.java:194)
... 24 more
Caused by: j.l.IllegalArgumentException: For input string: "this field is invalid"
at o.e.x.s.AbstractXContentParser.toLong(AbstractXContentParser.java:189)
at o.e.x.s.AbstractXContentParser.longValue(AbstractXContentParser.java:210)
... 31 more
"""
}
}
}
]
}
}
- The document field reflects the document after the ingest pipeline has run.
- The document failed to be indexed because of a mapping mismatch.
The document field attempts to show the effective input to whichever process caused the failure. This gives you all the information you need to reproduce the problem.
Failure data can accumulate in a data stream over time. To help manage this accumulation, most administrative operations that can be done on a data stream can be applied to the data stream's failure store.
A data stream treats its failure store much like a secondary set of backing indices. Multiple dedicated hidden indices serve search requests for the failure store, while one index acts as the current write index. You can use the rollover API to roll over the failure store. Much like the regular indices in a data stream, a new write index will be created in the failure store to accept new failure documents.
POST my-datastream::failures/_rollover
{
"acknowledged": true,
"shards_acknowledged": true,
"old_index": ".fs-my-datastream-2025.05.01-000002",
"new_index": ".fs-my-datastream-2025.05.01-000003",
"rolled_over": true,
"dry_run": false,
"lazy": false,
"conditions": {}
}
Failure stores have their retention managed using an internal data stream lifecycle. By default, a thirty-day (30d) retention period is applied to failure store data. You can view the active lifecycle for a failure store index by calling the get data stream API:
GET _data_stream/my-datastream
{
"data_streams": [
{
"name": "my-datastream",
"timestamp_field": {
"name": "@timestamp"
},
"indices": [
{
"index_name": ".ds-my-datastream-2025.05.01-000001",
"index_uuid": "jUbUNf-8Re-Nca8vJkHnkA",
"managed_by": "Data stream lifecycle",
"prefer_ilm": true,
"index_mode": "standard"
}
],
"generation": 2,
"status": "GREEN",
"template": "my-datastream-template",
"lifecycle": {
"enabled": true
},
"next_generation_managed_by": "Data stream lifecycle",
"prefer_ilm": true,
"hidden": false,
"system": false,
"allow_custom_routing": false,
"replicated": false,
"rollover_on_write": false,
"index_mode": "standard",
"failure_store": {
"enabled": true,
"rollover_on_write": false,
"indices": [
{
"index_name": ".fs-my-datastream-2025.05.05-000002",
"index_uuid": "oYS2WsjkSKmdazWuS4RP9Q",
"managed_by": "Data stream lifecycle"
}
],
"lifecycle": {
"enabled": true,
"effective_retention": "30d", <3>
"retention_determined_by": "default_failures_retention"
}
}
}
]
}
- Information about the failure store is presented in the response under its own field.
- Indices are managed by data stream lifecycles by default.
- An effective retention period of thirty days (30d) is present by default.
- The retention is currently determined by the default.
The default retention respects any maximum retention values. If maximum retention is configured lower than thirty days then the maximum retention will be used as the default value.
You can update the default retention period for failure stores in your deployment by updating the data_streams.lifecycle.retention.failures_default cluster setting. New and existing data streams that have no retention configured on their failure stores will use this value to determine their retention period.
PUT _cluster/settings
{
"persistent": {
"data_streams.lifecycle.retention.failures_default": "15d"
}
}
You can also specify the failure store retention period for a data stream in its data stream options. These can be specified via the index template for new data streams, or via the put data stream options API for existing data streams.
PUT _data_stream/my-datastream/_options
{
"failure_store": {
"enabled": true,
"lifecycle": {
"data_retention": "10d"
}
}
}
- Ensure that the failure store remains enabled.
- Set only this data stream's failure store retention to ten days.
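For new data streams, the same retention can be declared up front in the index template's data_stream_options, mirroring the options body above. A sketch based on the earlier template example:
PUT _index_template/my-index-template
{
  "index_patterns": ["my-datastream-*"],
  "data_stream": { },
  "template": {
    "data_stream_options": {
      "failure_store": {
        "enabled": true,
        "lifecycle": {
          "data_retention": "10d"
        }
      }
    }
  }
}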
Failure stores support adding and removing indices using the modify data stream API.
POST _data_stream/_modify
{
"actions":[
{
"remove_backing_index": {
"data_stream": "my-datastream",
"index": ".fs-my-datastream-2025.05.05-000002",
"failure_store": true
}
},
{
"add_backing_index": {
"data_stream": "my-datastream",
"index": "restored-failure-index",
"failure_store": true
}
}
]
}
- Action to remove a backing index.
- The name of an auto-generated failure store index that should be removed.
- Set failure_store to true to have the action operate on the data stream's failure store.
- Action to add a backing index.
- The name of an index that should be added to the failure store.
- Set failure_store to true to have the action operate on the data stream's failure store.
This API gives you fine-grained control over the indices in your failure store, allowing you to manage backup and restoration operations as well as isolate failure data for later remediation.