CAPEC-492: Regular Expression Exponential Blowup

ID CAPEC-492
Status Draft

An adversary may execute an attack on a program that uses a poor Regular Expression(Regex) implementation by choosing input that results in an extreme situation for the Regex. A typical extreme situation operates at exponential time compared to the input size. This is due to most implementations using a Nondeterministic Finite Automaton(NFA) state machine to be built by the Regex algorithm since NFA allows backtracking and thus more complex regular expressions.

The algorithm builds a finite state machine and based on the input transitions through all the states until the end of the input is reached. NFA engines may evaluate each character in the input string multiple times during the backtracking. The algorithm tries each path through the NFA one by one until a match is found; the malicious input is crafted so every path is tried which results in a failure. Exploitation of the Regex results in programs hanging or taking a very long time to complete. These attacks may target various layers of the Internet due to regular expressions being used in validation.

https://capec.mitre.org/data/definitions/492.html

Weaknesses

# ID Name Type
CWE-400 Uncontrolled Resource Consumption weakness
CWE-1333 Inefficient Regular Expression Complexity weakness

Taxonomiy Mapping

Type # ID Name
OWASP Attacks Regular expression Denial of Service - ReDoS
Loading...