New Features
- Added six possible Greedy Abstractions to results.
- Added DNA/Protein sequence alignment, expressed as an Optimal Consensus Regular Expression. This functionality is still in Beta.
- Improved algorithm results:
  - Optimal Exact Match is now learned (as the first regex).
  - Expansion Factor is displayed for each regex: Expansion Factor = (number of strings matched) / (number of original input strings).
  - Abstraction Type is now displayed.
  - Significant Length is now displayed, and the significant characters that are counted are shown in red in the regex.
- New Easy Registration/Login.
- Refund Policy: We are confident MLREGEX will learn the most optimal regex for your input strings. If it does not, we will refund you the cost of the specific learn event. See Refund Policy.
To demonstrate this, here are two actual results emails:
Example 1: Simple example with 2 input strings and a choice of 4 learned regexes (the Optimal Exact Match and 3 Matches):
MLREGEX results 2025-06-12 15:26:19 UTC
Your Learn Event ID:
5a7e3e89-f668-4c97-92f1-16020dc8d843
Your Learn Event Name/Description:
Coffee and Tea
Your set of input strings (2)
coffee
tea
Learned Regexes (4)
1. MOST OPTIMAL EXACT MATCH
ABSTRACTION TYPE: NONE
EXPANSION FACTOR: 1.0X
SIGNIFICANT LENGTH: 7
cof{2}e{2}|tea
2. MATCH
ABSTRACTION TYPE: Structural
EXPANSION FACTOR: 4.0X
SIGNIFICANT LENGTH: 6
(cof{2}|t)e{1,2}a?
3. MATCH
ABSTRACTION TYPE: Greedy Letter
SIGNIFICANT LENGTH: 0
\w{3}(\w{3})?
4. MATCH
ABSTRACTION TYPE: Greedy Letter Structural
SIGNIFICANT LENGTH: 0
(\w{6}|\w{3})
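The Expansion Factor values reported for regexes 1 and 2 in Example 1 can be reproduced by hand. The Python sketch below is only an illustration, not MLREGEX's own method: it brute-forces every string over the letters of the input strings, up to a hand-picked maximum length of 7, and applies the Expansion Factor formula from the feature list. The helper names `matched_strings` and `expansion_factor` are hypothetical, and the approach only works for regexes built from literal characters (it would undercount `\w` or `\d`).

```python
import re
from itertools import product


def matched_strings(pattern, alphabet, max_len):
    """Return every string over `alphabet`, up to `max_len` long, that the regex fully matches."""
    compiled = re.compile(pattern)
    found = set()
    for length in range(max_len + 1):
        for chars in product(alphabet, repeat=length):
            candidate = "".join(chars)
            if compiled.fullmatch(candidate):
                found.add(candidate)
    return found


def expansion_factor(pattern, inputs, max_len):
    """(number of strings matched) / (number of original input strings)."""
    # Assumption: the regex only uses characters that occur in the inputs.
    alphabet = sorted(set("".join(inputs)))
    language = matched_strings(pattern, alphabet, max_len)
    # A learned regex must still match every original input string.
    assert set(inputs) <= language
    return len(language) / len(inputs)


inputs = ["coffee", "tea"]
exact = expansion_factor(r"cof{2}e{2}|tea", inputs, max_len=7)
structural = expansion_factor(r"(cof{2}|t)e{1,2}a?", inputs, max_len=7)
print(f"{exact:.1f}X {structural:.1f}X")  # prints "1.0X 4.0X"
```

The Structural regex matches 8 strings (coffe, coffee, coffea, coffeea, te, tee, tea, teea) for 2 inputs, which gives the reported 4.0X.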
Example 2: URL example with 13 input strings and a choice of 13 learned regexes (The Optimal Exact Match, 1 Exact Match and 11 Matches):
MLREGEX results 2025-06-12 20:26:44 UTC
Your Learn Event ID:
c8fab0fa-c39e-445f-966d-f331f51fd6f4
Your Learn Event Name/Description:
URLs
Your set of input strings (13)
http://1.alpha.com
http://2.alpha.com
http://3.alpha.com
http://4.beta.com
http://5.beta.com
http://6.beta.org
http://7.beta.org
https://1.alpha.com
https://2.alpha.com
https://3.alpha.com
https://4.beta.com
https://5.beta.com
https://6.alpha.org
Learned Regexes (13)
1. MOST OPTIMAL EXACT MATCH
ABSTRACTION TYPE: Digit Ranges 1
EXPANSION FACTOR: 1.0X
SIGNIFICANT LENGTH: 46
ht{2}p(s?:/{2}([1-3]\.alph|[4-5]\.bet)a\.com|(s:/{2}6\.alph|:/{2}[6-7]\.bet)a\.org)
2. EXACT MATCH
ABSTRACTION TYPE: NONE
EXPANSION FACTOR: 1.0X
SIGNIFICANT LENGTH: 47
ht{2}p(s?:/{2}((1|2|3)\.alph|(4|5)\.bet)a\.com|(s:/{2}6\.alph|:/{2}(6|7)\.bet)a\.org)
3. MATCH
ABSTRACTION TYPE: Structural
EXPANSION FACTOR: 8.6X
SIGNIFICANT LENGTH: 28
ht{2}ps?:/{2}(1|2|3|4|5|6|7)\.(alph|bet)a\.c?o(rg|m)
4. MATCH
ABSTRACTION TYPE: Digit Class 1
EXPANSION FACTOR: 3.9X
SIGNIFICANT LENGTH: 40
ht{2}p(s?:/{2}(\d\.alph|\d\.bet)a\.com|(s:/{2}6\.alph|:/{2}\d\.bet)a\.org)
5. MATCH
ABSTRACTION TYPE: Word Class 1
EXPANSION FACTOR: 24.3X
SIGNIFICANT LENGTH: 40
ht{2}p(s?:/{2}(\w\.alph|\w\.bet)a\.com|(s:/{2}6\.alph|:/{2}\w\.bet)a\.org)
6. MATCH
ABSTRACTION TYPE: Word Class 2
EXPANSION FACTOR: 67237.4X
SIGNIFICANT LENGTH: 31
ht{2}p(s?:/{2}\w{5,6}a\.com|(s:/{2}6\.alph|:/{2}\w\.bet)a\.org)
7. MATCH
ABSTRACTION TYPE: Character Ranges 2
EXPANSION FACTOR: 76923.1X
SIGNIFICANT LENGTH: 19
ht{2}p[/1-7:a-ceg-hl-mo-pr-t\.]{13,15}
8. MATCH
ABSTRACTION TYPE: Greedy Digit
SIGNIFICANT LENGTH: 38
ht{2}p(s?:/{2}\d\.(alph|bet)a\.com|(s:/{2}\d\.alph|:/{2}\d\.bet)a\.org)
9. MATCH
ABSTRACTION TYPE: Greedy Digit Structural
SIGNIFICANT LENGTH: 21
ht{2}ps?:/{2}\d\.(alph|bet)a\.c?o(rg|m)
10. MATCH
ABSTRACTION TYPE: Greedy Letter
SIGNIFICANT LENGTH: 65
(\w{4}:/{2}1\.\w{5}\.\w{3}|\w{4}:/{2}2\.\w{5}\.\w{3}|\w{4}:/{2}3\.\w{5}\.\w{3}|\w{4}:/{2}4\.\w{4}\.\w{3}|\w{4}:/{2}5\.\w{4}\.\w{3}|\w{4}:/{2}6\.\w{4}\.\w{3}|\w{4}:/{2}7\.\w{4}\.\w{3}|\w{5}:/{2}1\.\w{5}\.\w{3}|\w{5}:/{2}2\.\w{5}\.\w{3}|\w{5}:/{2}3\.\w{5}\.\w{3}|\w{5}:/{2}4\.\w{4}\.\w{3}|\w{5}:/{2}5\.\w{4}\.\w{3}|\w{5}:/{2}6\.\w{5}\.\w{3})
11. MATCH
ABSTRACTION TYPE: Greedy Letter Structural
SIGNIFICANT LENGTH: 11
\w{4,5}:/{2}(1|2|3|4|5|6|7)\.\w{4,5}\.\w{3}
12. MATCH
ABSTRACTION TYPE: Greedy Digit and Letter
SIGNIFICANT LENGTH: 4
\w?\w{4}:/{2}\d\.\w?\w{4}\.\w{3}
13. MATCH
ABSTRACTION TYPE: Greedy Digit and Letter Structural
SIGNIFICANT LENGTH: 4
\w{4,5}:/{2}\d\.\w{4,5}\.\w{3}
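The Significant Length values in both emails are consistent with a simple counting rule: each literal character counts once (including escaped literals such as `\.` and the endpoints of ranges such as `[1-3]`), while quantifiers, grouping and alternation symbols, and the `\w` and `\d` classes count as zero. This rule is inferred from the reported numbers and is not a documented MLREGEX algorithm. A minimal Python sketch under that assumption, with a hypothetical `significant_length` helper and a tokenizer that covers only the regex syntax appearing in these emails:

```python
import re

# Tokenizer for the regex subset seen in the emails (an assumption, not a full parser).
TOKEN = re.compile(
    r"""
    (?P<quantifier>\{\d+(?:,\d*)?\})   # repetition counts such as {2} or {13,15}
  | (?P<class_escape>\\[wd])           # \w and \d are abstractions, not literals
  | (?P<escaped_literal>\\.)           # an escaped character such as \. is one literal
  | (?P<meta>[()\[\]|?\-])             # grouping, alternation, optional, range dash
  | (?P<literal>.)                     # anything else is a literal character
    """,
    re.VERBOSE,
)


def significant_length(pattern):
    """Count literal characters under the inferred rule described above."""
    return sum(
        1
        for token in TOKEN.finditer(pattern)
        if token.lastgroup in ("escaped_literal", "literal")
    )


# Regexes copied from Example 2, paired with the Significant Length each email line reports.
reported = {
    r"ht{2}p(s?:/{2}([1-3]\.alph|[4-5]\.bet)a\.com|(s:/{2}6\.alph|:/{2}[6-7]\.bet)a\.org)": 46,
    r"ht{2}ps?:/{2}(1|2|3|4|5|6|7)\.(alph|bet)a\.c?o(rg|m)": 28,
    r"ht{2}p[/1-7:a-ceg-hl-mo-pr-t\.]{13,15}": 19,
    r"\w{4,5}:/{2}\d\.\w{4,5}\.\w{3}": 4,
}

for pattern, expected in reported.items():
    print(significant_length(pattern), expected)
```

Each printed pair should agree. A lower Significant Length means fewer literal characters survive in the regex, so the regex is more abstract.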